zoom
ef2677b0a6
Make MatMul layer support 3d or 4d operation with const input
2022-11-10 11:41:44 +08:00
zoom
11d492b0b9
Let part of the operators in nary_eltwise support cuda
2022-11-02 14:08:21 +08:00
Zihao Mu
17f2b56291
remove never used code in onnximporter
2022-11-02 10:45:16 +08:00
Alexander Alekhin
ee9137f176
Merge pull request #22725 from zihaomu:fix_infinit_loop_in_tf
2022-10-31 17:03:03 +00:00
Zihao Mu
903bf0147e
Merge pull request #22666 from zihaomu:support_onnx_qdq_model
...
DNN: let Quant and Dequant of ONNX_importer support the Constant input.
* let Quant and Dequant support the Constant input.
* fix negative value of axis.
2022-10-31 16:06:31 +00:00
Zihao Mu
18fbb72f7d
fix the infinite loop in tf importer.
2022-10-31 20:10:25 +08:00
Alexander Smorkalov
22f8fb4d5c
Do not fail tests in Yolo v7 model was not found.
2022-10-24 17:59:18 +03:00
Alexander Smorkalov
23edec83fb
Merge pull request #22667 from zihaomu:bug_fix_in_winograd
...
DNN: bug fixed in Winograd
2022-10-21 17:54:13 +03:00
Alexander Smorkalov
e4cd430710
Merge pull request #22653 from WanliZhong:issue22597
...
DNN-TF: let StridedSlice layer support const input
2022-10-21 17:51:00 +03:00
Dmitry Kurtaev
35b2cff295
Merge pull request #22656 from dkurt:halide_fixes
...
* Fixes for Halide
* Enable some Halide tests
2022-10-21 17:49:49 +03:00
Zihao Mu
cee8c86b6e
fixed bug at winograd of SIMD128 and more robust code.
2022-10-21 19:14:54 +08:00
Alexander Smorkalov
5d292826b2
Merge pull request #22593 from zihaomu:optimize_wino
...
optimize winograd futher more
2022-10-19 13:08:32 +03:00
Alexander Smorkalov
f378f02954
Merge pull request #22652 from rogday:cuda_test_fixes
...
Address CUDA-related errors
2022-10-19 09:37:12 +03:00
Smirnov Egor
dd14cf6a9c
address CUDA-related errors and enable cuda in elementwise ops
2022-10-18 16:54:42 +03:00
Alexander Smorkalov
ec7fc5adca
Merge pull request #22529 from fengyuentau:scatter_scatternd
...
DNN: supports Scatter and ScatterND from ONNX
2022-10-17 14:57:46 +03:00
Alexander Smorkalov
02143cd0e2
Merge pull request #22531 from zihaomu:stop_rely_name
...
Parsing quantized nodes does not rely on names
2022-10-17 11:20:24 +03:00
Alexander Smorkalov
1c5dcbcac8
Merge pull request #22639 from WanliZhong:issue#22625
...
DNN: Make Unsqueeze layer support negative axes
2022-10-17 09:27:49 +03:00
fengyuentau
d24d8f2abe
implementation of scatter and scatternd with conformance tests enabled
2022-10-17 11:30:32 +08:00
Alexander Alekhin
762481411d
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-10-15 16:44:47 +00:00
zoom
d816442e4d
Make Unsqueeze layer support negative axes.
2022-10-14 18:00:19 +08:00
Zihao Mu
0fa43e3aac
Optimize the winograd futher more.
2022-10-14 10:15:45 +08:00
zoom
9119692bb8
let StridedSlice layer support const input
2022-10-12 11:50:44 +08:00
Alexander Smorkalov
ec26541771
Merge pull request #22577 from zihaomu:Disable_winograd_branch_in_tryquantize
...
DNN: add enableWinograd API for Net
2022-10-11 09:44:00 +03:00
Zihao Mu
d9eff7daeb
parse quantized nodes does not rely on name.
2022-10-10 17:08:46 +08:00
Alexander Smorkalov
3419e64dcf
Merge pull request #22611 from zihaomu:greaterOrEqual
...
DNN: support GreaterOrEqual and LessOrEqual op in ONNX
2022-10-10 11:43:44 +03:00
Zihao Mu
1e2ceca4df
add enableWinograd API for Net.
2022-10-09 09:33:07 +08:00
Alexander Alekhin
347246901e
Merge pull request #21745 from alalek:dnn_plugin_openvino
2022-10-08 22:32:25 +00:00
Zihao Mu
9821fae59d
add greater_or_equal and less_or_equal ONNX support
2022-10-08 15:51:40 +08:00
Alexander Alekhin
43b2bb2c25
dnn: plugin support for OpenVINO
2022-10-07 16:57:31 +00:00
Alexander Smorkalov
96844b0ca5
Merge pull request #22554 from WanliZhong:slice_axes_no_seq
...
DNN: Let Slice layer support non-sequential and negative axes
2022-10-03 10:15:55 +03:00
zoom
4557971481
enhance slice layer
...
refactor the code for parsing Slice layer
add test for Slice layer
let 'begin' and 'end' resize to dims
add opset message comment
2022-10-01 17:12:07 +08:00
Zihao Mu
15cfafb360
DNN: Remove unused code in onnx_importer.cpp
2022-09-29 10:53:43 +08:00
Voron
cbf43a54fb
added opencv for openvino tutorial
2022-09-28 12:05:28 +02:00
Alexander Smorkalov
a6274647a4
Merge pull request #21738 from rogday:gather
...
add Gather implementation
2022-09-19 16:21:14 +03:00
Egor Smirnov
65f71ce2eb
add Gather implementation
2022-09-19 15:06:44 +03:00
Alexander Smorkalov
6aefb8e86f
Merge pull request #22290 from fengyuentau:naive_yolov7
...
Support for YOLOv7 ONNX (not simplified)
2022-09-19 14:43:18 +03:00
fengyuentau
4aef9b1c93
dnn: support yolov7 (not simplified)
2022-09-19 18:38:03 +08:00
Alexander Smorkalov
e1e9261450
Merge pull request #22479 from scottchou007:master
...
Fix issues in opencv_test_dnn from conv48 kernels without bias
2022-09-16 09:05:55 +03:00
scottchou007
a3cb2020bc
Fix issues in opencv_test_dnn from conv48 kernels using uninitialized tensors when there is no bias.
2022-09-15 13:41:27 -07:00
Alexander Alekhin
65bdb3a544
dnn: eliminate GCC12 warning in total() call
2022-09-14 11:37:00 +00:00
Alexander Smorkalov
c2c8da2517
Merge pull request #22448 from Ichini24:reshape-permutations-fix
...
changed names of permutations if Reshpe is in NHWC
2022-09-13 09:24:56 +03:00
wxsheng
4154bd0667
Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX
...
* Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX
* Add resize.lasx.cpp for Loongson SIMD acceleration
* Add imgwarp.lasx.cpp for Loongson SIMD acceleration
* Add LASX acceleration support for dnn/conv
* Add CV_PAUSE(v) for Loongarch
* Set LASX by default on Loongarch64
* LoongArch: tune test threshold for Core/HAL.mat_decomp/15
Co-authored-by: shengwenxue <shengwenxue@loongson.cn>
2022-09-10 09:39:43 +03:00
Alexander Alekhin
ca7f964104
dnn: use inheritance for OpenVINO net impl
2022-09-06 18:05:00 +00:00
anton
337452b4c0
changed names of permutations if Reshpe is in NHWC
2022-09-03 19:02:41 +02:00
Zihao Mu
b69b1eae8f
fix bug 22450
2022-09-02 16:30:06 +08:00
Alexander Smorkalov
70fb1cd603
Merge pull request #22440 from zihaomu:fix_conv_bug
2022-08-30 07:01:05 +00:00
Alexander Smorkalov
d2c48b898c
Merge pull request #22306 from zihaomu:qgemm_and_squeeze_opset13_onnximporter
2022-08-30 06:33:57 +00:00
Zihao Mu
2d837efba7
add qgemm and squeeze op13 supported on ONNXImporter
2022-08-30 09:50:29 +08:00
Alexander Smorkalov
1fd45a1b85
Merge pull request #22362 from fengyuentau:conv_asym_pad_fuse
...
Remove asymmetric padding in Conv layer since it is supported in CPU backend
2022-08-29 17:56:17 +03:00
Zihao Mu
2cd7e17b65
replace v_add with +
2022-08-29 17:15:35 +08:00
Alexander Smorkalov
2619099fe5
Merge pull request #22337 from zihaomu:load_ONNX_fp16_as_fp32
...
DNN: load fp16 ONNX model as fp32
2022-08-29 09:32:25 +03:00
fengyuentau
2959286eb5
tengine: supports conv with asymmetric padding
2022-08-29 02:51:26 +00:00
Zihao Mu
9638e34ab0
reuse WORDS_BIGENDIAN.
2022-08-27 07:42:38 +08:00
Zihao Mu
bb64db98d8
Further optimization of Conv2D, fused Conv_Add_Activation, bring latest code from ficus OpConv.fx. ( #22401 )
2022-08-26 12:57:25 +03:00
Zihao Mu
7eaec9dd22
load fp16 as fp32 and align fp16 and double in onnx_graph_simplifie
2022-08-26 10:04:44 +08:00
Zihao Mu
5e92bf8e41
support silu activation in darknet
2022-08-22 10:51:29 +08:00
Alexander Alekhin
2ebdc04787
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-08-14 15:50:42 +00:00
fengyuentau
0cdff46725
tune for opencl
2022-08-14 17:47:48 +08:00
Alexander Alekhin
d0d115321d
Merge pull request #22350 from alalek:rework_psabi_warning
2022-08-13 15:05:41 +00:00
Alexander Smorkalov
bb71cb200e
Merge pull request #22199 from zihaomu:bug_fix_22195
...
DNN: Reduce Layer (add dynamic batch and ReduceSum support)
2022-08-11 12:59:51 +03:00
fengyuentau
e7e814fa8c
remove asymmetric padding checks
2022-08-10 19:52:44 +08:00
Alexander Alekhin
44b2f9637a
Revert "suppress warning on GCC 7 and later"
...
This reverts commit a630ad73cb
.
2022-08-07 15:43:10 +03:00
Alexander Smorkalov
b2b7193374
Merge pull request #22311 from zihaomu:layer_fused_optmized_mish
...
DNN: add another two Mish activation to onnx_graph_simplifier
2022-08-05 14:22:06 +03:00
Zihao Mu
0614c40b42
add more skip for very long test case in test_dnn.
2022-08-02 14:58:05 +08:00
Zihao Mu
d4640f4647
support ReduceLayer without reshape layer.
2022-08-02 10:32:31 +08:00
Zihao Mu
57545653b1
replace new mish impl with softplus
2022-07-28 13:19:06 +08:00
Zihao Mu
3c5377ca1b
add another Mish graph simplifier.
2022-07-28 11:21:29 +08:00
HAN Liutong
e2bfe0ce76
Use "#if" instead of "#ifdef" for CV_SIMD128.
2022-07-21 03:23:57 +00:00
Zihao Mu
98c33c605d
batchsize dynamic is set to index 0.
2022-07-20 19:02:16 +08:00
rogday
ed69bcae2d
Merge pull request #21865 from rogday:nary_eltwise_layers
...
Reimplementation of Element-wise layers with broadcasting support
* init
* semi-working initial version
* add small_vector
* wip
* remove smallvec
* add nary function
* replace auto with Mat in lambda expr used in transform
* uncomment asserts
* autobuffer shape_buf & step_buf
* fix a missing bracket
* fixed a missing addLayer in parseElementWise
* solve one-dimensional broadcast
* remove pre_broadcast_transform for the case of two constants; fix missing constBlobsExtraInfo when addConstant is called
* one autobuffer for step & shape
* temporal fix for the missing original dimension information
* fix parseUnsqueeze when it gets a 1d tensor constant
* support sum/mean/min/max with only one input
* reuse old code to handle cases of two non-constant inputs
* add condition to handle div & mul of two non-constant inputs
* use || instead of or
* remove trainling spaces
* enlarge buf in binary_forward to contain other buffer
* use autobuffer in nary_forward
* generate data randomly and add more cases for perf
* add op and, or & xor
* update perf_dnn
* remove some comments
* remove legacy; add two ONNX conformance tests in filter
* move from cpu_denylist to all_denylist
* adjust parsing for inputs>=2
Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>
2022-07-19 06:14:05 +03:00
fengyuentau
1c7b71bf9e
define data_layout as unknown for pack
2022-07-14 19:27:20 +08:00
Zihao Mu
1b8fba8e26
support ReduceSum with two input and dynamic shape batch size in ReduceLayer.
2022-07-13 13:46:16 +08:00
Zihao Mu
45fbb67aba
fix scale layer can not handle 1x1 weight correctly.
2022-07-13 11:25:27 +08:00
Zihao Mu
139c443770
Merge pull request #22183 from zihaomu:fastConv_ARMv7_compatible
...
DNN: ARMv7 compatible fastConv
* support armv7 on fastConv
* remove whitespace.
2022-07-07 13:23:08 +03:00
Tomoaki Teshima
a630ad73cb
suppress warning on GCC 7 and later
2022-07-06 23:31:31 +09:00
Zihao Mu
a80fcacd90
Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor
...
Add per_tensor_quantize to int8 quantize
* add per_tensor_quantize to dnn int8 module.
* change api flag from perTensor to perChannel, and recognize quantize type and onnx importer.
* change the default to hpp
2022-07-05 19:14:42 +03:00
Zihao Mu
59b870a87a
Merge pull request #21910 from zihaomu:fast_conv_ARM
...
DNN: Accelerating convolution
* Fast Conv of ARM, X86 and universal intrinsics.
* improve code style.
* error fixed.
* improve the License
* optimize memory allocated and Adjust the threshold.
* change FasterRCNN_vgg16 to 2GB memory.
2022-07-01 13:03:15 +03:00
Zihao Mu
ef94275eb6
bug fixed of GEMM node in ONNX_importer
2022-06-22 21:08:48 +08:00
Wanli
a6ca48a1c2
Merge pull request #22100 from WanliZhong:issue_22015
...
Fix issue 22015, let Clip layer support 1-3 inputs
* Fix issue 22015.
Let layer Clip support 1-3 inputs.
* Resolve other problems caused by modifications
* Update onnx_importer.cpp
added extra checks to min/max handling in Clip
* Add assertions to check the size of the input
* Add test for clip with min and max initializers
* Separate test for "clip_init_min_max". Change the check method for input_size to provide a clearer message in case of problem.
* Add tests for clip with min or max initializers
* Change the implementation of getting input
Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>
2022-06-22 14:21:16 +03:00
Zihao Mu
2411b825b4
bug fixed of GEMM node in ONNX_importer
2022-06-22 15:00:17 +08:00
Alexander Alekhin
583bd1a6e2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-06-04 19:10:35 +00:00
Namgoo Lee
24547f40ff
remove const from functions returning by value
2022-05-26 21:30:41 +09:00
Alexander Alekhin
e9187ae38c
Merge pull request #22026 from alalek:update_version_3.4.18-pre
2022-05-24 20:23:28 +00:00
Alexander Alekhin
978dc76653
Merge pull request #22006 from rogday:21947_fix
2022-05-24 19:26:02 +00:00
rogday
a2ad997e97
fix vector access in TF::sortByExecutionOrder
2022-05-24 00:05:13 +03:00
Alexander Alekhin
e9428726ca
pre: OpenCV 4.6.0 (version++)
2022-05-23 19:25:16 +00:00
Alexander Alekhin
400906b433
pre: OpenCV 3.4.18 (version++)
2022-05-23 19:18:02 +00:00
berak
50d7c61c01
Update darknet_importer.cpp
...
make it more obvious, that this is a '404', not a 'parsing' problem
2022-05-23 19:18:31 +02:00
Alexander Alekhin
d9bf522b27
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-23 16:06:14 +00:00
rogday
93dc0679ec
Merge pull request #21818 from rogday:revert_renaming
...
* add prefixes to layer names and layer output names
* dnn: OPENCV_DNN_ONNX_USE_LEGACY_NAMES runtime parameter
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2022-05-23 14:50:42 +00:00
Alexander Alekhin
bb5462e327
Merge pull request #21991 from fengyuentau:qconv_asympad
2022-05-19 17:20:04 +00:00
fengyuentau
ff88132620
support asymmetric paddings for qconv
2022-05-16 19:01:37 +08:00
OpenCV Developers
d9a444ca1a
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-14 11:23:21 +00:00
Yulv-git
15ac54d5d6
Fix some typos in modules/.
2022-04-30 13:40:07 +08:00
Zihao Mu
64ded50bbf
parsing depth2space and space2depth of ONNX importer
2022-04-29 10:17:02 +08:00
rogday
9cd5a0a1e6
Merge pull request #21884 from rogday:cuda_cleanup
...
Fix CUDA compilation issues and adjust thresholds.
* Fix CUDA compilation issues and adjust thresholds.
* add conformance tests to denylist
2022-04-19 16:40:25 +00:00
OpenCV Developers
2985739b8c
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-04-16 14:41:15 +00:00
rogday
a2b84e9897
add assert to tf graph simplifier to address security concerns
2022-04-13 22:50:27 +03:00
OpenCV Pushbot
66f3c2673c
Merge pull request #21831 from zihaomu:sign_layer_onnx
...
DNN: Add sign, shrink and reciprocal for onnx_impoter
2022-04-13 17:08:30 +00:00
OpenCV Pushbot
03c9648f2e
Merge pull request #21854 from opencv-pushbot:dnn_test_update_checks_face_detector_4.x
2022-04-12 17:20:22 +00:00