Maksim Shabunin
d35fbe6bfc
dnn: updated YOLOv4-tiny model and tests
2022-12-22 15:49:21 +03:00
Alexander Alekhin
6b4f3e5fab
Merge pull request #22993 from alalek:fixup_21738
2022-12-21 19:50:51 +00:00
Yuantao Feng
a2b3acfc6e
dnn: add the CANN backend ( #22634 )
...
* cann backend impl v1
* cann backend impl v2: use opencv parsers to build models for cann
* adjust fc according to the new transA and transB
* put cann net in cann backend node and reuse forwardLayer
* use fork() to create a child process and compile cann model
* remove legacy code
* remove debug code
* fall bcak to CPU backend if there is one layer not supoorted by CANN backend
* fix netInput forward
2022-12-21 09:04:41 +03:00
Alexander Alekhin
cdbb893b27
dnn: disable OpenCL code path in MatMul processing
...
- this mode is not supported by 22828
2022-12-20 09:46:48 +00:00
Alexander Alekhin
1102b7eff8
dnn: fix gather layer implementation
...
- support FP16 data
2022-12-20 06:09:34 +00:00
zoom
4891818114
make MatMul support 3D or 4D with broadcast
2022-12-15 10:36:08 +08:00
Alexander Alekhin
8ba44e7d55
Merge pull request #22882 from zihaomu:gemm_first_const
2022-12-08 14:18:33 +00:00
Zihao Mu
0a650b573b
Merge pull request #22840 from zihaomu:optimze_conv_memory_usage
...
DNN: reduce the memory used in convolution layer
* reduce the memory in winograd and disabel the test when usage memory is larger than 2gb.
* remove VERY_LOG tag
2022-12-08 12:57:13 +00:00
Alexander Alekhin
b16f76eede
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-12-03 12:39:41 +00:00
Alexander Alekhin
d16b3b2487
dnn(test): restore openvino tests with 'Cannot get memory' message
2022-12-03 01:34:48 +00:00
Alexander Alekhin
74d0b4cc78
dnn(openvino): fix custom layers BlockingDesc
2022-12-03 01:34:10 +00:00
Alexander Smorkalov
e14ca39fd7
Merge pull request #22857 from fengyuentau:batched_nms
...
dnn: add batched nms
2022-11-30 12:37:49 +03:00
Alexander Smorkalov
421ba8730a
Merge pull request #22809 from fengyuentau:tile
...
dnn: support ONNX Tile
2022-11-29 14:42:28 +03:00
zihaomu
0d56524b72
gemm support transA and transB, and first input is constance.
2022-11-29 17:13:36 +08:00
fengyuentau
9fded9ca53
batched nms impl
2022-11-29 15:32:34 +08:00
fengyuentau
441624a5fb
tile impl
2022-11-29 11:15:38 +08:00
zoom
5044af69d1
let MatMul can work when both two inputs are const
2022-11-27 17:32:41 +08:00
Alexander Smorkalov
6ca205a029
Merge pull request #22478 from WanliZhong:nary_eltwise_cuda
...
DNN: Let part of the operators in nary_eltwise support CUDA
2022-11-22 16:15:50 +03:00
zihaomu
5bf64e7dfe
fix the infinite loop in tf importer of 3.4 branch
2022-11-15 11:42:10 +08:00
zoom
ef2677b0a6
Make MatMul layer support 3d or 4d operation with const input
2022-11-10 11:41:44 +08:00
zoom
11d492b0b9
Let part of the operators in nary_eltwise support cuda
2022-11-02 14:08:21 +08:00
Zihao Mu
17f2b56291
remove never used code in onnximporter
2022-11-02 10:45:16 +08:00
Alexander Alekhin
ee9137f176
Merge pull request #22725 from zihaomu:fix_infinit_loop_in_tf
2022-10-31 17:03:03 +00:00
Zihao Mu
903bf0147e
Merge pull request #22666 from zihaomu:support_onnx_qdq_model
...
DNN: let Quant and Dequant of ONNX_importer support the Constant input.
* let Quant and Dequant support the Constant input.
* fix negative value of axis.
2022-10-31 16:06:31 +00:00
Zihao Mu
18fbb72f7d
fix the infinite loop in tf importer.
2022-10-31 20:10:25 +08:00
Alexander Smorkalov
22f8fb4d5c
Do not fail tests in Yolo v7 model was not found.
2022-10-24 17:59:18 +03:00
Alexander Smorkalov
23edec83fb
Merge pull request #22667 from zihaomu:bug_fix_in_winograd
...
DNN: bug fixed in Winograd
2022-10-21 17:54:13 +03:00
Alexander Smorkalov
e4cd430710
Merge pull request #22653 from WanliZhong:issue22597
...
DNN-TF: let StridedSlice layer support const input
2022-10-21 17:51:00 +03:00
Dmitry Kurtaev
35b2cff295
Merge pull request #22656 from dkurt:halide_fixes
...
* Fixes for Halide
* Enable some Halide tests
2022-10-21 17:49:49 +03:00
Zihao Mu
cee8c86b6e
fixed bug at winograd of SIMD128 and more robust code.
2022-10-21 19:14:54 +08:00
Alexander Smorkalov
5d292826b2
Merge pull request #22593 from zihaomu:optimize_wino
...
optimize winograd futher more
2022-10-19 13:08:32 +03:00
Alexander Smorkalov
f378f02954
Merge pull request #22652 from rogday:cuda_test_fixes
...
Address CUDA-related errors
2022-10-19 09:37:12 +03:00
Zhi-Qiang Zhou
c8561eae2d
Update region_layer.cpp
...
Fix objectness (dstData[index + 4]) is not assigned if new_coords == 1.
2022-10-19 11:17:23 +08:00
Smirnov Egor
dd14cf6a9c
address CUDA-related errors and enable cuda in elementwise ops
2022-10-18 16:54:42 +03:00
Alexander Smorkalov
ec7fc5adca
Merge pull request #22529 from fengyuentau:scatter_scatternd
...
DNN: supports Scatter and ScatterND from ONNX
2022-10-17 14:57:46 +03:00
Alexander Smorkalov
02143cd0e2
Merge pull request #22531 from zihaomu:stop_rely_name
...
Parsing quantized nodes does not rely on names
2022-10-17 11:20:24 +03:00
Alexander Smorkalov
1c5dcbcac8
Merge pull request #22639 from WanliZhong:issue#22625
...
DNN: Make Unsqueeze layer support negative axes
2022-10-17 09:27:49 +03:00
fengyuentau
d24d8f2abe
implementation of scatter and scatternd with conformance tests enabled
2022-10-17 11:30:32 +08:00
Alexander Alekhin
762481411d
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-10-15 16:44:47 +00:00
zoom
d816442e4d
Make Unsqueeze layer support negative axes.
2022-10-14 18:00:19 +08:00
Zihao Mu
0fa43e3aac
Optimize the winograd futher more.
2022-10-14 10:15:45 +08:00
zoom
9119692bb8
let StridedSlice layer support const input
2022-10-12 11:50:44 +08:00
Alexander Smorkalov
ec26541771
Merge pull request #22577 from zihaomu:Disable_winograd_branch_in_tryquantize
...
DNN: add enableWinograd API for Net
2022-10-11 09:44:00 +03:00
Zihao Mu
d9eff7daeb
parse quantized nodes does not rely on name.
2022-10-10 17:08:46 +08:00
Alexander Smorkalov
3419e64dcf
Merge pull request #22611 from zihaomu:greaterOrEqual
...
DNN: support GreaterOrEqual and LessOrEqual op in ONNX
2022-10-10 11:43:44 +03:00
Zihao Mu
1e2ceca4df
add enableWinograd API for Net.
2022-10-09 09:33:07 +08:00
Alexander Alekhin
347246901e
Merge pull request #21745 from alalek:dnn_plugin_openvino
2022-10-08 22:32:25 +00:00
Zihao Mu
9821fae59d
add greater_or_equal and less_or_equal ONNX support
2022-10-08 15:51:40 +08:00
Alexander Alekhin
43b2bb2c25
dnn: plugin support for OpenVINO
2022-10-07 16:57:31 +00:00
Alexander Smorkalov
96844b0ca5
Merge pull request #22554 from WanliZhong:slice_axes_no_seq
...
DNN: Let Slice layer support non-sequential and negative axes
2022-10-03 10:15:55 +03:00
zoom
4557971481
enhance slice layer
...
refactor the code for parsing Slice layer
add test for Slice layer
let 'begin' and 'end' resize to dims
add opset message comment
2022-10-01 17:12:07 +08:00
Zihao Mu
15cfafb360
DNN: Remove unused code in onnx_importer.cpp
2022-09-29 10:53:43 +08:00
Voron
cbf43a54fb
added opencv for openvino tutorial
2022-09-28 12:05:28 +02:00
Alexander Smorkalov
a6274647a4
Merge pull request #21738 from rogday:gather
...
add Gather implementation
2022-09-19 16:21:14 +03:00
Egor Smirnov
65f71ce2eb
add Gather implementation
2022-09-19 15:06:44 +03:00
Alexander Smorkalov
6aefb8e86f
Merge pull request #22290 from fengyuentau:naive_yolov7
...
Support for YOLOv7 ONNX (not simplified)
2022-09-19 14:43:18 +03:00
fengyuentau
4aef9b1c93
dnn: support yolov7 (not simplified)
2022-09-19 18:38:03 +08:00
Alexander Smorkalov
e1e9261450
Merge pull request #22479 from scottchou007:master
...
Fix issues in opencv_test_dnn from conv48 kernels without bias
2022-09-16 09:05:55 +03:00
scottchou007
a3cb2020bc
Fix issues in opencv_test_dnn from conv48 kernels using uninitialized tensors when there is no bias.
2022-09-15 13:41:27 -07:00
Alexander Alekhin
65bdb3a544
dnn: eliminate GCC12 warning in total() call
2022-09-14 11:37:00 +00:00
Alexander Smorkalov
c2c8da2517
Merge pull request #22448 from Ichini24:reshape-permutations-fix
...
changed names of permutations if Reshpe is in NHWC
2022-09-13 09:24:56 +03:00
wxsheng
4154bd0667
Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX
...
* Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX
* Add resize.lasx.cpp for Loongson SIMD acceleration
* Add imgwarp.lasx.cpp for Loongson SIMD acceleration
* Add LASX acceleration support for dnn/conv
* Add CV_PAUSE(v) for Loongarch
* Set LASX by default on Loongarch64
* LoongArch: tune test threshold for Core/HAL.mat_decomp/15
Co-authored-by: shengwenxue <shengwenxue@loongson.cn>
2022-09-10 09:39:43 +03:00
Alexander Alekhin
ca7f964104
dnn: use inheritance for OpenVINO net impl
2022-09-06 18:05:00 +00:00
anton
337452b4c0
changed names of permutations if Reshpe is in NHWC
2022-09-03 19:02:41 +02:00
Zihao Mu
b69b1eae8f
fix bug 22450
2022-09-02 16:30:06 +08:00
Alexander Smorkalov
70fb1cd603
Merge pull request #22440 from zihaomu:fix_conv_bug
2022-08-30 07:01:05 +00:00
Alexander Smorkalov
d2c48b898c
Merge pull request #22306 from zihaomu:qgemm_and_squeeze_opset13_onnximporter
2022-08-30 06:33:57 +00:00
Zihao Mu
2d837efba7
add qgemm and squeeze op13 supported on ONNXImporter
2022-08-30 09:50:29 +08:00
Alexander Smorkalov
1fd45a1b85
Merge pull request #22362 from fengyuentau:conv_asym_pad_fuse
...
Remove asymmetric padding in Conv layer since it is supported in CPU backend
2022-08-29 17:56:17 +03:00
Zihao Mu
2cd7e17b65
replace v_add with +
2022-08-29 17:15:35 +08:00
Alexander Smorkalov
2619099fe5
Merge pull request #22337 from zihaomu:load_ONNX_fp16_as_fp32
...
DNN: load fp16 ONNX model as fp32
2022-08-29 09:32:25 +03:00
fengyuentau
2959286eb5
tengine: supports conv with asymmetric padding
2022-08-29 02:51:26 +00:00
Zihao Mu
9638e34ab0
reuse WORDS_BIGENDIAN.
2022-08-27 07:42:38 +08:00
Zihao Mu
bb64db98d8
Further optimization of Conv2D, fused Conv_Add_Activation, bring latest code from ficus OpConv.fx. ( #22401 )
2022-08-26 12:57:25 +03:00
Zihao Mu
7eaec9dd22
load fp16 as fp32 and align fp16 and double in onnx_graph_simplifie
2022-08-26 10:04:44 +08:00
Zihao Mu
5e92bf8e41
support silu activation in darknet
2022-08-22 10:51:29 +08:00
Alexander Alekhin
2ebdc04787
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-08-14 15:50:42 +00:00
fengyuentau
0cdff46725
tune for opencl
2022-08-14 17:47:48 +08:00
Alexander Alekhin
d0d115321d
Merge pull request #22350 from alalek:rework_psabi_warning
2022-08-13 15:05:41 +00:00
Alexander Smorkalov
bb71cb200e
Merge pull request #22199 from zihaomu:bug_fix_22195
...
DNN: Reduce Layer (add dynamic batch and ReduceSum support)
2022-08-11 12:59:51 +03:00
fengyuentau
e7e814fa8c
remove asymmetric padding checks
2022-08-10 19:52:44 +08:00
Alexander Alekhin
44b2f9637a
Revert "suppress warning on GCC 7 and later"
...
This reverts commit a630ad73cb
.
2022-08-07 15:43:10 +03:00
Alexander Smorkalov
b2b7193374
Merge pull request #22311 from zihaomu:layer_fused_optmized_mish
...
DNN: add another two Mish activation to onnx_graph_simplifier
2022-08-05 14:22:06 +03:00
Zihao Mu
0614c40b42
add more skip for very long test case in test_dnn.
2022-08-02 14:58:05 +08:00
Zihao Mu
d4640f4647
support ReduceLayer without reshape layer.
2022-08-02 10:32:31 +08:00
Zihao Mu
57545653b1
replace new mish impl with softplus
2022-07-28 13:19:06 +08:00
Zihao Mu
3c5377ca1b
add another Mish graph simplifier.
2022-07-28 11:21:29 +08:00
HAN Liutong
e2bfe0ce76
Use "#if" instead of "#ifdef" for CV_SIMD128.
2022-07-21 03:23:57 +00:00
Zihao Mu
98c33c605d
batchsize dynamic is set to index 0.
2022-07-20 19:02:16 +08:00
rogday
ed69bcae2d
Merge pull request #21865 from rogday:nary_eltwise_layers
...
Reimplementation of Element-wise layers with broadcasting support
* init
* semi-working initial version
* add small_vector
* wip
* remove smallvec
* add nary function
* replace auto with Mat in lambda expr used in transform
* uncomment asserts
* autobuffer shape_buf & step_buf
* fix a missing bracket
* fixed a missing addLayer in parseElementWise
* solve one-dimensional broadcast
* remove pre_broadcast_transform for the case of two constants; fix missing constBlobsExtraInfo when addConstant is called
* one autobuffer for step & shape
* temporal fix for the missing original dimension information
* fix parseUnsqueeze when it gets a 1d tensor constant
* support sum/mean/min/max with only one input
* reuse old code to handle cases of two non-constant inputs
* add condition to handle div & mul of two non-constant inputs
* use || instead of or
* remove trainling spaces
* enlarge buf in binary_forward to contain other buffer
* use autobuffer in nary_forward
* generate data randomly and add more cases for perf
* add op and, or & xor
* update perf_dnn
* remove some comments
* remove legacy; add two ONNX conformance tests in filter
* move from cpu_denylist to all_denylist
* adjust parsing for inputs>=2
Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>
2022-07-19 06:14:05 +03:00
fengyuentau
1c7b71bf9e
define data_layout as unknown for pack
2022-07-14 19:27:20 +08:00
Zihao Mu
1b8fba8e26
support ReduceSum with two input and dynamic shape batch size in ReduceLayer.
2022-07-13 13:46:16 +08:00
Zihao Mu
45fbb67aba
fix scale layer can not handle 1x1 weight correctly.
2022-07-13 11:25:27 +08:00
Zihao Mu
139c443770
Merge pull request #22183 from zihaomu:fastConv_ARMv7_compatible
...
DNN: ARMv7 compatible fastConv
* support armv7 on fastConv
* remove whitespace.
2022-07-07 13:23:08 +03:00
Tomoaki Teshima
a630ad73cb
suppress warning on GCC 7 and later
2022-07-06 23:31:31 +09:00
Zihao Mu
a80fcacd90
Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor
...
Add per_tensor_quantize to int8 quantize
* add per_tensor_quantize to dnn int8 module.
* change api flag from perTensor to perChannel, and recognize quantize type and onnx importer.
* change the default to hpp
2022-07-05 19:14:42 +03:00
Zihao Mu
59b870a87a
Merge pull request #21910 from zihaomu:fast_conv_ARM
...
DNN: Accelerating convolution
* Fast Conv of ARM, X86 and universal intrinsics.
* improve code style.
* error fixed.
* improve the License
* optimize memory allocated and Adjust the threshold.
* change FasterRCNN_vgg16 to 2GB memory.
2022-07-01 13:03:15 +03:00
Zihao Mu
ef94275eb6
bug fixed of GEMM node in ONNX_importer
2022-06-22 21:08:48 +08:00
Wanli
a6ca48a1c2
Merge pull request #22100 from WanliZhong:issue_22015
...
Fix issue 22015, let Clip layer support 1-3 inputs
* Fix issue 22015.
Let layer Clip support 1-3 inputs.
* Resolve other problems caused by modifications
* Update onnx_importer.cpp
added extra checks to min/max handling in Clip
* Add assertions to check the size of the input
* Add test for clip with min and max initializers
* Separate test for "clip_init_min_max". Change the check method for input_size to provide a clearer message in case of problem.
* Add tests for clip with min or max initializers
* Change the implementation of getting input
Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>
2022-06-22 14:21:16 +03:00
Zihao Mu
2411b825b4
bug fixed of GEMM node in ONNX_importer
2022-06-22 15:00:17 +08:00
Alexander Alekhin
583bd1a6e2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-06-04 19:10:35 +00:00
Namgoo Lee
24547f40ff
remove const from functions returning by value
2022-05-26 21:30:41 +09:00
Alexander Alekhin
e9187ae38c
Merge pull request #22026 from alalek:update_version_3.4.18-pre
2022-05-24 20:23:28 +00:00
Alexander Alekhin
978dc76653
Merge pull request #22006 from rogday:21947_fix
2022-05-24 19:26:02 +00:00
rogday
a2ad997e97
fix vector access in TF::sortByExecutionOrder
2022-05-24 00:05:13 +03:00
Alexander Alekhin
e9428726ca
pre: OpenCV 4.6.0 (version++)
2022-05-23 19:25:16 +00:00
Alexander Alekhin
400906b433
pre: OpenCV 3.4.18 (version++)
2022-05-23 19:18:02 +00:00
berak
50d7c61c01
Update darknet_importer.cpp
...
make it more obvious, that this is a '404', not a 'parsing' problem
2022-05-23 19:18:31 +02:00
Alexander Alekhin
d9bf522b27
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-23 16:06:14 +00:00
rogday
93dc0679ec
Merge pull request #21818 from rogday:revert_renaming
...
* add prefixes to layer names and layer output names
* dnn: OPENCV_DNN_ONNX_USE_LEGACY_NAMES runtime parameter
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2022-05-23 14:50:42 +00:00
Alexander Alekhin
bb5462e327
Merge pull request #21991 from fengyuentau:qconv_asympad
2022-05-19 17:20:04 +00:00
fengyuentau
ff88132620
support asymmetric paddings for qconv
2022-05-16 19:01:37 +08:00
OpenCV Developers
d9a444ca1a
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-14 11:23:21 +00:00
Yulv-git
15ac54d5d6
Fix some typos in modules/.
2022-04-30 13:40:07 +08:00
Zihao Mu
64ded50bbf
parsing depth2space and space2depth of ONNX importer
2022-04-29 10:17:02 +08:00
rogday
9cd5a0a1e6
Merge pull request #21884 from rogday:cuda_cleanup
...
Fix CUDA compilation issues and adjust thresholds.
* Fix CUDA compilation issues and adjust thresholds.
* add conformance tests to denylist
2022-04-19 16:40:25 +00:00
OpenCV Developers
2985739b8c
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-04-16 14:41:15 +00:00
rogday
a2b84e9897
add assert to tf graph simplifier to address security concerns
2022-04-13 22:50:27 +03:00
OpenCV Pushbot
66f3c2673c
Merge pull request #21831 from zihaomu:sign_layer_onnx
...
DNN: Add sign, shrink and reciprocal for onnx_impoter
2022-04-13 17:08:30 +00:00
OpenCV Pushbot
03c9648f2e
Merge pull request #21854 from opencv-pushbot:dnn_test_update_checks_face_detector_4.x
2022-04-12 17:20:22 +00:00
OpenCV Developers
e3a55af336
dnn(test): update opencv_face_detector checks
...
original commit: be4a432bea
2022-04-11 20:27:06 +00:00
OpenCV Developers
be4a432bea
dnn(test): update opencv_face_detector checks
2022-04-11 20:26:25 +00:00
zihaomu
e36948cfbc
add ONNX OP sign, shrink and reciprocal
2022-04-07 15:32:12 +08:00
Alexander Alekhin
08d44f588f
dnn(test): update OpenVINO tests 2022.1.0 (OpenCV 4.x)
2022-04-05 14:13:38 +00:00
Alexander Alekhin
13a995cc1d
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-04-02 19:45:44 +00:00
Alexander Alekhin
4d927e73f1
dnn(test): update OpenVINO tests 2022.1.0
2022-04-02 17:42:53 +00:00
Alexander Alekhin
a233982931
Merge pull request #20938 from JulieBar:lstm_cuda2
2022-04-01 22:10:08 +00:00
Zihao Mu
7b582b71ba
Merge pull request #21036 from fengyuentau:timvx_backend_support
...
dnn: TIM-VX NPU backend support
* Add TimVX NPU backend for DNN module.
* use official branch from tim-vx repo; fix detecting viv sdk
Co-authored-by: fytao <yuantao.feng@outlook.com>
2022-03-31 21:42:11 +00:00
Smirnov Egor
abebbf04b1
Add CUDA support for LSTM.
...
Co-authored-by: Julia Bareeva <jbareeva@gmail.com>
2022-03-31 16:38:22 +03:00
Alexander Alekhin
5e434073d4
Merge pull request #21796 from alalek:dnn_reduce_fixup_21601
2022-03-30 22:26:28 +00:00
Alexander Alekhin
6f5cf8c15f
dnn: fix ReduceLayer implementation, update OpenVINO tests
2022-03-30 20:03:41 +00:00
Alexander Alekhin
b687bc807a
dnn(test): update OpenVINO tests 2021.4.2
2022-03-30 18:58:35 +00:00
Alexander Alekhin
1339ebaa84
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-03-26 16:00:28 +00:00
Alexander Alekhin
c9b90884da
Merge pull request #21601 from zihaomu:add_reduceLayer
2022-03-26 10:20:10 +00:00
luz paz
8e8e4bbabc
dnn: fix various dnn related typos
...
Fixes source comments and documentation related to dnn code.
2022-03-23 18:12:12 -04:00
Alexander Alekhin
4c79318694
dnn: fix index access
2022-03-19 06:54:07 +00:00
Zihao Mu
b6b5c27cec
Support for some reduce layers for onnx
2022-03-18 10:19:13 +08:00
Alexander Alekhin
685797f403
Merge pull request #21662 from alalek:dnn_split
2022-03-17 16:09:17 +00:00
rogday
93353aea70
Merge pull request #21522 from rogday:lstm
...
Fix LSTM support in ONNX
* fix LSTM and add peephole support
* disable old tests
* turn lambdas into functions
* more hacks for c++98
* add assertions
* slice fixes
* backport of cuda-related fixes
* address review comments
2022-03-15 09:14:05 +03:00
Alexander Alekhin
5bf3c1df24
Merge pull request #21715 from ilyachur:change_type_info_creation
2022-03-14 09:18:58 +00:00
Ilya Churaev
419918076e
Changed call of NodeTypeInfo constructor
2022-03-14 10:55:33 +03:00
Alexander Alekhin
a120adde63
dnn: add dnn.cpp file with information about git commits history
2022-03-08 19:22:47 +00:00
Alexander Alekhin
a80af177b6
dnn: split dnn.cpp code
...
base commit: 19926e2979
original dnn.cpp content: 19926e2979/modules/dnn/src/dnn.cpp
2022-03-08 19:22:46 +00:00
Tsukasa Sugiura
8db7d435b9
Merge pull request #21692 from UnaNancyOwen:add_softmax
...
* add apply softmax option to ClassificationModel
* remove default arguments of ClassificationModel::setSoftMax()
* fix build for python
* fix docs warning for setSoftMax()
* add impl for ClassficationModel()
* fix failed build for docs by trailing whitespace
* move to implement classify() to ClassificationModel_Impl
* move to implement softmax() to ClassificationModel_Impl
* remove softmax from public method in ClassificationModel
2022-03-07 20:26:15 +00:00
Alexander Alekhin
901e0ddfe4
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-03-05 19:46:28 +00:00
Alexander Alekhin
5cc27fd3b5
Merge pull request #21542 from rogday:split_expand
2022-02-28 22:38:24 +00:00
Egor Smirnov
375fe81311
fix slice and expand
2022-02-28 17:18:07 +03:00
Yuantao Feng
f77c3574af
Merge pull request #21607 from fengyuentau:fix_FaceDetectorYN_dynamic_shape
...
Use YuNet of fixed input shape to fix not-supported-dynamic-zero-shape for FaceDetectorYN
* use yunet with input of fixed shape
* update yunet used in face recognition regression
2022-02-21 13:49:07 +00:00
Maksim Shabunin
a251474144
Update filters in ONNX tests
2022-02-15 11:56:28 +03:00
Maksim Shabunin
45cbf70265
Update filters in ONNX tests
2022-02-14 17:16:49 +03:00
Alexander Alekhin
19926e2979
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-02-11 17:32:37 +00:00
Alexander Alekhin
effce0573b
dnn: drop legacy Inference Engine NN builder API
2022-02-10 11:55:24 +00:00
Alexander Alekhin
57d3002ee1
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-02-06 16:10:43 +00:00
Alexander Alekhin
a00a0dbfcd
Merge pull request #21564 from alalek:dnn_fix_openvino_outputs
2022-02-06 16:06:23 +00:00
Alexander Alekhin
b41d2c5c14
Merge pull request #21569 from alalek:fixup_18031
2022-02-06 16:04:38 +00:00
Alexander Alekhin
1da48beeec
dnn(ngraph): fix output names
2022-02-06 13:08:53 +00:00
Alexander Alekhin
b57ff73086
dnn(ngraph): fix outputs handling, drop 'unconnected' logic
2022-02-06 13:08:53 +00:00
Alexander Alekhin
67978b5746
dnn(ngraph): add debuging messages
2022-02-06 13:08:53 +00:00
Alexander Alekhin
062f305d1a
dnn: don't fuse 'outputs' with OpenVINO backend
2022-02-06 13:08:53 +00:00
Alexander Alekhin
1f70d4e2a5
dnn(test): re-enable ONNX split tests for OpenVINO
2022-02-06 10:36:15 +00:00
Alexander Alekhin
aa5bc20c83
dnn(ngraph): fixup get_output_as_single_output_node() replacement patch
2022-02-06 10:35:59 +00:00
Maksim Shabunin
d1e76a34a0
3.4: Use modern OpenVINO package interface
...
original commit: 437af37b13
2022-02-02 09:04:03 +00:00
Maksim Shabunin
437af37b13
Use modern OpenVINO package interface
2022-02-01 16:52:17 +00:00
Alexander Alekhin
870c8d3c4e
dnn(test): fix int8 tolerances
2022-01-31 12:54:01 +00:00
Alexander Alekhin
d573472a86
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-01-31 12:53:45 +00:00
Alexander Alekhin
a7e6a1059c
dnn(test): fix outputs handling in ONNX conformance
...
- ONNX output is 1 tensor per defined output instead of N tensors from outputs of "output" layer
2022-01-29 23:29:51 +00:00
Alexander Alekhin
85719a0a5d
dnn: support outputs registration under new names
...
- fixed ONNX importer
2022-01-29 23:29:51 +00:00
Alexander Alekhin
dc35633aa4
Merge pull request #21521 from alalek:dnn_ignore_denormals
2022-01-28 15:31:44 +00:00
Zihao Mu
9e3ba487fa
Merge pull request #21518 from zihaomu:resize_onnx_opset13
...
Add resize layer compatible with ONNX opset13 version
2022-01-28 17:55:01 +03:00
Alexander Alekhin
9188ce68aa
Merge pull request #21490 from rogday:optional_outputs
2022-01-26 15:18:07 +00:00
Alexander Alekhin
70b0274c8e
dnn: apply hint to ignore denormals processing
2022-01-26 11:28:35 +00:00
Alexander Alekhin
b796ededae
Merge pull request #21437 from alalek:dnn_api_explicit_const_4.x
2022-01-21 20:19:50 +00:00
Alexander Alekhin
eb7b45d26b
dnn: fix API - explicit ctors, const methods
2022-01-21 12:38:51 +00:00
Smirnov Egor
17b2d92a3d
add optional outputs support and fix graph links
2022-01-21 12:31:46 +03:00
Alexander Alekhin
6ffa2b01e1
Merge pull request #21357 from rogday:model_diag
2022-01-18 15:50:11 +00:00
rogday
0fe7420638
fix model diagnostic tool
2022-01-18 01:22:22 +03:00
Alexander Alekhin
b304730225
dnn: fix API - explicit ctors, const methods
2022-01-17 21:45:29 +00:00
Maksim Shabunin
d5f73f89d8
Fixed issues found by static analysis
2022-01-13 14:51:25 +03:00
Alexander Alekhin
aebb65e983
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-01-12 13:26:10 +00:00
Alexander Alekhin
80d9f624d0
dnn: don't use aligned load without alignment checks
...
- weights are unaligned in dasiamprn sample (comes from numpy)
2022-01-12 05:11:18 +00:00
Alexander Alekhin
76fb3652fc
dnn(ocl): fix fp16 kernel compilation
2021-12-29 19:58:25 +00:00
Alexander Alekhin
9699e2b483
dnn(onnx): handle non-default ONNX domains
...
- re-enable quantized models tests
2021-12-25 01:38:52 +00:00
Alexander Alekhin
217fea9667
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-24 16:48:07 +00:00
Alexander Alekhin
cdd4354256
Merge pull request #21336 from alalek:dnn_pooling_check_array_indexes
2021-12-24 08:35:11 +00:00
Alexander Alekhin
6385511e88
dnn: add checks in pooling layer implementation
...
- to avoid out of buffer access
2021-12-24 00:15:30 +00:00
Alexander Alekhin
ed4becf007
dnn(onnx): debug dump of inputs/outputs/initializers in importer
2021-12-23 21:11:40 +00:00
Alexander Alekhin
f5589445b9
Merge pull request #21322 from alalek:dnn_catch_errors
2021-12-23 20:09:22 +00:00
Alexander Alekhin
011ed380aa
Merge pull request #21323 from alalek:dnn_do_not_rebuilt_network_in_setInput
2021-12-23 15:45:28 +00:00
Alexander Alekhin
88a18c8b6a
dnn(onnx): emit error in Shape for dynamic input
2021-12-23 15:42:59 +00:00
Alexander Alekhin
51e65db715
dnn(onnx): fix Resize inputs handling
2021-12-23 15:42:59 +00:00
Alexander Alekhin
cc02fcd889
dnn: improve debug messages, add ONNX opset version
2021-12-23 15:42:59 +00:00
Alexander Alekhin
c408157a4d
dnn: do not try to rebuilt network during setInput()
...
- this doesn't make sense in case of multiple inputs
2021-12-23 02:40:33 +00:00
Alexander Alekhin
6e299b582a
dnn(test): decompose 'DynamicAxes' test
2021-12-23 00:47:27 +00:00
Alexander Alekhin
9777fbacf6
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-22 15:57:02 +00:00
Alexander Alekhin
c78a8dfd2d
fix 4.x links
2021-12-22 13:24:30 +00:00
Alexander Alekhin
b1a57c4cb2
fix 3.4 links
2021-12-22 12:38:21 +00:00
rogday
0a178a687a
fix const/x in Div
2021-12-20 19:53:37 +03:00
Alexander Alekhin
80492d663e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-18 16:19:06 +00:00
Alexander Alekhin
172c539a5a
Merge pull request #21281 from alalek:update_version_4.5.5-pre
2021-12-18 13:46:38 +00:00
Alexander Alekhin
a079acc0d9
Merge pull request #21280 from alalek:update_version_3.4.17-pre
2021-12-18 13:46:29 +00:00
Smirnov Egor
71a22e45b0
add celu, hardsigmoid, selu, thresholdedrelu layers
2021-12-18 03:19:54 +03:00
Alexander Alekhin
04ee99f1a3
Merge pull request #21282 from alalek:dnn_test_vulkan_skip
2021-12-17 19:39:35 +00:00
Smirnov Egor
1bd382c1d0
Add acos, acosh, asin, asinh, atan, atanh, cos, cosh, erf, hardswish, sin, sinh, softplus, softsign, tan layers
2021-12-17 18:19:40 +03:00
Alexander Alekhin
249c508126
dnn(test): skip failed tests on Vulkan backend
2021-12-17 14:01:04 +00:00
Smirnov Egor
fec2c7e715
fix Flatten layer
2021-12-17 16:29:56 +03:00
Alexander Alekhin
07dca8cc03
pre: OpenCV 4.5.5 (version++)
2021-12-17 10:12:11 +00:00
Alexander Alekhin
60c093f086
pre: OpenCV 3.4.17 (version++)
2021-12-17 10:05:52 +00:00
Alexander Alekhin
622b9d9276
Merge pull request #21267 from mshabunin:fix-kw-2021-12
2021-12-16 18:51:47 +00:00
Gruhuang
b4bb98ea60
Merge pull request #21268 from pccvlab:tf_Arg
...
add argmax and argmin parsing for tensorflow
* add argmax and argmin for tf
* remove whitespace
* remove whitespace
* remove static_cast
Signed-off-by: Crayon-new <1349159541@qq.com>
2021-12-16 17:06:02 +00:00
Maksim Shabunin
792b7e0629
(3.4) Fixed several issues found by static analysis
...
original commit: a079c2eb7c
2021-12-16 17:02:58 +00:00
Maksim Shabunin
a079c2eb7c
Fixed several issues found by static analysis
2021-12-16 19:21:25 +03:00
Alexander Alekhin
6d677bbd63
dnn(test): update ONNX conformance filters (4.x)
2021-12-16 12:09:31 +00:00
Alexander Alekhin
299f9837b7
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-15 16:38:56 +00:00
Alexander Alekhin
f3ba88c87c
dnn(test): update ONNX conformance filters
2021-12-15 12:53:53 +00:00
Smirnov Egor
e97c7e042b
fix max_unpool missing attributes, add default value of keepdims in reducemean/max/sum, add support for keepdims=true in full reduction branch, add new padding type to Pad
2021-12-14 22:09:27 +03:00
rogday
4827fe86bb
Merge pull request #21088 from rogday:onnx_tests
...
Onnx conformance tests
* Add ONNX conformance tests
* dnn(test): add filters for ONNX conformance tests
* add filter lists for OCV backend
* address review comments
* move test_clip_inbounds to all_denylist
* address clip issue
* avoid empty lists
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2021-12-14 16:58:06 +00:00
cqn2219076254
252ce0b581
add square layer
2021-12-13 21:43:13 +08:00
Alexander Alekhin
6e50e4b9ee
Merge pull request #21161 from rogday:elu_alpha_4x
2021-12-10 16:04:01 +00:00
HAN Liutong
1599f9f0c0
Merge pull request #21086 from hanliutong:rvv-dnn
...
Further optimize DNN for RISC-V Vector.
* Optimize DNN on RVV by using vsetvl.
* Rename vl.
* Update fastConv by using setvl instead of mask.
* Fix fastDepthwiseConv
2021-12-10 16:03:22 +00:00
Gruhuang
17bc8565f6
Merge pull request #21154 from pccvlab:MatMul_with_two_inputs
...
Add BatchMatMul layer support for tf_importer
* two inputs
* support batch_matmul
* refactor: remove useless code
* refactor: decrease nesting
2021-12-10 14:44:27 +03:00
Smirnov Egor
e608adea60
add ArgMax and ArgMin layers
2021-12-06 20:49:54 +03:00
Alexander Alekhin
73318fd514
Merge pull request #21167 from alalek:dnn_test_reenable_ov_2021_4
2021-12-04 22:12:18 +00:00
HAN Liutong
4935b14539
Merge pull request #21012 from hanliutong:rvv_clang
...
Update RVV backend for using Clang.
* Update cmake file of clang.
* Modify the RVV optimization on DNN to adapt to clang.
* Modify intrin_rvv: Disable some existing types.
* Modify intrin_rvv: Reinterpret instead of load&cast.
* Modify intrin_rvv: Update load&store without cast.
* Modify intrin_rvv: Rename vfredsum to fredosum.
* Modify intrin_rvv: Rewrite Check all/any by using vpopc.
* Modify intrin_rvv: Use reinterpret instead of c-style casting.
* Remove all macros which is not used in v_reinterpret
* Rename vpopc to vcpop according to spec.
2021-12-03 15:13:24 +00:00
Alexander Alekhin
0835611d3a
dnn(test): re-enable tests which works with OpenVINO 2021.4.x
2021-12-03 14:01:30 +00:00
Alexander Alekhin
8b4fa2605e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-03 12:32:49 +00:00
Alexander Alekhin
35ff9af6ce
Merge pull request #21162 from rogday:softmax_simplification
2021-12-02 17:14:48 +00:00
Alexander Alekhin
dad2b9aac8
Merge pull request #21160 from rogday:elu_alpha
2021-12-02 17:13:57 +00:00
rogday
1613d30544
Merge pull request #21159 from rogday:ceil_mode
...
fix ceil_mode for Average/MaxPooling
* fix ceil_mode
* add a comment
2021-12-02 20:11:11 +03:00
Alexander Alekhin
b9d0dc60b0
Merge pull request #21173 from alalek:3.4_dnn_test_reenable_ov_2021_4
2021-12-02 16:33:15 +00:00
Alexander Alekhin
d206350738
Merge pull request #21172 from alalek:dnn_test_drop_non_cpu_int8
2021-12-02 13:50:04 +00:00
Alexander Alekhin
bd396e1fd5
dnn(test): re-enable tests which works with OpenVINO 2021.4.x (3.4)
2021-12-02 11:30:45 +00:00
Alexander Alekhin
f55c9ed1ba
dnn(test): drop non OCV/CPU cases for Int8
...
- zero code coverage and up to x3-x8 tests slowdown
- implementation executes OCV/CPU in all cases
- wrong skip conditions
2021-12-02 06:27:10 +00:00
Alexander Alekhin
5da69c0b9a
Merge pull request #21164 from rogday:sum_identity
2021-12-01 22:49:02 +00:00
Alexander Alekhin
a806e8cc58
Merge pull request #21163 from rogday:transpose_default
2021-12-01 22:47:57 +00:00
Alexander Alekhin
d9e7c1626a
Merge pull request #21153 from alalek:build_warnings_msvs2017
2021-12-01 12:49:28 +00:00
Smirnov Egor
33e97e994d
add sum of 1 input
2021-11-30 15:42:20 +03:00
Smirnov Egor
11e6848bb9
add default order to transpose
2021-11-30 15:34:34 +03:00
Smirnov Egor
829410729c
add new (Log)SoftMax simplification passes
2021-11-30 15:20:52 +03:00
Smirnov Egor
4995aecd62
add alpha parameter to ELU
2021-11-30 14:43:18 +03:00
Smirnov Egor
0e2a3686c0
add alpha parameter to ELU layer
2021-11-30 12:20:35 +03:00
Alexander Alekhin
66b2140892
build: eliminate C4309 warning from protobuf files with MSVS2017
2021-11-30 04:27:39 +00:00
Alexander Alekhin
0d2857a242
Merge pull request #21152 from rogday:fix_defaults
2021-11-29 22:39:27 +00:00
Alexander Alekhin
17d99e6266
Merge pull request #21142 from alalek:dnn_two_inputs_ocl_fp16_3.4
2021-11-29 21:44:59 +00:00
Andrew Ryrie
ea7d4be3f8
Merge pull request #20658 from smbz:lstm_optimisation
...
* dnn: LSTM optimisation
This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm.
fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications:
- Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned.
- Allow for weight matrices where the number of columns is not a multiple of 8.
I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on.
* Fix warning about initialisation order
* Remove C++11 syntax
* Fix build when AVX(2) is not available
In this case the CV_TRY_X macros are defined to 0, rather than being undefined.
* Minor changes as requested:
- Don't check hardware support for AVX(2) when dispatch is disabled for these
- Add braces
* Fix out-of-bounds access in fully connected layer
The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway).
This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems.
* Improve tail mask handling
- Use static array for generating tail masks (as requested)
- Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs
* Revert whitespace change
* Improve readability of conditions for using AVX
* dnn(lstm): minor coding style changes, replaced left aligned load
2021-11-29 21:43:00 +00:00
Smirnov Egor
05db8784ae
fix Clip, LeakyReLU, LRN, Split defaults
2021-11-29 20:20:34 +03:00
Supernovae
b594ed99b8
Merge pull request #20933 from shubham-shahh:master
...
Improved overall readability of the code
* grid_nms.cu: minor fix-ups
* Update grid_stride_range.hpp
* Update tf_importer.cpp
2021-11-28 12:54:29 +00:00
Alexander Alekhin
58b06222ff
dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling
2021-11-28 07:44:05 +00:00
Alexander Alekhin
58dc397930
dnn(test): add two_inputs test with FP32/U8 data types
...
- remove similar test from IE scope under HAVE_INF_ENGINE
2021-11-28 07:44:04 +00:00
yuki takehara
a6277370ca
Merge pull request #21107 from take1014:remove_assert_21038
...
resolves #21038
* remove C assert
* revert C header
* fix several points in review
* fix test_ds.cpp
2021-11-27 18:34:52 +00:00
Alexander Alekhin
31b2d6be75
dnn(test): update InferenceEngine tests (4.x)
2021-11-27 18:16:52 +00:00