Alexander Smorkalov
1fd45a1b85
Merge pull request #22362 from fengyuentau:conv_asym_pad_fuse
...
Remove asymmetric padding in Conv layer since it is supported in CPU backend
2022-08-29 17:56:17 +03:00
Zihao Mu
2cd7e17b65
replace v_add with +
2022-08-29 17:15:35 +08:00
Alexander Smorkalov
2619099fe5
Merge pull request #22337 from zihaomu:load_ONNX_fp16_as_fp32
...
DNN: load fp16 ONNX model as fp32
2022-08-29 09:32:25 +03:00
fengyuentau
2959286eb5
tengine: supports conv with asymmetric padding
2022-08-29 02:51:26 +00:00
Zihao Mu
9638e34ab0
reuse WORDS_BIGENDIAN.
2022-08-27 07:42:38 +08:00
Zihao Mu
bb64db98d8
Further optimization of Conv2D, fused Conv_Add_Activation, bring latest code from ficus OpConv.fx. ( #22401 )
2022-08-26 12:57:25 +03:00
Zihao Mu
7eaec9dd22
load fp16 as fp32 and align fp16 and double in onnx_graph_simplifie
2022-08-26 10:04:44 +08:00
Zihao Mu
5e92bf8e41
support silu activation in darknet
2022-08-22 10:51:29 +08:00
Alexander Alekhin
2ebdc04787
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-08-14 15:50:42 +00:00
fengyuentau
0cdff46725
tune for opencl
2022-08-14 17:47:48 +08:00
Alexander Smorkalov
bb71cb200e
Merge pull request #22199 from zihaomu:bug_fix_22195
...
DNN: Reduce Layer (add dynamic batch and ReduceSum support)
2022-08-11 12:59:51 +03:00
fengyuentau
e7e814fa8c
remove asymmetric padding checks
2022-08-10 19:52:44 +08:00
Zihao Mu
d4640f4647
support ReduceLayer without reshape layer.
2022-08-02 10:32:31 +08:00
Zihao Mu
57545653b1
replace new mish impl with softplus
2022-07-28 13:19:06 +08:00
Zihao Mu
3c5377ca1b
add another Mish graph simplifier.
2022-07-28 11:21:29 +08:00
HAN Liutong
e2bfe0ce76
Use "#if" instead of "#ifdef" for CV_SIMD128.
2022-07-21 03:23:57 +00:00
Zihao Mu
98c33c605d
batchsize dynamic is set to index 0.
2022-07-20 19:02:16 +08:00
rogday
ed69bcae2d
Merge pull request #21865 from rogday:nary_eltwise_layers
...
Reimplementation of Element-wise layers with broadcasting support
* init
* semi-working initial version
* add small_vector
* wip
* remove smallvec
* add nary function
* replace auto with Mat in lambda expr used in transform
* uncomment asserts
* autobuffer shape_buf & step_buf
* fix a missing bracket
* fixed a missing addLayer in parseElementWise
* solve one-dimensional broadcast
* remove pre_broadcast_transform for the case of two constants; fix missing constBlobsExtraInfo when addConstant is called
* one autobuffer for step & shape
* temporal fix for the missing original dimension information
* fix parseUnsqueeze when it gets a 1d tensor constant
* support sum/mean/min/max with only one input
* reuse old code to handle cases of two non-constant inputs
* add condition to handle div & mul of two non-constant inputs
* use || instead of or
* remove trainling spaces
* enlarge buf in binary_forward to contain other buffer
* use autobuffer in nary_forward
* generate data randomly and add more cases for perf
* add op and, or & xor
* update perf_dnn
* remove some comments
* remove legacy; add two ONNX conformance tests in filter
* move from cpu_denylist to all_denylist
* adjust parsing for inputs>=2
Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>
2022-07-19 06:14:05 +03:00
fengyuentau
1c7b71bf9e
define data_layout as unknown for pack
2022-07-14 19:27:20 +08:00
Zihao Mu
1b8fba8e26
support ReduceSum with two input and dynamic shape batch size in ReduceLayer.
2022-07-13 13:46:16 +08:00
Zihao Mu
45fbb67aba
fix scale layer can not handle 1x1 weight correctly.
2022-07-13 11:25:27 +08:00
Zihao Mu
139c443770
Merge pull request #22183 from zihaomu:fastConv_ARMv7_compatible
...
DNN: ARMv7 compatible fastConv
* support armv7 on fastConv
* remove whitespace.
2022-07-07 13:23:08 +03:00
Zihao Mu
a80fcacd90
Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor
...
Add per_tensor_quantize to int8 quantize
* add per_tensor_quantize to dnn int8 module.
* change api flag from perTensor to perChannel, and recognize quantize type and onnx importer.
* change the default to hpp
2022-07-05 19:14:42 +03:00
Zihao Mu
59b870a87a
Merge pull request #21910 from zihaomu:fast_conv_ARM
...
DNN: Accelerating convolution
* Fast Conv of ARM, X86 and universal intrinsics.
* improve code style.
* error fixed.
* improve the License
* optimize memory allocated and Adjust the threshold.
* change FasterRCNN_vgg16 to 2GB memory.
2022-07-01 13:03:15 +03:00
Zihao Mu
ef94275eb6
bug fixed of GEMM node in ONNX_importer
2022-06-22 21:08:48 +08:00
Wanli
a6ca48a1c2
Merge pull request #22100 from WanliZhong:issue_22015
...
Fix issue 22015, let Clip layer support 1-3 inputs
* Fix issue 22015.
Let layer Clip support 1-3 inputs.
* Resolve other problems caused by modifications
* Update onnx_importer.cpp
added extra checks to min/max handling in Clip
* Add assertions to check the size of the input
* Add test for clip with min and max initializers
* Separate test for "clip_init_min_max". Change the check method for input_size to provide a clearer message in case of problem.
* Add tests for clip with min or max initializers
* Change the implementation of getting input
Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>
2022-06-22 14:21:16 +03:00
Zihao Mu
2411b825b4
bug fixed of GEMM node in ONNX_importer
2022-06-22 15:00:17 +08:00
Alexander Alekhin
583bd1a6e2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-06-04 19:10:35 +00:00
Namgoo Lee
24547f40ff
remove const from functions returning by value
2022-05-26 21:30:41 +09:00
Alexander Alekhin
978dc76653
Merge pull request #22006 from rogday:21947_fix
2022-05-24 19:26:02 +00:00
rogday
a2ad997e97
fix vector access in TF::sortByExecutionOrder
2022-05-24 00:05:13 +03:00
berak
50d7c61c01
Update darknet_importer.cpp
...
make it more obvious, that this is a '404', not a 'parsing' problem
2022-05-23 19:18:31 +02:00
Alexander Alekhin
d9bf522b27
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-23 16:06:14 +00:00
rogday
93dc0679ec
Merge pull request #21818 from rogday:revert_renaming
...
* add prefixes to layer names and layer output names
* dnn: OPENCV_DNN_ONNX_USE_LEGACY_NAMES runtime parameter
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2022-05-23 14:50:42 +00:00
Alexander Alekhin
bb5462e327
Merge pull request #21991 from fengyuentau:qconv_asympad
2022-05-19 17:20:04 +00:00
fengyuentau
ff88132620
support asymmetric paddings for qconv
2022-05-16 19:01:37 +08:00
OpenCV Developers
d9a444ca1a
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-05-14 11:23:21 +00:00
Yulv-git
15ac54d5d6
Fix some typos in modules/.
2022-04-30 13:40:07 +08:00
Zihao Mu
64ded50bbf
parsing depth2space and space2depth of ONNX importer
2022-04-29 10:17:02 +08:00
rogday
9cd5a0a1e6
Merge pull request #21884 from rogday:cuda_cleanup
...
Fix CUDA compilation issues and adjust thresholds.
* Fix CUDA compilation issues and adjust thresholds.
* add conformance tests to denylist
2022-04-19 16:40:25 +00:00
OpenCV Developers
2985739b8c
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-04-16 14:41:15 +00:00
rogday
a2b84e9897
add assert to tf graph simplifier to address security concerns
2022-04-13 22:50:27 +03:00
zihaomu
e36948cfbc
add ONNX OP sign, shrink and reciprocal
2022-04-07 15:32:12 +08:00
Alexander Alekhin
a233982931
Merge pull request #20938 from JulieBar:lstm_cuda2
2022-04-01 22:10:08 +00:00
Zihao Mu
7b582b71ba
Merge pull request #21036 from fengyuentau:timvx_backend_support
...
dnn: TIM-VX NPU backend support
* Add TimVX NPU backend for DNN module.
* use official branch from tim-vx repo; fix detecting viv sdk
Co-authored-by: fytao <yuantao.feng@outlook.com>
2022-03-31 21:42:11 +00:00
Smirnov Egor
abebbf04b1
Add CUDA support for LSTM.
...
Co-authored-by: Julia Bareeva <jbareeva@gmail.com>
2022-03-31 16:38:22 +03:00
Alexander Alekhin
5e434073d4
Merge pull request #21796 from alalek:dnn_reduce_fixup_21601
2022-03-30 22:26:28 +00:00
Alexander Alekhin
6f5cf8c15f
dnn: fix ReduceLayer implementation, update OpenVINO tests
2022-03-30 20:03:41 +00:00
Alexander Alekhin
1339ebaa84
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-03-26 16:00:28 +00:00
Alexander Alekhin
c9b90884da
Merge pull request #21601 from zihaomu:add_reduceLayer
2022-03-26 10:20:10 +00:00
luz paz
8e8e4bbabc
dnn: fix various dnn related typos
...
Fixes source comments and documentation related to dnn code.
2022-03-23 18:12:12 -04:00
Alexander Alekhin
4c79318694
dnn: fix index access
2022-03-19 06:54:07 +00:00
Zihao Mu
b6b5c27cec
Support for some reduce layers for onnx
2022-03-18 10:19:13 +08:00
Alexander Alekhin
685797f403
Merge pull request #21662 from alalek:dnn_split
2022-03-17 16:09:17 +00:00
rogday
93353aea70
Merge pull request #21522 from rogday:lstm
...
Fix LSTM support in ONNX
* fix LSTM and add peephole support
* disable old tests
* turn lambdas into functions
* more hacks for c++98
* add assertions
* slice fixes
* backport of cuda-related fixes
* address review comments
2022-03-15 09:14:05 +03:00
Alexander Alekhin
5bf3c1df24
Merge pull request #21715 from ilyachur:change_type_info_creation
2022-03-14 09:18:58 +00:00
Ilya Churaev
419918076e
Changed call of NodeTypeInfo constructor
2022-03-14 10:55:33 +03:00
Alexander Alekhin
a120adde63
dnn: add dnn.cpp file with information about git commits history
2022-03-08 19:22:47 +00:00
Alexander Alekhin
a80af177b6
dnn: split dnn.cpp code
...
base commit: 19926e2979
original dnn.cpp content: 19926e2979/modules/dnn/src/dnn.cpp
2022-03-08 19:22:46 +00:00
Tsukasa Sugiura
8db7d435b9
Merge pull request #21692 from UnaNancyOwen:add_softmax
...
* add apply softmax option to ClassificationModel
* remove default arguments of ClassificationModel::setSoftMax()
* fix build for python
* fix docs warning for setSoftMax()
* add impl for ClassficationModel()
* fix failed build for docs by trailing whitespace
* move to implement classify() to ClassificationModel_Impl
* move to implement softmax() to ClassificationModel_Impl
* remove softmax from public method in ClassificationModel
2022-03-07 20:26:15 +00:00
Alexander Alekhin
901e0ddfe4
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-03-05 19:46:28 +00:00
Alexander Alekhin
5cc27fd3b5
Merge pull request #21542 from rogday:split_expand
2022-02-28 22:38:24 +00:00
Egor Smirnov
375fe81311
fix slice and expand
2022-02-28 17:18:07 +03:00
Yuantao Feng
f77c3574af
Merge pull request #21607 from fengyuentau:fix_FaceDetectorYN_dynamic_shape
...
Use YuNet of fixed input shape to fix not-supported-dynamic-zero-shape for FaceDetectorYN
* use yunet with input of fixed shape
* update yunet used in face recognition regression
2022-02-21 13:49:07 +00:00
Alexander Alekhin
19926e2979
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-02-11 17:32:37 +00:00
Alexander Alekhin
effce0573b
dnn: drop legacy Inference Engine NN builder API
2022-02-10 11:55:24 +00:00
Alexander Alekhin
57d3002ee1
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-02-06 16:10:43 +00:00
Alexander Alekhin
1da48beeec
dnn(ngraph): fix output names
2022-02-06 13:08:53 +00:00
Alexander Alekhin
b57ff73086
dnn(ngraph): fix outputs handling, drop 'unconnected' logic
2022-02-06 13:08:53 +00:00
Alexander Alekhin
67978b5746
dnn(ngraph): add debuging messages
2022-02-06 13:08:53 +00:00
Alexander Alekhin
062f305d1a
dnn: don't fuse 'outputs' with OpenVINO backend
2022-02-06 13:08:53 +00:00
Alexander Alekhin
aa5bc20c83
dnn(ngraph): fixup get_output_as_single_output_node() replacement patch
2022-02-06 10:35:59 +00:00
Alexander Alekhin
d573472a86
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-01-31 12:53:45 +00:00
Alexander Alekhin
85719a0a5d
dnn: support outputs registration under new names
...
- fixed ONNX importer
2022-01-29 23:29:51 +00:00
Alexander Alekhin
dc35633aa4
Merge pull request #21521 from alalek:dnn_ignore_denormals
2022-01-28 15:31:44 +00:00
Zihao Mu
9e3ba487fa
Merge pull request #21518 from zihaomu:resize_onnx_opset13
...
Add resize layer compatible with ONNX opset13 version
2022-01-28 17:55:01 +03:00
Alexander Alekhin
9188ce68aa
Merge pull request #21490 from rogday:optional_outputs
2022-01-26 15:18:07 +00:00
Alexander Alekhin
70b0274c8e
dnn: apply hint to ignore denormals processing
2022-01-26 11:28:35 +00:00
Alexander Alekhin
b796ededae
Merge pull request #21437 from alalek:dnn_api_explicit_const_4.x
2022-01-21 20:19:50 +00:00
Alexander Alekhin
eb7b45d26b
dnn: fix API - explicit ctors, const methods
2022-01-21 12:38:51 +00:00
Smirnov Egor
17b2d92a3d
add optional outputs support and fix graph links
2022-01-21 12:31:46 +03:00
Alexander Alekhin
6ffa2b01e1
Merge pull request #21357 from rogday:model_diag
2022-01-18 15:50:11 +00:00
rogday
0fe7420638
fix model diagnostic tool
2022-01-18 01:22:22 +03:00
Alexander Alekhin
b304730225
dnn: fix API - explicit ctors, const methods
2022-01-17 21:45:29 +00:00
Maksim Shabunin
d5f73f89d8
Fixed issues found by static analysis
2022-01-13 14:51:25 +03:00
Alexander Alekhin
aebb65e983
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2022-01-12 13:26:10 +00:00
Alexander Alekhin
80d9f624d0
dnn: don't use aligned load without alignment checks
...
- weights are unaligned in dasiamprn sample (comes from numpy)
2022-01-12 05:11:18 +00:00
Alexander Alekhin
76fb3652fc
dnn(ocl): fix fp16 kernel compilation
2021-12-29 19:58:25 +00:00
Alexander Alekhin
9699e2b483
dnn(onnx): handle non-default ONNX domains
...
- re-enable quantized models tests
2021-12-25 01:38:52 +00:00
Alexander Alekhin
217fea9667
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-24 16:48:07 +00:00
Alexander Alekhin
cdd4354256
Merge pull request #21336 from alalek:dnn_pooling_check_array_indexes
2021-12-24 08:35:11 +00:00
Alexander Alekhin
6385511e88
dnn: add checks in pooling layer implementation
...
- to avoid out of buffer access
2021-12-24 00:15:30 +00:00
Alexander Alekhin
ed4becf007
dnn(onnx): debug dump of inputs/outputs/initializers in importer
2021-12-23 21:11:40 +00:00
Alexander Alekhin
f5589445b9
Merge pull request #21322 from alalek:dnn_catch_errors
2021-12-23 20:09:22 +00:00
Alexander Alekhin
88a18c8b6a
dnn(onnx): emit error in Shape for dynamic input
2021-12-23 15:42:59 +00:00
Alexander Alekhin
51e65db715
dnn(onnx): fix Resize inputs handling
2021-12-23 15:42:59 +00:00
Alexander Alekhin
cc02fcd889
dnn: improve debug messages, add ONNX opset version
2021-12-23 15:42:59 +00:00
Alexander Alekhin
c408157a4d
dnn: do not try to rebuilt network during setInput()
...
- this doesn't make sense in case of multiple inputs
2021-12-23 02:40:33 +00:00
Alexander Alekhin
9777fbacf6
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-12-22 15:57:02 +00:00
rogday
0a178a687a
fix const/x in Div
2021-12-20 19:53:37 +03:00