Alexander Alekhin
8c2dd5fb9a
dnn(ocl4dnn): cleanup dead code, improve logging
2021-10-08 00:39:40 +00:00
Alexander Alekhin
724e04e979
dnn(ocl4dnn): add extra checks to convolution layer
...
- prevent running code over unsupported/non-tested configurations
- prevent integer div by zero
2021-10-07 23:18:32 +00:00
Oliver Kuckertz
a3d7811f24
Merge pull request #20725 from mologie:fix-dnn-tf-on-arm
...
* dnn: fix unaligned memory access crash on armv7
The getTensorContent function would return a Mat pointing to some
member of a Protobuf-encoded message. Protobuf does not make any
alignment guarantees, which results in a crash on armv7 when loading
models while bit 2 is set in /proc/cpu/alignment (or the relevant
kernel feature for alignment compatibility is disabled). Any read
attempt from the previously unaligned data member would send SIGBUS.
As workaround, this commit makes an aligned copy via existing clone
functionality in getTensorContent. The unsafe copy=false option is
removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact
writes(!) to the Protobuf message. We limit ourselves to fixing the
alignment issues in this commit, and add getTensorContentRefUnaligned
to cover the write case with a safe memcpy. A FIXME marks the issue.
* dnn: reduce amount of .clone() calls
* dnn: update FIXME comment
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2021-10-06 16:41:05 +00:00
Alexander Alekhin
646924fce8
dnn(pytest/test_input_3d): reload model between switching targets
2021-10-05 23:23:08 +00:00
Alexander Alekhin
ebef84e9ea
pre: OpenCV 3.4.16 (version++)
2021-10-04 20:47:07 +00:00
Alexander Alekhin
f977d10a19
dnn(ocl): fix conv DWCONV workgroup
2021-10-01 18:52:07 +00:00
Alexander Alekhin
846317ef37
dnn(ocl): fix conv BASIC workgroup
2021-09-29 14:55:46 +00:00
SamFC10
9c5d7716e2
fix for unsqueeze opset version 13
2021-09-17 17:40:57 +05:30
Alexander Alekhin
46fd26e366
Merge pull request #20699 from alalek:dnn_perf_update_convolution_tests
2021-09-16 17:11:32 +00:00
rogday
c410d7a97d
Merge pull request #20671 from rogday:yolov4x-mish
...
Add support for YOLOv4x-mish
* backport to 3.4 for supporting yolov4x-mish
* add YOLOv4x-mish test
* address review comments
Co-authored-by: Guo Xu <guoxu@1school.com.cn>
2021-09-14 17:49:49 +00:00
Alexander Alekhin
6e66a9222a
dnn(onnx): fix format specifier
2021-09-11 22:26:52 +00:00
Zihao Mu
51b03b87e6
BiasAdd could load Const from second place.
2021-09-11 15:34:41 +00:00
Alexander Alekhin
1aacb9bb15
dnn(perf): update convolution tests
2021-09-10 13:11:02 +00:00
Alexander Alekhin
6ace801418
Merge pull request #20661 from alalek:dnn_ocl_fix_gemm_like_kernel
2021-09-10 11:58:52 +00:00
rogday
d31b93b513
Merge pull request #20674 from rogday:prelu_slope
...
Fix PReLU negative slope access pattern
* fix prelu negative slope access pattern
* change begin() to ptr()
2021-09-10 11:07:16 +00:00
rogday
4807cd8a6e
Merge pull request #20605 from rogday:split_slice_shenanigans
...
Add Normalize subgraph, fix Slice, Mul and Expand
* Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze
* remove todos
* remove range-based for loop
* address review comments
* change >> to > > in template
* fix indexation
* fix expand that does nothing
2021-09-09 14:41:40 +03:00
Alexander Alekhin
35e824c287
dnn(ocl): fix out of bound access in GEMM-like kernels
...
- dropped usage of CreateSubBuffer() - buffers lifetime management issue
- fixed elementwise offset
- avoid out of bounds read access
2021-09-06 18:17:21 +00:00
Alexander Alekhin
5578ad5e14
dnn(ocl): fix automatic globalsize adjusting
...
- if kernel code doesn't support that
2021-09-06 03:11:29 +00:00
Alexander Alekhin
0a43b23275
Merge pull request #20651 from alalek:issue_18361
2021-09-04 18:22:12 +00:00
Alexander Alekhin
7967683296
Merge pull request #20648 from alalek:issue_20615
2021-09-04 18:21:58 +00:00
Alexander Alekhin
5b2c016834
dnn(ocl): avoid out of buffer access in copyWeightsSwizzled
2021-09-04 15:45:59 +00:00
Alexander Alekhin
407adc7061
dnn(ocl): fix buffer offsets in IDLF kernel
...
- drop CreateSubBuffer
- fix FUSED_CONV_ELTWISE mode
2021-09-04 15:28:35 +00:00
rogday
d0e612dc36
Merge pull request #20647 from rogday:resize_concat_optimization
...
Fix resize+concat optimization
* fix resize+concat optimization
* add comment and fix indentation
2021-09-03 12:32:29 +00:00
Alexander Alekhin
060a76dc3e
Merge pull request #20573 from rogday:onnx_scale_fix
2021-09-01 14:09:17 +00:00
WJJ1995
edc442afdb
Merge pull request #20511 from wjj19950828:add_humanseg_support_0806
...
* support PPSeg model for dnn module
* fixed README for CI
* add test case
* fixed bug
* deal with comments
* rm dnn_model_runner
* update test case
* fixed bug for testcase
* update testcase
2021-09-01 10:10:05 +00:00
Alexander Alekhin
ae6fabc6fe
dnn(ocl): drop CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE check
...
- it is a hint and it should not block kernel execution
2021-08-30 20:40:14 +00:00
Vincent Rabaud
38d0063c36
Do not use deprecated ReleaseCleared in protobuf library.
...
This is to make code work with protobuf arenas for memory
management (ReleaseCleared is incompatible).
The cleaning of the memory is also simpler.
2021-08-26 15:36:22 +02:00
Alexander Alekhin
77a5c43d50
Merge pull request #20586 from alalek:issue_20585
2021-08-21 17:22:58 +00:00
Alexander Alekhin
f28e4b86fb
dnn(ocl): fix top initialization in verifyResult
2021-08-21 16:04:13 +00:00
Alexander Alekhin
a9817e9127
Merge pull request #20556 from rogday:onnx_split_sum_fix
2021-08-20 08:10:18 +00:00
Vincent Rabaud
9cfa84313c
Use the one argument version of SetTotalBytesLimit.
...
The two argument versions has been deprecated, cf
https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.io.coded_stream
2021-08-19 14:31:29 +02:00
Smirnov Egor
fe625a558e
fix hasDynamicShapes for batch_size and fix axis selection in Scale layer
2021-08-18 19:22:24 +03:00
Smirnov Egor
9ef41f68fb
fix Split partial sum
2021-08-16 15:44:54 +03:00
Julia Bareeva
cfb36443fb
Merge pull request #20506 from JulieBar:lstm_activations
...
* Support activations(Sigmoid, Tanh) for LSTM
* fix warning
2021-08-13 15:41:00 +03:00
Alexander Alekhin
9d3826c676
Merge pull request #20525 from SamFC10:fix-prior-variances
2021-08-13 10:06:55 +00:00
Smirnov Egor
739ff84732
add Max layer to TFImporter
2021-08-09 14:01:51 +03:00
SamFC10
2a177052de
fix bug in prior-box variances
2021-08-09 12:08:55 +05:30
Julia Bareeva
633fedaa96
Merge pull request #20480 from JulieBar:lstm_pytest
...
Add Python's test for LSTM layer
* Add Python's test for LSTM layer
* Set different test threshold for FP16 target
* rename test to test_input_3d
Co-authored-by: Julie Bareeva <julia.bareeva@xperience.ai>
2021-08-05 18:13:17 +03:00
rogday
cff0168f3a
Merge pull request #20453 from rogday:onnx_importer_fix
...
Split layer dispatch into functions in ONNXImporter
* split layer dispatch into functions
* fixes
* identation and comment fixes
* fix constness
2021-07-28 18:06:24 +03:00
Julia Bareeva
4e5699fa71
Merge pull request #20450 from JulieBar:lstm_inside
...
Support non-zero hidden state for LSTM
* fully support non-zero hidden state for LSTM
* check dims of hidden state for LSTM
* fix failed test Test_Model.TextRecognition
* add new tests for LSTM w/ non-zero hidden params
Co-authored-by: Julie Bareeva <julia.bareeva@xperience.ai>
2021-07-23 17:11:50 +03:00
Smirnov Egor
024b43ca06
implement asymmetric padding for conv2d, max_pool and conv2d_backprop_input
2021-07-22 16:58:40 +03:00
Alexander Alekhin
8334ee18e6
Merge pull request #20394 from SamFC10:conv-asymmetric-pads
2021-07-16 10:33:42 +00:00
SamFC10
96d35f7c54
Fix convolution asymmetric padding bug in onnx importer
2021-07-16 09:39:41 +05:30
Alexander Alekhin
fbde0c6c96
dnn(ie): fix handling of 1D and non-32F outputs of InferenceEngine
2021-07-15 21:47:05 +00:00
Alexander Alekhin
602e7c83e2
dnn(test): add extra IR models, more checks in IE testing code
2021-07-15 21:47:05 +00:00
Alexander Alekhin
bc210b292b
dnn(test): backport test_ie_models.cpp from 4.5.3
2021-07-15 00:05:02 +00:00
César Gouveia
167a12028d
Merge pull request #20374 from cesarpgouveia:bugfix/fix_load_onnxModel_debug
...
* Fix bug while loading onnx model in debug
* dnn: fix other .at using
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2021-07-09 18:21:56 +00:00
Alexander Alekhin
0f2f966a91
Merge pull request #20345 from mitruska:update_ngraph_normalizel2
2021-07-02 23:37:03 +00:00
mitruska
18dbac203f
Use explicit version of ngraph NormalizeL2
2021-07-02 21:33:05 +00:00
Alexander Alekhin
8d1f254dcc
java: force using of 'Ptr<>' for OpenCV classes
...
backport of commit: e5841d3126
2021-07-02 21:20:08 +00:00
Alexander Alekhin
b699fe7a9d
Merge pull request #20335 from SamFC10:concat-const-input
2021-07-01 10:25:35 +00:00
SamFC10
5b8c10f2f8
modified onnx importer to concat const input blobs
2021-07-01 10:58:31 +05:30
Alexander Alekhin
24983f62e2
Merge pull request #20325 from alalek:dnn_openvino_2021.4.0
2021-06-30 23:58:26 +00:00
Alexander Alekhin
f2057ce1ab
dnn(ie): replace deprecated calls
2021-06-30 22:30:15 +00:00
Alexander Alekhin
6797fd65a5
dnn(test): update tests for OpenVINO 2021.4
2021-06-30 22:30:15 +00:00
Alexander Alekhin
7d842f5bcf
dnn: use OpenVINO 2021.4 defines
2021-06-29 18:48:21 +00:00
SamFC10
55e1dfb778
Fix BatchNorm reinitialization
2021-06-20 13:19:29 +05:30
Alexander Alekhin
c1adbe3189
Merge pull request #20190 from rogday:tf_importer_ref
2021-06-11 20:06:09 +00:00
rogday
7ee1816612
split if into map of functions
2021-06-11 13:20:45 +03:00
Alexander Alekhin
43940f7ffc
pre: OpenCV 3.4.15 (version++)
2021-06-07 20:10:34 +00:00
Paul Jurczak
ff60abb575
Merge pull request #20080 from pauljurczak:patch-3
...
* Update dnn.hpp
getPerfProfile is not supported by the CUDA backend, see https://github.com/opencv/opencv/issues/20077
* dnn.hpp: fix doxygen formatting
2021-06-02 19:15:52 +00:00
Alexander Alekhin
71bae7c23f
dnn(ie): implicit usage of IE::GPU OpenCL kernels cache
2021-04-29 12:43:22 +03:00
Suleyman TURKMEN
159534313e
Update CMakeLists.txt
2021-04-26 22:43:04 +03:00
Aleksandr Voron
2e143b8799
Merge pull request #19961 from alvoron:dnn_ngraph_int64_fix
...
Explicit usage of int64_t in CropAndResizeLayer (IE backend)
* Update crop_and_resize_layer.cpp
2021-04-21 18:29:19 +00:00
Anastasia Murzova
cc6d48959e
Added reduce sum by channel support
2021-03-30 23:01:22 +03:00
Vitaly Tuzov
aab62aa6dd
Merge pull request #18952 from terfendail:wui_doc
...
* Updated UI documentation to address WUI
* Added documentation for vx_ calls
* Removed vx_store operation overload
* Doxyfile updated to enable wide UI
* Enable doxygen documentation for vx_ WUI functions
* Wide intrinsics definition rework
* core: fix SIMD C++ emulator build (supports 128-bit only)
2021-03-30 16:18:03 +00:00
Alexander Alekhin
d27eb79fa6
Merge pull request #19785 from alalek:dnn_ocl_fix_async_kernels
2021-03-26 12:27:58 +00:00
Anastasia M
3e48a91d97
Merge pull request #19546 from LupusSanctus:am/slice_steps
...
* Added Steps support in DNN Slice layer
* Added code corrections
* dnn(slice): fix OCL and OCL_FP16 processing
2021-03-26 11:04:57 +00:00
Alexander Alekhin
86d0a86141
dnn(ocl): fix gemm kernel scheduling
2021-03-26 00:35:00 +00:00
Alexander Alekhin
26ea4760ad
Merge pull request #19774 from aarongreig:aaron/dnn/oclTestAccuracyThresholds
2021-03-25 16:58:07 +00:00
Aaron Greig
f59917bea1
Introduce relaxed accuracy thresholds for CL target in some dnn tests.
...
Partially addresses #9821
2021-03-25 10:58:23 +00:00
Alexander Alekhin
e56e4876e7
dnn(test): update tests for OpenVINO 2021.3
2021-03-24 14:50:42 +00:00
Alexander Alekhin
56bdd7db5c
dnn: use OpenVINO 2021.3 defines
...
original commit: 6291503793
2021-03-24 10:26:24 +00:00
Anastasia Murzova
e75f1b071b
Added reshape corrections
2021-03-24 10:53:11 +03:00
Anastasia Murzova
7a2b3ed471
Corrected DNN elementwise multiplication
2021-03-24 10:53:11 +03:00
Anastasia M
551d4a8ec1
Merge pull request #19477 from LupusSanctus:am/eltwice_vec
...
* Aligned OpenCV DNN and TF sum op behaviour
Support Mat (shape: [1, m, k, n] ) + Vec (shape: [1, 1, 1, n]) operation
by vec to mat expansion
* Added code corrections: backend, minor refactoring
2021-03-23 22:16:09 +00:00
Liubov Batanina
c0dd82fb53
Merge pull request #19632 from l-bat:lb/ie_arm_target
...
Added OpenVINO ARM target
* Added IE ARM target
* Added OpenVINO ARM target
* Delete ARM target
* Detect ARM platform
* Changed device name in ArmPlugin
* Change ARM detection
2021-03-20 11:20:02 +00:00
Liubov Batanina
8d29a902e4
Added ngraph::op::v6::MVN
2021-03-12 21:02:03 +03:00
Liubov Batanina
95ab9468c1
Added ngraph::op::v4::Interpolation
2021-03-12 12:00:59 +03:00
Alexander Alekhin
e4692ac079
Merge pull request #19613 from WeiChungChang:NMS_refine
2021-03-10 17:36:57 +00:00
Qoo
47337e2196
boost NMS performance
2021-03-10 15:59:26 +00:00
Alexander Alekhin
dc24663e8d
Merge pull request #19641 from LupusSanctus:am/tf_reshape
2021-03-06 17:21:34 +00:00
Alexander Alekhin
358878caf7
Merge pull request #19671 from SamFC10:sam-darknet
2021-03-05 15:12:21 +00:00
Anastasia Murzova
7894cd3c73
Aligned TF Reshape layer behaviour
2021-03-05 01:01:37 +03:00
Liubov Batanina
94533e12eb
Determine layout
2021-03-04 13:05:01 +03:00
SamFC10
a42d4da003
Added Spatial Attention Module in Darknet Importer
2021-03-03 22:42:47 +05:30
Alexander Alekhin
a123c48d4d
pre: OpenCV 3.4.14 (version++)
2021-03-02 20:47:29 +00:00
SamFC10
96947c30c0
Added exp layer
...
backport of commit: 6111935835
partial backport of commit: dd5976162b
2021-02-28 19:59:40 +00:00
Maksim Shabunin
b2321576bc
Fixed several issues found by static analysis
2021-02-25 15:44:15 +03:00
WeiChungChang
d4d12164aa
Merge pull request #19529 from WeiChungChang:3.4
...
* improve map allocation check
* fix accoring to CR
2021-02-23 21:09:55 +00:00
Alexander Alekhin
86cb435adf
dnn: fix build (nullptr issue)
2021-02-23 03:31:29 +00:00
Alexander Alekhin
513f5a8869
Merge pull request #19536 from WeiChungChang:ReduceMemory
2021-02-22 20:04:11 +00:00
WeiChungChang
f6bc4fd4c6
Merge pull request #19552 from WeiChungChang:partialSort
...
apply partial sort to save computations
* apply partial sort
* fix typo
* fix accroding to CR
2021-02-22 19:44:00 +03:00
Alexander Alekhin
8d21c84e7d
Merge pull request #19474 from aryanshomray:darknet_tanh
2021-02-17 17:10:44 +00:00
Aryansh Omray
601851cc7e
TanH darknet and test
2021-02-17 18:49:52 +05:30
Qoo
bc8727786b
save allocation of memory for fake image
2021-02-15 20:14:07 -05:00
Qoo
c909693d45
refine
2021-02-12 20:53:09 -05:00
Alexander Alekhin
4f08bb5046
Merge pull request #19428 from alalek:dnn_drop_misbehaved_clamp
2021-02-04 11:57:19 +00:00
Jebastin Nadar
09d2ca17cf
Merge pull request #19449 from SamFC10:tf-leakyrelu
...
* add LeakyRelu to tf-importer
* fix whitespace error
* added test for leakyrelu
2021-02-04 08:50:08 +00:00
Alexander Alekhin
83aa711346
dnn: rename clamp() => normalize_axis()
2021-02-04 08:13:55 +00:00