Alexander Alekhin
03a08435e2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-10-07 04:27:22 +00:00
Alexander Alekhin
822d468232
Merge pull request #20813 from rogday:soft_nms
2021-10-06 20:20:34 +00:00
Smirnov Egor
2221dcc9f2
add SoftNMS implementation
2021-10-06 21:31:45 +03:00
Oliver Kuckertz
a3d7811f24
Merge pull request #20725 from mologie:fix-dnn-tf-on-arm
...
* dnn: fix unaligned memory access crash on armv7
The getTensorContent function would return a Mat pointing to some
member of a Protobuf-encoded message. Protobuf does not make any
alignment guarantees, which results in a crash on armv7 when loading
models while bit 2 is set in /proc/cpu/alignment (or the relevant
kernel feature for alignment compatibility is disabled). Any read
attempt from the previously unaligned data member would send SIGBUS.
As a workaround, this commit makes an aligned copy via the existing clone
functionality in getTensorContent. The unsafe copy=false option is
removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact
writes(!) to the Protobuf message. We limit ourselves to fixing the
alignment issues in this commit, and add getTensorContentRefUnaligned
to cover the write case with a safe memcpy. A FIXME marks the issue.
* dnn: reduce amount of .clone() calls
* dnn: update FIXME comment
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2021-10-06 16:41:05 +00:00
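The copy-before-read pattern described in a3d7811f24 can be illustrated outside OpenCV. The following is a minimal Python sketch, not the getTensorContent code: `aligned_copy` and the `message` buffer are hypothetical names, and Python itself never raises SIGBUS, so the sketch only mirrors the idea of copying a payload out of an arbitrarily aligned byte buffer before using it.

```python
import struct

def aligned_copy(raw: bytes, offset: int, count: int) -> list:
    """Copy `count` little-endian float32 values that start at `offset`."""
    # "<" selects standard sizes with no alignment requirement,
    # so reading from an odd offset is safe; the result is a fresh copy.
    return list(struct.unpack_from(f"<{count}f", raw, offset))

# One leading byte pushes the float32 payload to an odd (unaligned) offset,
# similar to a tensor embedded in a serialized message (assumed layout).
message = b"\x01" + struct.pack("<3f", 1.0, 2.5, -3.0)
values = aligned_copy(message, offset=1, count=3)
print(values)
```

The caller only ever touches the copy, which is the same trade-off the commit describes: one extra copy in exchange for never dereferencing the unaligned source directly.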
HAN Liutong
e5fb50476c
Merge pull request #20521 from hanliutong:dev-rvv-multiVLEN
...
Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics.
* Update fastGEMM for multi VLEN.
* Update fastGEMM1T for multi VLEN.
* Update fastDepthwiseConv for multi VLEN.
* Update fastConv for multi VLEN.
* Replace malloc with cv::AutoBuffer.
2021-10-05 15:35:00 +00:00
Alexander Alekhin
1b70f94282
Merge pull request #20782 from YashasSamaga:cuda4dnn-eltwise-broadcast
2021-10-04 22:35:00 +00:00
Jebastin Nadar
cce78cc5e2
Merge pull request #20535 from SamFC10:onnx-q
...
dnn : int8 quantized layers support in onnx importer
* added quantized layers support in onnx importer
* added more cases in eltwise node, some more checks
* added tests for quantized nodes
* relax thresholds for failed tests, address review comments
* refactoring based on review comments
* added support for unsupported cases and pre-quantized resnet50 test
* relax thresholds due to int8 resize layer
2021-10-04 18:07:38 +00:00
Zihao Mu
9085b933d8
Merge pull request #20702 from zihaomu:tf_expand_dim_layer
...
Add ExpandDims layer of tf_importer.cpp
* Add ExpandDims to tf_importer.
* add -1 expand test case.
* Support different dimensions of input.
* Compatible with 5-dimensional NDHWC data
* Code align
* support 3-dim input.
* 3-dim bug fixed.
* fixing error of code format.
2021-10-04 16:37:38 +00:00
YashasSamaga
505dde09de
support broadcasting in eltwise ops
2021-10-04 12:38:45 +05:30
SamFC10
87ebf2e50b
fix illegal memory access in int8 convolution
2021-10-03 15:16:01 +05:30
Alexander Alekhin
37c3f0d8a0
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-10-02 17:57:18 +00:00
Alexander Alekhin
f977d10a19
dnn(ocl): fix conv DWCONV workgroup
2021-10-01 18:52:07 +00:00
Alexander Alekhin
846317ef37
dnn(ocl): fix conv BASIC workgroup
2021-09-29 14:55:46 +00:00
Alexander Alekhin
24fcb7f813
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-09-25 17:50:00 +00:00
rogday
38b9ec7a18
Merge pull request #20682 from rogday:min
...
* Add Min layer to CPU, OpenCL, Halide, Inference Engine, NGraph and CUDA
* fix indentation
* add min to fusion and halide tests; fix doc
2021-09-22 15:17:37 +03:00
SamFC10
9c5d7716e2
fix for unsqueeze opset version 13
2021-09-17 17:40:57 +05:30
rogday
c410d7a97d
Merge pull request #20671 from rogday:yolov4x-mish
...
Add support for YOLOv4x-mish
* backport to 3.4 for supporting yolov4x-mish
* add YOLOv4x-mish test
* address review comments
Co-authored-by: Guo Xu <guoxu@1school.com.cn>
2021-09-14 17:49:49 +00:00
YashasSamaga
50462dcdc6
fix effrank assert to allow input effrank <= output effrank
2021-09-13 20:44:33 +05:30
Alexander Alekhin
6e66a9222a
dnn(onnx): fix format specifier
2021-09-11 22:26:52 +00:00
Alexander Alekhin
c3ac834526
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-09-11 21:27:26 +00:00
Zihao Mu
51b03b87e6
BiasAdd could load Const from second place.
2021-09-11 15:34:41 +00:00
Alexander Alekhin
6ace801418
Merge pull request #20661 from alalek:dnn_ocl_fix_gemm_like_kernel
2021-09-10 11:58:52 +00:00
rogday
d31b93b513
Merge pull request #20674 from rogday:prelu_slope
...
Fix PReLU negative slope access pattern
* fix prelu negative slope access pattern
* change begin() to ptr()
2021-09-10 11:07:16 +00:00
rogday
4807cd8a6e
Merge pull request #20605 from rogday:split_slice_shenanigans
...
Add Normalize subgraph, fix Slice, Mul and Expand
* Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze
* remove todos
* remove range-based for loop
* address review comments
* change >> to > > in template
* fix indexation
* fix expand that does nothing
2021-09-09 14:41:40 +03:00
Alexander Alekhin
35e824c287
dnn(ocl): fix out of bound access in GEMM-like kernels
...
- dropped usage of CreateSubBuffer() - buffers lifetime management issue
- fixed elementwise offset
- avoid out of bounds read access
2021-09-06 18:17:21 +00:00
Alexander Alekhin
5578ad5e14
dnn(ocl): fix automatic globalsize adjusting
...
- if kernel code doesn't support that
2021-09-06 03:11:29 +00:00
Alexander Alekhin
0a43b23275
Merge pull request #20651 from alalek:issue_18361
2021-09-04 18:22:12 +00:00
Alexander Alekhin
7967683296
Merge pull request #20648 from alalek:issue_20615
2021-09-04 18:21:58 +00:00
Alexander Alekhin
5b2c016834
dnn(ocl): avoid out of buffer access in copyWeightsSwizzled
2021-09-04 15:45:59 +00:00
Alexander Alekhin
407adc7061
dnn(ocl): fix buffer offsets in IDLF kernel
...
- drop CreateSubBuffer
- fix FUSED_CONV_ELTWISE mode
2021-09-04 15:28:35 +00:00
rogday
d0e612dc36
Merge pull request #20647 from rogday:resize_concat_optimization
...
Fix resize+concat optimization
* fix resize+concat optimization
* add comment and fix indentation
2021-09-03 12:32:29 +00:00
Alexander Alekhin
5aa7435d25
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-09-02 15:24:04 +00:00
Alexander Alekhin
060a76dc3e
Merge pull request #20573 from rogday:onnx_scale_fix
2021-09-01 14:09:17 +00:00
WJJ1995
edc442afdb
Merge pull request #20511 from wjj19950828:add_humanseg_support_0806
...
* support PPSeg model for dnn module
* fixed README for CI
* add test case
* fixed bug
* deal with comments
* rm dnn_model_runner
* update test case
* fixed bug for testcase
* update testcase
2021-09-01 10:10:05 +00:00
Alexander Alekhin
ae6fabc6fe
dnn(ocl): drop CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE check
...
- it is a hint and it should not block kernel execution
2021-08-30 20:40:14 +00:00
Alexander Alekhin
4c05a697fa
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-08-28 21:30:28 +00:00
Vincent Rabaud
38d0063c36
Do not use deprecated ReleaseCleared in protobuf library.
...
This is to make code work with protobuf arenas for memory
management (ReleaseCleared is incompatible).
The cleaning of the memory is also simpler.
2021-08-26 15:36:22 +02:00
Alexander Alekhin
6fbfc58602
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-08-21 17:25:18 +00:00
Alexander Alekhin
77a5c43d50
Merge pull request #20586 from alalek:issue_20585
2021-08-21 17:22:58 +00:00
Alexander Alekhin
f28e4b86fb
dnn(ocl): fix top initialization in verifyResult
2021-08-21 16:04:13 +00:00
rogday
6801dd043d
Merge pull request #20494 from rogday:onnx_diagnostic_fix
...
fix ONNXImporter diagnostic mode layer registration issue
* fix layer registration, thread unsafe access and align the behavior of DNN_DIAGNOSTICS_RUN between onnx and tf importers
* move skipModelInput
* print all missing layers
* address TF issue
2021-08-20 14:43:47 +00:00
Alexander Alekhin
a9817e9127
Merge pull request #20556 from rogday:onnx_split_sum_fix
2021-08-20 08:10:18 +00:00
Vincent Rabaud
9cfa84313c
Use the one argument version of SetTotalBytesLimit.
...
The two-argument version has been deprecated, cf.
https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.io.coded_stream
2021-08-19 14:31:29 +02:00
SamFC10
fa90e14b06
int8 layers and 8-bit quantization support
2021-08-19 09:56:47 +05:30
Smirnov Egor
fe625a558e
fix hasDynamicShapes for batch_size and fix axis selection in Scale layer
2021-08-18 19:22:24 +03:00
thezane
210bfaf8d6
Merge pull request #20483 from thezane:support-cumsum-layer-for-onnx
...
* Support cumsum layer for onnx
* Add unit tests
* Address review comments
2021-08-17 20:09:25 +03:00
Smirnov Egor
9ef41f68fb
fix Split partial sum
2021-08-16 15:44:54 +03:00
Alexander Alekhin
05d733e707
Merge pull request #20524 from yichenj:dnn_text_recognition_enhance
2021-08-15 12:30:25 +00:00
Alexander Alekhin
0c01cf7c85
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-08-14 18:24:00 +00:00
Julia Bareeva
cfb36443fb
Merge pull request #20506 from JulieBar:lstm_activations
...
* Support activations(Sigmoid, Tanh) for LSTM
* fix warning
2021-08-13 15:41:00 +03:00
Alexander Alekhin
9d3826c676
Merge pull request #20525 from SamFC10:fix-prior-variances
2021-08-13 10:06:55 +00:00
JIANG Yichen
955cf35d5f
Implement ctc prefix beam search decode for TextRecognitionModel.
...
The algorithm is based on Hannun's paper: First-Pass Large Vocabulary
Continuous Speech Recognition using Bi-Directional Recurrent DNNs
2021-08-12 20:33:31 +08:00
HAN Liutong
aaca4987c9
Merge pull request #20287 from hanliutong:dev-rvv-0.10
...
Optimization of DNN using native RISC-V vector intrinsics.
* Use RVV to optimize fastGEMM (FP32) in DNN.
* Use RVV to optimize fastGEMM1T in DNN.
* Use RVV to optimize fastConv in DNN.
* Use RVV to optimize fastDepthwiseConv in DNN.
* Vectorize tails using vl.
* Use "vl" instead of scalar to handle small block in fastConv.
* Fix memory access out of bound in "fastGEMM1T".
* Remove setvl.
* Remove useless initialization.
* Use loop unrolling to handle tail part instead of switch.
2021-08-11 01:16:03 +03:00
Smirnov Egor
739ff84732
add Max layer to TFImporter
2021-08-09 14:01:51 +03:00
SamFC10
2a177052de
fix bug in prior-box variances
2021-08-09 12:08:55 +05:30
Julia Bareeva
e1cafa3834
Merge pull request #20442 from JulieBar:gru_layer
...
* Add initialization and inference for GRU layer
* fix issues found on review
2021-08-07 10:07:37 +03:00
Alexander Alekhin
907743eee7
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-30 14:50:36 +00:00
Smirnov Egor
27392f832d
reimplement onnx refactor for master
2021-07-30 13:00:13 +03:00
rogday
cff0168f3a
Merge pull request #20453 from rogday:onnx_importer_fix
...
Split layer dispatch into functions in ONNXImporter
* split layer dispatch into functions
* fixes
* indentation and comment fixes
* fix constness
2021-07-28 18:06:24 +03:00
Alexander Alekhin
f4d6a3ec4e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-24 15:34:58 +00:00
Julia Bareeva
4e5699fa71
Merge pull request #20450 from JulieBar:lstm_inside
...
Support non-zero hidden state for LSTM
* fully support non-zero hidden state for LSTM
* check dims of hidden state for LSTM
* fix failed test Test_Model.TextRecognition
* add new tests for LSTM w/ non-zero hidden params
Co-authored-by: Julie Bareeva <julia.bareeva@xperience.ai>
2021-07-23 17:11:50 +03:00
Smirnov Egor
024b43ca06
implement asymmetric padding for conv2d, max_pool and conv2d_backprop_input
2021-07-22 16:58:40 +03:00
Alexander Alekhin
b61a55eebf
Merge pull request #20402 from rogday:tf_diag_dummy
2021-07-16 15:44:29 +00:00
Smirnov Egor
c30078c5a3
add NotImplemented layer
2021-07-16 15:39:54 +03:00
Alexander Alekhin
39b91c97f0
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-16 10:35:42 +00:00
Alexander Alekhin
8334ee18e6
Merge pull request #20394 from SamFC10:conv-asymmetric-pads
2021-07-16 10:33:42 +00:00
SamFC10
96d35f7c54
Fix convolution asymmetric padding bug in onnx importer
2021-07-16 09:39:41 +05:30
Alexander Alekhin
fbde0c6c96
dnn(ie): fix handling of 1D and non-32F outputs of InferenceEngine
2021-07-15 21:47:05 +00:00
Alexander Alekhin
9e42e04b4a
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-10 13:01:03 +00:00
César Gouveia
167a12028d
Merge pull request #20374 from cesarpgouveia:bugfix/fix_load_onnxModel_debug
...
* Fix bug while loading onnx model in debug
* dnn: fix other .at using
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
2021-07-09 18:21:56 +00:00
Alexander Alekhin
821fae0d94
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-03 00:30:58 +00:00
mitruska
18dbac203f
Use explicit version of ngraph NormalizeL2
2021-07-02 21:33:05 +00:00
Alexander Alekhin
8fad85edda
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-07-01 10:52:31 +00:00
Alexander Alekhin
b699fe7a9d
Merge pull request #20335 from SamFC10:concat-const-input
2021-07-01 10:25:35 +00:00
SamFC10
5b8c10f2f8
modified onnx importer to concat const input blobs
2021-07-01 10:58:31 +05:30
Alexander Alekhin
24983f62e2
Merge pull request #20325 from alalek:dnn_openvino_2021.4.0
2021-06-30 23:58:26 +00:00
Alexander Alekhin
f2057ce1ab
dnn(ie): replace deprecated calls
2021-06-30 22:30:15 +00:00
Alexander Alekhin
7d842f5bcf
dnn: use OpenVINO 2021.4 defines
2021-06-29 18:48:21 +00:00
Smirnov Egor
dc5199feea
skipping missing layers and layer failures
2021-06-25 11:26:37 +03:00
SamFC10
55e1dfb778
Fix BatchNorm reinitialization
2021-06-20 13:19:29 +05:30
Alexander Alekhin
735a79ae83
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-06-19 18:44:16 +00:00
rogday
7ee1816612
split if into map of functions
2021-06-11 13:20:45 +03:00
YashasSamaga
32df5faa25
add MatMulOp
2021-05-22 01:01:29 +05:30
Alexander Alekhin
170bf6d7af
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-05-01 09:44:24 +00:00
Alexander Alekhin
71bae7c23f
dnn(ie): implicit usage of IE::GPU OpenCL kernels cache
2021-04-29 12:43:22 +03:00
Aleksandr Voron
2e143b8799
Merge pull request #19961 from alvoron:dnn_ngraph_int64_fix
...
Explicit usage of int64_t in CropAndResizeLayer (IE backend)
* Update crop_and_resize_layer.cpp
2021-04-21 18:29:19 +00:00
Alexander Alekhin
3e1673e8b2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-04-01 09:54:57 +00:00
Anastasia Murzova
cc6d48959e
Added reduce sum by channel support
2021-03-30 23:01:22 +03:00
Vitaly Tuzov
aab62aa6dd
Merge pull request #18952 from terfendail:wui_doc
...
* Updated UI documentation to address WUI
* Added documentation for vx_ calls
* Removed vx_store operation overload
* Doxyfile updated to enable wide UI
* Enable doxygen documentation for vx_ WUI functions
* Wide intrinsics definition rework
* core: fix SIMD C++ emulator build (supports 128-bit only)
2021-03-30 16:18:03 +00:00
Alexander Alekhin
c89084e6b7
Merge pull request #19223 from YashasSamaga:cuda4dnn-halfpix-linear-resize
2021-03-30 13:19:41 +00:00
Anastasia M
e08de1101d
Merge pull request #19693 from LupusSanctus:onnx_diagnostic
...
ONNX diagnostic tool
* Final
* Add forgotten Normalize layer to the set of supported types
* ONNX diagnostic tool corrections
* Fixed CI test warnings
* Added code minor corrections
Co-authored-by: Sergey Slashchinin <sergei.slashchinin@xperience.ai>
2021-03-29 16:38:28 +00:00
Alexander Alekhin
35eaacd1db
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-03-27 15:35:16 +00:00
Alexander Alekhin
d27eb79fa6
Merge pull request #19785 from alalek:dnn_ocl_fix_async_kernels
2021-03-26 12:27:58 +00:00
Anastasia M
3e48a91d97
Merge pull request #19546 from LupusSanctus:am/slice_steps
...
* Added Steps support in DNN Slice layer
* Added code corrections
* dnn(slice): fix OCL and OCL_FP16 processing
2021-03-26 11:04:57 +00:00
Alexander Alekhin
86d0a86141
dnn(ocl): fix gemm kernel scheduling
2021-03-26 00:35:00 +00:00
Alexander Alekhin
b62d015285
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-03-24 18:58:46 +00:00
Alexander Alekhin
56bdd7db5c
dnn: use OpenVINO 2021.3 defines
...
original commit: 6291503793
2021-03-24 10:26:24 +00:00
Anastasia Murzova
e75f1b071b
Added reshape corrections
2021-03-24 10:53:11 +03:00
Anastasia Murzova
7a2b3ed471
Corrected DNN elementwise multiplication
2021-03-24 10:53:11 +03:00
Anastasia M
551d4a8ec1
Merge pull request #19477 from LupusSanctus:am/eltwice_vec
...
* Aligned OpenCV DNN and TF sum op behaviour
Support Mat (shape: [1, m, k, n] ) + Vec (shape: [1, 1, 1, n]) operation
by vec to mat expansion
* Added code corrections: backend, minor refactoring
2021-03-23 22:16:09 +00:00
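The Mat + Vec case from 551d4a8ec1 (shape [1, m, k, n] plus shape [1, 1, 1, n]) amounts to repeating the vector along the last axis of every row. A rough Python sketch using nested lists follows; `add_vec_to_mat` is a hypothetical helper for illustration and says nothing about how the DNN layer or its backends implement the expansion.

```python
def add_vec_to_mat(mat, vec):
    """Add vec (shape [1, 1, 1, n]) to mat (shape [1, m, k, n])."""
    row = vec[0][0][0]  # the n values that are reused for every row of mat
    return [
        [[[x + v for x, v in zip(inner, row)] for inner in plane]
         for plane in batch]
        for batch in mat
    ]

# mat has shape [1, 2, 2, 3], vec has shape [1, 1, 1, 3]
mat = [[[[1, 2, 3], [4, 5, 6]], [[7, 8, 9], [10, 11, 12]]]]
vec = [[[[10, 20, 30]]]]
total = add_vec_to_mat(mat, vec)
print(total)
```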
Alexander Alekhin
ca8c3dd9b5
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-03-22 12:05:23 +00:00
Liubov Batanina
c0dd82fb53
Merge pull request #19632 from l-bat:lb/ie_arm_target
...
Added OpenVINO ARM target
* Added IE ARM target
* Added OpenVINO ARM target
* Delete ARM target
* Detect ARM platform
* Changed device name in ArmPlugin
* Change ARM detection
2021-03-20 11:20:02 +00:00
Alexander Alekhin
b19f860384
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-03-13 13:02:18 +00:00
Liubov Batanina
8d29a902e4
Added ngraph::op::v6::MVN
2021-03-12 21:02:03 +03:00
Liubov Batanina
95ab9468c1
Added ngraph::op::v4::Interpolation
2021-03-12 12:00:59 +03:00
Alexander Alekhin
fbb38cc245
Merge pull request #19222 from YashasSamaga:cuda4dnn-fix-build-diagnostics
2021-03-10 17:40:36 +00:00
Alexander Alekhin
e4692ac079
Merge pull request #19613 from WeiChungChang:NMS_refine
2021-03-10 17:36:57 +00:00
Qoo
47337e2196
boost NMS performance
2021-03-10 15:59:26 +00:00
Alexander Alekhin
d643a90efa
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-03-06 17:31:08 +00:00
Alexander Alekhin
dc24663e8d
Merge pull request #19641 from LupusSanctus:am/tf_reshape
2021-03-06 17:21:34 +00:00
YashasSamaga
d0fe6ad109
fix checkVersions()
2021-03-06 19:03:03 +05:30
Alexander Alekhin
358878caf7
Merge pull request #19671 from SamFC10:sam-darknet
2021-03-05 15:12:21 +00:00
Anastasia Murzova
7894cd3c73
Aligned TF Reshape layer behaviour
2021-03-05 01:01:37 +03:00
Liubov Batanina
94533e12eb
Determine layout
2021-03-04 13:05:01 +03:00
SamFC10
a42d4da003
Added Spatial Attention Module in Darknet Importer
2021-03-03 22:42:47 +05:30
SamFC10
96947c30c0
Added exp layer
...
backport of commit: 6111935835
partial backport of commit: dd5976162b
2021-02-28 19:59:40 +00:00
Alexander Alekhin
e0265c67c6
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-02-27 17:41:57 +00:00
Maksim Shabunin
b2321576bc
Fixed several issues found by static analysis
2021-02-25 15:44:15 +03:00
Maksim Shabunin
dd5976162b
Fixed several issues found by static analysis
2021-02-25 15:08:39 +03:00
WeiChungChang
d4d12164aa
Merge pull request #19529 from WeiChungChang:3.4
...
* improve map allocation check
* fix according to CR
2021-02-23 21:09:55 +00:00
Alexander Alekhin
86cb435adf
dnn: fix build (nullptr issue)
2021-02-23 03:31:29 +00:00
Alexander Alekhin
513f5a8869
Merge pull request #19536 from WeiChungChang:ReduceMemory
2021-02-22 20:04:11 +00:00
WeiChungChang
f6bc4fd4c6
Merge pull request #19552 from WeiChungChang:partialSort
...
apply partial sort to save computations
* apply partial sort
* fix typo
* fix according to CR
2021-02-22 19:44:00 +03:00
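The saving behind a partial sort (f6bc4fd4c6) is that only the top k candidates need to be ordered, not the whole list. The commit message does not say where the partial sort is applied, so the following Python sketch only shows the general idea; `top_k` and the sample scores are assumptions.

```python
import heapq

def top_k(scores, k):
    """Return indices of the k highest scores, best first."""
    # heapq.nlargest orders only the k winners instead of sorting everything.
    return heapq.nlargest(k, range(len(scores)), key=lambda i: scores[i])

scores = [0.2, 0.9, 0.5, 0.7, 0.1]
best = top_k(scores, 2)
# Same answer as a full sort followed by truncation.
full = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:2]
print(best, full)
```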
Alexander Alekhin
0a6f216f83
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-02-21 20:14:09 +00:00
SamFC10
6111935835
Added exp layer
2021-02-20 22:16:00 +05:30
Alexander Alekhin
8d21c84e7d
Merge pull request #19474 from aryanshomray:darknet_tanh
2021-02-17 17:10:44 +00:00
Aryansh Omray
601851cc7e
TanH darknet and test
2021-02-17 18:49:52 +05:30
Qoo
bc8727786b
save allocation of memory for fake image
2021-02-15 20:14:07 -05:00
Qoo
c909693d45
refine
2021-02-12 20:53:09 -05:00
Ilya Churaev
8fa013309e
Merge pull request #19479 from ilyachur:remove_v0_multiply
...
* Switched to v1 Multiply
* Apply changes only for new OV
2021-02-09 09:46:28 +00:00
Alexander Alekhin
6b474c4051
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-02-06 00:44:11 +00:00
Alexander Alekhin
4f08bb5046
Merge pull request #19428 from alalek:dnn_drop_misbehaved_clamp
2021-02-04 11:57:19 +00:00
Jebastin Nadar
09d2ca17cf
Merge pull request #19449 from SamFC10:tf-leakyrelu
...
* add LeakyRelu to tf-importer
* fix whitespace error
* added test for leakyrelu
2021-02-04 08:50:08 +00:00
Alexander Alekhin
83aa711346
dnn: rename clamp() => normalize_axis()
2021-02-04 08:13:55 +00:00
Alexander Alekhin
0a86ddc36c
Merge pull request #19435 from l-bat:lb/onnx_normalize
2021-02-03 09:16:06 +00:00
Liubov Batanina
68eb54dc13
Added ONNX NormalizeL2 subgraph
2021-02-01 12:44:48 +03:00
Alexander Alekhin
b9dfffc237
Merge pull request #19057 from sl-sergei:changes_for_cuda_1d
2021-01-30 11:39:43 +00:00
Sergey Slashchinin
e2949c7d0a
Align 3.4 branch with master
2021-01-29 23:48:08 +03:00
Alexander Alekhin
e85b41f9be
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-01-25 22:42:13 +00:00
Alexander Alekhin
573812d0ff
Merge pull request #19373 from l-bat:lb/tf_matmul_shared
2021-01-25 22:08:24 +00:00
Liubov Batanina
c12930cdde
Merge pull request #19372 from l-bat:lb/onnx_pads_calc
...
* Fixed bug with constant Div
* Supported constant mul and div for inputs with different shapes
2021-01-25 22:07:50 +00:00
Liubov Batanina
38a49f92ab
Added shared weights for MatMul
2021-01-22 16:47:02 +03:00
Sergei Slashchinin
ea41f89b40
Merge pull request #19058 from sl-sergei:cuda_1d
...
Conv1D and Pool1D for CUDA backend
* CUDA-independent changes
* Add Conv1D and Pool1D for CUDA backend
* CUDA-independent changes
* Fix typo
* fix comment
* Update fix
* make changes more correct for pooling layer
* Minor fixes for review
* Split skip blocks
2021-01-21 22:16:56 +00:00
Alexander Alekhin
28c0cd8460
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-01-16 19:03:42 +00:00
Liubov Batanina
af9597f454
Added Mish ONNX subgraph
2021-01-15 14:01:48 +03:00
Alexander Alekhin
d3bc563c6e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2021-01-12 18:34:33 +00:00
YashasSamaga
8c74d7e4fa
add half pixel centers and align corners param
2020-12-27 15:05:39 +05:30
Alexander Alekhin
55f06b76f9
dnn: improve debugging of TensorFlow parsing errors
2020-12-26 15:31:30 +00:00
Alexander Alekhin
624d532000
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-12-17 21:05:34 +00:00
Alexander Alekhin
9b4adc9acb
Merge pull request #19146 from alalek:dnn_openvino_2021.2.0
2020-12-17 19:50:36 +00:00
Vincent Rabaud
8391a23600
Optimize calls to std::string::find() and friends for a single char.
...
The character literal overload is more efficient. More info at:
http://clang.llvm.org/extra/clang-tidy/checks/performance-faster-string-find.html
2020-12-17 09:39:23 +01:00
Alexander Alekhin
752cc26ad6
dnn: use OpenVINO 2021.2 defines
...
original commit: 4699d2ba0c
2020-12-16 22:22:17 +00:00
Alexander Alekhin
7631056b8a
Merge pull request #19114 from alalek:issue_18937
2020-12-15 20:47:05 +00:00
Alexander Alekhin
c240355cc6
dnn(ocl): avoid mixing up FP16/FP32 in convolution layer
2020-12-15 08:51:24 +00:00
Alexander Alekhin
4b3d2c8834
dnn(ocl): fix gemm kernels with beta=0
...
- dst is not initialized, may include NaN values
- 0*NaN produces NaN
2020-12-15 00:58:43 +00:00
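The beta=0 problem noted in 4b3d2c8834 follows from IEEE arithmetic: 0 * NaN is NaN, so an uninitialized destination must not be read at all when beta is zero. A small Python sketch of that rule follows; `gemm_row` is a hypothetical helper computing one output row, not the OpenCL kernel.

```python
import math

def gemm_row(alpha, a_row, b_cols, beta, c_row):
    """out[j] = alpha * dot(a_row, b_cols[j]) + beta * c_row[j]."""
    out = []
    for j, col in enumerate(b_cols):
        acc = alpha * sum(x * y for x, y in zip(a_row, col))
        if beta != 0:
            # Only touch the destination when it actually contributes.
            acc += beta * c_row[j]
        out.append(acc)
    return out

nan = float("nan")
naive = 0.0 * nan  # multiplying by zero does not clear a NaN
result = gemm_row(1.0, [1.0, 2.0], [[3.0, 4.0], [5.0, 6.0]], 0.0, [nan, nan])
print(math.isnan(naive), result)
```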
Alexander Alekhin
dd1494eebf
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-12-11 19:27:20 +00:00
Alexander Alekhin
23c246882e
Merge pull request #19071 from LupusSanctus:am/dnn_nearest_resize_3.4
2020-12-11 18:25:55 +00:00
Alexander Alekhin
d6a7f5e1e0
Merge pull request #19075 from alalek:dnn_fix_halide_build
2020-12-10 20:37:37 +00:00
Alexander Alekhin
d84a9484b7
dnn: fix build with Halide, skip tests with crashes
2020-12-10 18:23:24 +00:00
Anastasia Murzova
f2422ace7d
Added TF nearest neighbour resize behaviour alignment
...
Relates to OpenCV issue #18721
2020-12-10 15:53:24 +03:00
Alexander Alekhin
de385009ae
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-12-09 18:09:00 +00:00
Alexander Alekhin
8286d84fb1
Merge pull request #19061 from alalek:dnn_load_face_detector_with_external_protobuf
2020-12-09 18:03:52 +00:00
Alexander Alekhin
d7e936de5c
dnn(caffe): add DetectionOutputParameter.clip to .proto file
...
- allow loading opencv_face_detector with an external unpatched protobuf
2020-12-09 05:21:07 +00:00
Anastasia Murzova
a82c50eac2
Added TF resize_bilinear behaviour alignment
...
Fixes OpenCV issue #18721
2020-12-08 22:51:38 +03:00
berak
cf28b5e5be
dnn: add another color to Net::Impl::dump()
2020-12-07 17:58:40 +01:00
Alexander Alekhin
00f36a3149
dnn: prefer to use v_fma() instead of v_c += v_a * v_b
2020-12-05 11:51:03 +00:00
Wenqing Zhang
22d64ae08f
Merge pull request #17570 from HannibalAPE:text_det_recog_demo
...
[GSoC] High Level API and Samples for Scene Text Detection and Recognition
* APIs and samples for scene text detection and recognition
* update APIs and tutorial for Text Detection and Recognition
* API updates:
(1) put decodeType into struct Voc
(2) optimize the post-processing of DB
* sample update:
(1) add transformation into scene_text_spotting.cpp
(2) modify text_detection.cpp with API update
* update tutorial
* simplify text recognition API
update tutorial
* update impl usage in recognize() and detect()
* dnn: refactoring public API of TextRecognitionModel/TextDetectionModel
* update provided models
update opencv.bib
* dnn: adjust text rectangle angle
* remove points ordering operation in model.cpp
* update gts of DB test in test_model.cpp
* dnn: ensure to keep text rectangle angle
- avoid 90/180 degree turns
* dnn(text): use quadrangle result in TextDetectionModel API
* dnn: update Text Detection API
(1) keep points' order consistent with (bl, tl, tr, br) in unclip
(2) update contourScore with boundingRect
2020-12-03 18:47:40 +00:00
Daniel Cauchi
9d37cdaa66
Merge pull request #18891 from CowKeyMan:NMS_boxes_with_different_labels
...
Add option for NMS for boxes with different labels
* DetectionModel impl
* Add option for NMS for boxes with different labels
In the detect function declared in modules/dnn/include/opencv2/dnn/dnn.hpp and implemented in modules/dnn/src/model.cpp, Non-Maximum Suppression (NMS) is applied only to boxes with the same label.
This commit adds a flag that lets developers choose between that default behaviour and applying NMS to all boxes, regardless of label.
The flag is called nmsDifferentLabels and defaults to false, which keeps the current behaviour, so existing projects can update OpenCV without disruption.
Solves issue opencv#18832
* Change return type of set & Add default constr
* Add assertions due to default constructor
2020-12-01 13:50:24 +00:00
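The difference between per-label NMS and NMS across all labels (9d37cdaa66) can be sketched in a few lines of Python. This is an illustrative greedy NMS, not the model.cpp implementation: `iou`, `nms`, and the `across_labels` parameter are hypothetical stand-ins for the flag described in the commit, and boxes are assumed to be (x, y, width, height) tuples.

```python
def iou(a, b):
    """Intersection over union of two (x, y, w, h) boxes."""
    iw = max(0, min(a[0] + a[2], b[0] + b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[1] + a[3], b[1] + b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0

def nms(boxes, scores, labels, iou_thr, across_labels=False):
    """Greedy NMS; returns kept indices ordered by descending score."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        suppressed = False
        for k in keep:
            # Default: boxes with different labels never suppress each other.
            if not across_labels and labels[i] != labels[k]:
                continue
            if iou(boxes[i], boxes[k]) > iou_thr:
                suppressed = True
                break
        if not suppressed:
            keep.append(i)
    return keep

boxes = [(0, 0, 10, 10), (1, 1, 10, 10), (50, 50, 10, 10)]
scores = [0.9, 0.8, 0.7]
labels = [0, 1, 0]
per_label = nms(boxes, scores, labels, 0.5)
all_boxes = nms(boxes, scores, labels, 0.5, across_labels=True)
print(per_label, all_boxes)
```

With the default, the two heavily overlapping boxes both survive because their labels differ; with `across_labels=True` the lower-scoring one is dropped.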
Sergei Slashchinin
9cef41000a
Merge pull request #18973 from sl-sergei:fix_vulkan_build
...
* Fix build when HAVE_VULKAN is ON
* Fix warnings
2020-12-01 12:52:09 +00:00
Alexander Alekhin
2155296a13
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-11-27 14:08:06 +00:00
Sergei Slashchinin
f4f462c50b
Merge pull request #18862 from sl-sergei:support_pool1d
...
Support for Pool1d layer for OpenCV and OpenCL targets
* Initial version of Pool1d support
* Fix variable naming
* Fix 1d pooling for OpenCL
* Change support logic, remove unnecessary variable, split the tests
* Remove other deprecated variables
* Fix warning. Check tests
* Change support check logic
* Change support check logic, 2
2020-11-24 16:52:45 +00:00
Alexander Alekhin
0401d5920c
Merge pull request #18845 from joegeisbauer:fix_reduce_mean_index_error
2020-11-23 17:03:47 +00:00
YashasSamaga
0f8ab0557e
enable fusion tests, update thresholds and fix missed eltwise fusions
2020-11-21 17:35:20 +05:30
Alexander Alekhin
0105f8fa38
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-11-20 20:32:00 +00:00
Joe
e05c2e0f1d
Fix Reduce Mean error for MobileNets DNN
...
Fix for index error for Reduce Mean
Correct Reduce Mean indexing error
2020-11-20 11:17:02 -06:00
Julien
ac24a72e66
Merge pull request #18841 from JulienMaille:patch-2
...
Fixing dnn Resize layer for variable input size
* Fix onnx loading of resize/upsample layers for different opset
* group all DynamicResize tests
* cleaned up scales checks
* Simplify branching
2020-11-20 11:14:00 +00:00
Liubov Batanina
b86f129393
Fixed Test_Model.DetectionOutput
2020-11-19 12:11:52 +03:00
Alexander Alekhin
ce8027c6fb
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-11-17 21:56:26 +00:00
Alexander Alekhin
9acbfc6e05
Merge pull request #18711 from alalek:dnn_fix_model_public_api
2020-11-17 21:47:59 +00:00
Alexander Alekhin
4f3130f562
Merge pull request #18145 from sl-sergei:fix_17914
2020-11-17 21:46:08 +00:00
Omar Alzaibaq
a316b11aaa
Merge pull request #18220 from Omar-AE:hddl-supported
...
* added HDDL VPU support
* changed to return True in one line if any device is connected
* dnn: use releaseHDDLPlugin()
* dnn(hddl): fix conditions
2020-11-17 19:47:24 +00:00
Liubov Batanina
72d06080c6
[ONNX] Added Reduce ops for batch and channel
2020-11-17 14:45:36 +03:00
Alexander Alekhin
23baf1a75e
dnn: fix High-Level public API (cv::dnn::Model class)
...
- proxy selected Net methods only (don't derive from Net directly)
- default Model ctor is protected
2020-11-17 11:01:31 +00:00
Alexander Alekhin
025a9647af
Merge pull request #18830 from l-bat:issue_18785
2020-11-17 10:40:53 +00:00
Sergey Slashchinin
32e7ef8a3d
Add fixes and tests for different layers
2020-11-17 13:39:32 +03:00
Sergei Slashchinin
2b82f8f12c
Merge pull request #18296 from sl-sergei:fix_16783
...
Fix loading issue for Faster RCNN model from #16783
* Add a reproducer with multi-output Gather
* Fix an issue with ONNX graph simplifier
* fix build
* Move checks to correct class
* Minor changes for better code appearance
2020-11-17 09:52:08 +00:00
Liubov Batanina
3a184ae677
[ONNX] Added handler for int32 tensors
2020-11-17 10:17:06 +03:00
Alexander Alekhin
a7c150ec66
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-11-13 22:29:14 +00:00
Sergei Slashchinin
61144f935e
Merge pull request #18783 from sl-sergei:fix_conv1d
...
Add support for Conv1D on OpenCV backend
* Add support for Conv1D on OpenCV backend
* disable tests on other targets/backends
* Fix formatting
* Restore comment
* Remove unnecessary flag and fix test logic
* Fix perf test
* fix braces
* Fix indentation, assert check and remove unnecessary condition
* Remove unnecessary changes
* Add test cases for variable weights and bias
* dnn(conv): fallback on OpenCV+CPU instead of failures
* coding style
2020-11-13 22:22:10 +00:00
Alexander Alekhin
ef32d7fd16
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-11-10 20:09:18 +00:00
Alexander Alekhin
c3e7a23da5
Merge pull request #18752 from alalek:dnn_defines_openvino_2021.1.0
2020-11-08 18:09:52 +00:00
Alexander Alekhin
bed5debca6
dnn: use OpenVINO 2021.1 defines
2020-11-07 17:27:33 +00:00
Roman Kazantsev
bb5b628cce
Use explicit opset of Unsqueeze from nGraph
...
backporting commit eb24575e2c
2020-11-06 22:29:35 +00:00
Roman Kazantsev
eb24575e2c
Use explicit opset of Unsqueeze from nGraph
...
The change is needed due to removing default opset namespace for Unsqueeze
in the scope of this refactoring activity: https://github.com/openvinotoolkit/openvino/pull/2767
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
2020-11-06 09:51:40 +03:00
Alexander Alekhin
1b443219ed
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-10-09 20:09:26 +00:00
Alexander Alekhin
cdcf7e62f3
dnn(opencl): bypass unsupported fusion cases 2
2020-10-09 18:59:08 +00:00
Alexander Alekhin
718dd9f170
dnn(opencl): bypass unsupported fusion cases
2020-10-09 12:33:06 +00:00
Alexander Alekhin
39d5e14c1f
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-10-07 09:09:37 +00:00
Alexander Alekhin
a5f0fb6008
Merge pull request #18518 from alalek:backport_17993
2020-10-06 22:47:45 +00:00
Alexander Alekhin
037a72debd
Merge pull request #18517 from alalek:backport_18031
2020-10-06 19:56:49 +00:00
Maksim Doronin
36f61f3879
[IE][VPU]: Refactor vpu configs
...
backported commit: 7fe87d9a5b
2020-10-05 20:27:52 +00:00
Ilya Churaev
aa11f7d8a3
Removed get_output_as_single_output_node method
...
backported commit: 5fd3d36fe8
2020-10-05 20:24:21 +00:00
Alexander Alekhin
a00fe15abd
dnn: check for empty Net in .forward()
2020-10-05 06:23:47 +00:00
Alexander Alekhin
199687a1c5
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-10-02 15:58:45 +00:00
zhuqiang00099
a968eadbf1
fix darknet-relu bug in darknet_io.cpp
2020-10-02 06:16:38 +00:00
Alexander Alekhin
969b55036f
Merge pull request #18438 from alalek:dnn_onnx_importer_error_reporting
2020-09-29 13:49:02 +00:00
Alexander Alekhin
295afd5882
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-09-28 21:33:29 +00:00
Alexander Alekhin
c08f29c803
dnn(opencl): fix convolution kernel w/o bias with activation
2020-09-27 23:42:30 +00:00
Alexander Alekhin
e59793cc75
dnn: improve debugging of ONNX parsing errors
2020-09-27 23:04:48 +00:00
Alexander Alekhin
233030e417
core: force check for string literals are used in the message
2020-09-27 06:37:44 +00:00
Alexander Alekhin
220b37144b
Merge pull request #18395 from tomoaki0705:fixNativePow
2020-09-23 18:27:45 +00:00
Tomoaki Teshima
74c8ccb45b
fix build error of kernel on Mali
2020-09-23 21:38:12 +09:00
NesQl
3fc1487cc9
Merge pull request #18323 from liqi-c:tengine-lite-update
...
Tengine lite update
* update tengine
* Modify for arm32 build.
* format optimization
* add teng_ before some tengine api
* update graph_t to teng_graph_t
* update graph_t to teng_graph_t
* Code structure optimization
* optimization
* optimization
* remove space
* update tengine url
Co-authored-by: liqi <qli@openailab.com>
2020-09-23 09:34:29 +00:00
Julien
48ddb53332
Merge pull request #18386 from JulienMaille:patch-1
...
* Make sure there is a cuda device before getting it
* Update init.hpp
2020-09-23 09:15:02 +00:00
Alexander Alekhin
f52a2cf5e1
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-09-19 17:03:08 +00:00
Alexander Alekhin
4fa82809df
ocl: avoid rescheduling of async kernels
2020-09-18 14:53:50 +00:00
Alexander Alekhin
3e3787ecb6
Merge pull request #18360 from tomoaki0705:fixClampFailure
2020-09-18 13:10:36 +00:00
Liubov Batanina
ebb528976f
Merge pull request #18353 from l-bat:issue_18350
...
* Fixed bug in ONNX Mul op
* Replaced node
2020-09-18 13:01:14 +00:00
Tomoaki Teshima
f77c2d700f
add explicit cast for half
2020-09-18 21:04:24 +09:00
Sergei Slashchinin
fa953e4205
Merge pull request #18316 from sl-sergei:fix_18253
...
Fix loading of ONNX models with Resize operation with Opset 11 for newer versions of Pytorch
* Add reproducer for Resize operation from newer versions of Pytorch
* Fix loading of scales parameter for Resize layer
* Change check type for better diagnostic messages
2020-09-17 11:05:22 +00:00
Alexander Alekhin
4b24ddd70d
Merge pull request #18317 from sl-sergei:restored_pr_17629
2020-09-13 12:51:41 +00:00
Shubham Singh
23e71d1aa2
fixes #17187 probably
...
Added Eltwise Layer Support
2020-09-11 18:53:42 +03:00
Liubov Batanina
b542a1804c
Support global reduce ops
2020-09-09 11:56:20 +03:00
Alexander Alekhin
5c76f99fdd
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-09-05 19:37:38 +00:00
Alexander Alekhin
7e019af50d
checks and fixes from static code analyzers results (2)
2020-09-03 18:28:27 +00:00
Alexander Alekhin
fa11b98800
Merge pull request #18255 from alalek:backport_18243
2020-09-02 22:50:21 +00:00
Alexander Alekhin
64c67a93d3
Merge pull request #18246 from YashasSamaga:dnn-permute-fix-unwanted-ocl-init
2020-09-02 21:15:19 +00:00
Alexander Alekhin
1f2c83845d
backport: checks and fixes from static code analyzers results
...
original commit: 71f665bd8c
2020-09-02 19:05:47 +00:00
Alexander Alekhin
71f665bd8c
checks and fixes from static code analyzers results
2020-09-02 21:59:34 +03:00
Liubov Batanina
2349a09736
Support Reshape with zero dim
2020-09-02 11:58:43 +03:00
YashasSamaga
44bf748479
do not allocate UMat in non-OpenCL targets
2020-09-02 12:18:41 +05:30
Alexander Alekhin
ba147d2be2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-26 15:51:25 +00:00
Alexander Alekhin
792722865f
Merge pull request #17919 from zhaoyue-zephyrus:flownet2_with_anysize
2020-08-26 11:43:17 +00:00
Sergei Slashchinin
9aa401a7d0
Merge pull request #17978 from sl-sergei:fix_17516_17531
...
* Fix ONNX loading in issues opencv#17516, opencv#17531
* Add tests for Linear and Matmul layers
* Disable tests for IE versions lower than 20.4
* Skip unstable tests with OpenCL FP16 on Intel GPU
* Add correct test filtering for OpenCL FP16 tests
2020-08-26 10:15:59 +00:00
Liubov Batanina
d392b11dfb
Supported ONNX Pow op
2020-08-24 11:20:18 +03:00
Alexander Alekhin
41548a091b
Merge pull request #17993 from Maxim-Doronin:vpu/mdoronin/refactor_vpu_configs
2020-08-17 19:20:04 +00:00
Alexander Alekhin
b45273eccb
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-14 19:45:45 +00:00
Liubov Batanina
ad63d24dba
Merge pull request #18096 from l-bat:update_onnx_importer
...
* Added ReduceSum to ONNX importer
* Fix comments
* Fix Mul
2020-08-14 16:49:42 +00:00
Alexander Alekhin
3b5813c035
Merge pull request #18078 from l-bat:fix_matmul
2020-08-14 13:46:46 +00:00
Liubov Batanina
339b963e6b
Fix MatMul and Add axes
2020-08-14 11:18:58 +03:00
Maksim Doronin
7fe87d9a5b
[IE][VPU]: Refactor vpu configs
2020-08-13 15:25:01 +03:00
Yashas Samaga B L
2171cae8ff
Merge pull request #17976 from YashasSamaga:dnn-fusion-tests-fix-ocl
...
dnn: add exhaustive fusion tests, enable more eltwise fusions
* add eltwise fusion tests, enable more eltwise fusions
* merge weighted eltwise tests with eltwise tests
2020-08-13 10:55:41 +00:00
Liubov Batanina
f3cebb3e1b
Merge pull request #18077 from l-bat:reduce_sum
...
* Supported ReduceSum op
* Skip test
2020-08-12 14:32:16 +00:00
zhaoyue-zephyrus
e231be86b7
support flownet2 with arbitrary input size
...
revise default proto to match the filename in documentations
fix a bug
beautify python codes
fix bug
beautify codes
add test samples with larger/smaller size
remove useless code
using bytearray without creating tmp file
remove useless codes
2020-08-12 00:50:58 +08:00
Alexander Alekhin
b4e9b4fe7d
Merge pull request #18060 from YashasSamaga:cuda4dnn-fix-detection-output-cterr
2020-08-10 20:26:50 +00:00
Alexander Alekhin
2ffe15f22f
Merge pull request #18031 from ilyachur:feature/ichuraev/remove_goe
2020-08-10 17:36:58 +00:00
YashasSamaga
f0149cdae2
fix compile-time errors, disable unsupported tests
2020-08-09 14:43:20 +05:30
Alexander Alekhin
fa25faa2d2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-06 14:15:52 +00:00
Liubov Batanina
6226ea0085
Fix bug in ONNX Gather op
2020-08-06 15:47:34 +03:00
Vadim Pisarevsky
1537ecd931
* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks
...
* hopefully, eliminated compile warnings, errors, as well as failure in one test
* fixed a few typos
* decreased buffer size in some cases
* added more optimal im2row branch in the case of 1x1 convolutions
* tuned fastConv to reduce the number of passes over arrays
backport of commit 77b01deb80
2020-08-04 17:34:48 +00:00
Alexander Alekhin
5b5c42d2c7
Merge pull request #18027 from dkurt:dnn_backport_ngraph
2020-08-04 16:24:11 +00:00
Alexander Alekhin
161890dad4
Merge pull request #18017 from danielenricocahall:add-relu-to-darknet
2020-08-04 16:17:07 +00:00
Ilya Churaev
5fd3d36fe8
Removed get_output_as_single_output_node method
2020-08-04 08:18:38 +03:00
danielenricocahall
8457e471fd
add relu as activation option in darknet
...
add relu option
add relu as activation option in darknet
simplify the setParams if-else ladder
add relu as activation option in darknet
correct activation_param type
format
format
add relu as activation option in darknet
spacing
spacing
add relu as activation option in darknet
2020-08-03 19:19:35 -04:00
Ilya Churaev
246de2b7f5
Replaced copy_with_new_args to clone_with_new_inputs
2020-08-03 23:08:29 +03:00
Ilya Churaev
e8c26963e9
Fixed removing is_parameter, is_constant, is_output
2020-08-03 23:08:22 +03:00
Alexander Alekhin
a28533933f
Merge pull request #17998 from dkurt:dnn_fix_ngraph
2020-08-03 18:23:11 +00:00
Liubov Batanina
d695208727
Merge pull request #17967 from l-bat:non_const_weights_for_conv
...
* Supported convolution with non-const weights
* Fix opencl blobs
* Update tests
2020-08-03 18:02:49 +00:00
Alexander Alekhin
1c8ee3f957
Merge pull request #17885 from alalek:dnn_ocl_slice_update
...
DNN: OpenCL/slice update
* dnn(ocl/slice): make slice kernel VTune friendly
- more unique names
- inline code of copy functions
* dnn(ocl/slice): prefer to spawn more work groups
- even in case with 1D copy
- perf improvement up to 2x of kernel time (due to changed configuration 128x1x1 => 128x32x1)
* dnn(ocl/slice): cache kernel exec info
2020-08-03 14:13:34 +00:00
Dmitry Kurtaev
cf8f65d806
Do not use size_t for nGraph layers
2020-08-02 20:50:44 +03:00
YashasSamaga
a3106d424b
add MVNOp
2020-08-02 12:44:35 +05:30
Vadim Pisarevsky
77b01deb80
Merge pull request #17858 from vpisarev:dnn_depthwise_conv
...
* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks
* hopefully, eliminated compile warnings, errors, as well as failure in one test
* fixed a few typos
* decreased buffer size in some cases
* added more optimal im2row branch in the case of 1x1 convolutions
* tuned fastConv to reduce the number of passes over arrays
2020-08-01 15:05:05 +03:00
Yashas Samaga B L
f53f491cd2
Merge pull request #17939 from YashasSamaga:cuda4dnn-fix-eltwise-fusion
...
* fix eltwise fusion segfault, more eltwise fusions, fix power fusion
* add assertion
2020-08-01 15:03:07 +03:00
YashasSamaga
ae293f27cf
add DetectionOutputOp
2020-07-29 12:28:00 +05:30
Alexander Alekhin
afe9993376
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-28 17:25:20 +00:00
Sinitsina
0ac2f0e04c
mish_functor_update
2020-07-23 09:02:00 +03:00
Maksim Shabunin
b698d0a6ee
Merge pull request #17894 from ilyachur:feature/ichuraev/remove_copy_with_new_args
2020-07-21 08:19:42 +00:00
Ilya Churaev
aaf65bb3af
Fixed removing is_parameter, is_constant, is_output
2020-07-21 08:45:16 +03:00
Ilya Churaev
0291dd3ef8
Replaced copy_with_new_args to clone_with_new_inputs
2020-07-20 16:05:01 +03:00
Liubov Batanina
a35d4f9029
Support Gather for variable inputs
2020-07-20 14:02:45 +03:00
Ilya Churaev
a46ca98c57
Revert "Fixed removing is_parameter, is_constant, is_output"
2020-07-20 07:00:09 +03:00
Ilya Churaev
33db2ac928
Fixed removing is_parameter, is_constant, is_output
2020-07-17 14:45:11 +03:00
Alexander Alekhin
9b7b22ee0e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-16 20:13:27 +00:00
Alexander Alekhin
b2ebd37ee2
Merge pull request #17856 from alalek:dnn_openvino_2020.4.0
2020-07-16 20:08:00 +00:00
Alexander Alekhin
09f24a851e
Merge pull request #17764 from alalek:issue_17762
2020-07-16 15:27:21 +00:00
Alexander Alekhin
81e027eef7
dnn: fix OpenCL implementation of Slice layer
2020-07-16 04:33:52 +00:00
Alexander Alekhin
55e8549839
dnn: eliminate IE deprecation warning
2020-07-15 23:39:06 +00:00
Alexander Alekhin
435b6df989
dnn: use OpenVINO 2020.4 defines
...
original commit: 2813aa7eb9
2020-07-15 20:13:40 +00:00
Alexander Alekhin
cd0f0384ef
Merge pull request #17788 from YashasSamaga:cuda4dnn-nice-build
2020-07-15 12:25:23 +00:00
Dmitry Kurtaev
cc584760d3
Fix TensorFlow->ONNX imports
2020-07-15 14:36:13 +03:00
Alexander Alekhin
e5e767abc1
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-13 19:54:49 +00:00
YashasSamaga
1949056423
improved diagnostics for build issues
2020-07-13 21:09:38 +05:30
Alexander Alekhin
5cb8619eca
dnn(ie): enable KEY_CPU_THREADS_NUM for Windows
2020-07-10 14:29:21 +00:00
YashasSamaga
37e2afb8f6
fix concat fusion for cuda4dnn
2020-07-09 19:40:44 +05:30
Yashas Samaga B L
d0e6d2438c
Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2
...
cuda4dnn(conv): fuse eltwise with convolutions
* fuse eltwise with convolutions
* manually rebase to avoid bad git merge
2020-07-09 16:02:21 +03:00
Alexander Alekhin
44d473fba0
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-08 21:03:43 +00:00
Alexander Alekhin
988bc804bf
Merge pull request #17748 from YashasSamaga:cuda4dnn-data-parallel
2020-07-08 20:20:19 +00:00
Alexander Alekhin
950a916952
Merge pull request #17752 from YashasSamaga:generalize-concat-fusion-3.4
2020-07-07 10:36:02 +00:00
Alexander Alekhin
524a2fffe9
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-06 23:05:04 +00:00
Alexander Alekhin
6781ca7d55
Merge pull request #17685 from YashasSamaga:cuda4dnn-cudnn8-support
2020-07-06 22:48:07 +00:00
Alexander Alekhin
d62e0a3695
Merge pull request #17743 from alalek:test_17666
2020-07-06 22:36:35 +00:00
Alexander Alekhin
d5713c657b
dnn(slice): disable buggy OCV/OCL implementation
2020-07-06 21:36:19 +00:00
Alexander Alekhin
0970ef57bd
Merge pull request #17754 from YashasSamaga:cuda4dnn-slice-concat-copy-reduction
2020-07-06 20:00:52 +00:00
Ilya Churaev
d69a7a3bbf
Fixed header paths for some nGraph ops
...
* Added dependency on IE version
backport of commit: 992c908b56
2020-07-06 19:51:57 +00:00
Ilya Churaev
992c908b56
Merge pull request #17756 from ilyachur:feature/ichuraev/fix_ngraph_headers
...
* Fixed header paths for some nGraph ops
* Added dependency on IE version
2020-07-06 08:11:59 +00:00
YashasSamaga
cbdaa93e54
reduce slice, concat to copy; enable more concat fusions
2020-07-05 20:52:35 +05:30
YashasSamaga
b7eec216e9
generalize axis for concat fusion
2020-07-04 18:57:28 +05:30
YashasSamaga
4988e131fd
transfer output blobs in background
2020-07-04 12:55:12 +05:30
Liubov Batanina
65dbbf712d
Merge pull request #17733 from l-bat:tiny_yolov4
...
* Supported yolov4-tiny
* Added comments
2020-07-03 18:07:08 +00:00
YashasSamaga
62a63021c7
add cuDNN 8 support
2020-06-30 21:51:23 +05:30