Alexander Alekhin
4fa82809df
ocl: avoid rescheduling of async kernels
2020-09-18 14:53:50 +00:00
Alexander Alekhin
3e3787ecb6
Merge pull request #18360 from tomoaki0705:fixClampFailure
2020-09-18 13:10:36 +00:00
Liubov Batanina
ebb528976f
Merge pull request #18353 from l-bat:issue_18350
...
* Fixed bug in ONNX Mul op
* Replaced node
2020-09-18 13:01:14 +00:00
Tomoaki Teshima
f77c2d700f
add explicit cast for half
2020-09-18 21:04:24 +09:00
Sergei Slashchinin
fa953e4205
Merge pull request #18316 from sl-sergei:fix_18253
...
Fix loading of ONNX models with Resize operation with Opset 11 for newer versions of Pytorch
* Add reproducer for Resize operation from newer versions of Pytorch
* Fix loading of scales parameter for Resize layer
* Change check type for better diagnostic messages
2020-09-17 11:05:22 +00:00
Alexander Alekhin
4b24ddd70d
Merge pull request #18317 from sl-sergei:restored_pr_17629
2020-09-13 12:51:41 +00:00
Shubham Singh
23e71d1aa2
fixes #17187 probably
...
Added Eltwise Layer Support
2020-09-11 18:53:42 +03:00
Liubov Batanina
b542a1804c
Support global reduce ops
2020-09-09 11:56:20 +03:00
Alexander Alekhin
5c76f99fdd
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-09-05 19:37:38 +00:00
Alexander Alekhin
7e019af50d
checks and fixes from static code analyzers results (2)
2020-09-03 18:28:27 +00:00
Alexander Alekhin
fa11b98800
Merge pull request #18255 from alalek:backport_18243
2020-09-02 22:50:21 +00:00
Alexander Alekhin
64c67a93d3
Merge pull request #18246 from YashasSamaga:dnn-permute-fix-unwanted-ocl-init
2020-09-02 21:15:19 +00:00
Alexander Alekhin
1f2c83845d
backport: checks and fixes from static code analyzers results
...
original commit: 71f665bd8c
2020-09-02 19:05:47 +00:00
Alexander Alekhin
71f665bd8c
checks and fixes from static code analyzers results
2020-09-02 21:59:34 +03:00
Liubov Batanina
2349a09736
Support Reshape with zero dim
2020-09-02 11:58:43 +03:00
YashasSamaga
44bf748479
do not allocate UMat in non-OpenCL targets
2020-09-02 12:18:41 +05:30
Alexander Alekhin
ba147d2be2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-26 15:51:25 +00:00
Alexander Alekhin
792722865f
Merge pull request #17919 from zhaoyue-zephyrus:flownet2_with_anysize
2020-08-26 11:43:17 +00:00
Sergei Slashchinin
9aa401a7d0
Merge pull request #17978 from sl-sergei:fix_17516_17531
...
* Fix ONNX loading in issues opencv#17516, opencv#17531
* Add tests for Linear and Matmul layers
* Disable tests for IE versions lower than 20.4
* Skip unstable tests with OpenCL FP16 on Intel GPU
* Add correct test filtering for OpenCL FP16 tests
2020-08-26 10:15:59 +00:00
Liubov Batanina
d392b11dfb
Supported ONNX Pow op
2020-08-24 11:20:18 +03:00
Alexander Alekhin
41548a091b
Merge pull request #17993 from Maxim-Doronin:vpu/mdoronin/refactor_vpu_configs
2020-08-17 19:20:04 +00:00
Alexander Alekhin
b45273eccb
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-14 19:45:45 +00:00
Liubov Batanina
ad63d24dba
Merge pull request #18096 from l-bat:update_onnx_importer
...
* Added ReduceSum to ONNX importer
* Fix comments
* Fix Mul
2020-08-14 16:49:42 +00:00
Alexander Alekhin
3b5813c035
Merge pull request #18078 from l-bat:fix_matmul
2020-08-14 13:46:46 +00:00
Liubov Batanina
339b963e6b
Fix MatMul and Add axes
2020-08-14 11:18:58 +03:00
Maksim Doronin
7fe87d9a5b
[IE][VPU]: Refactor vpu configs
2020-08-13 15:25:01 +03:00
Yashas Samaga B L
2171cae8ff
Merge pull request #17976 from YashasSamaga:dnn-fusion-tests-fix-ocl
...
dnn: add exhaustive fusion tests, enable more eltwise fusions
* add eltwise fusion tests, enable more eltwise fusions
* merge weighted eltwise tests with eltwise tests
2020-08-13 10:55:41 +00:00
Liubov Batanina
f3cebb3e1b
Merge pull request #18077 from l-bat:reduce_sum
...
* Supported ReduceSum op
* Skip test
2020-08-12 14:32:16 +00:00
zhaoyue-zephyrus
e231be86b7
support flownet2 with arbitary input size
...
revise default proto to match the filename in documentations
fix a bug
beautify python codes
fix bug
beautify codes
add test samples with larger/smaller size
remove unless code
using bytearray without creating tmp file
remove useless codes
2020-08-12 00:50:58 +08:00
Alexander Alekhin
b4e9b4fe7d
Merge pull request #18060 from YashasSamaga:cuda4dnn-fix-detection-output-cterr
2020-08-10 20:26:50 +00:00
Alexander Alekhin
2ffe15f22f
Merge pull request #18031 from ilyachur:feature/ichuraev/remove_goe
2020-08-10 17:36:58 +00:00
YashasSamaga
f0149cdae2
fix compile-time errors, disable unsupported tests
2020-08-09 14:43:20 +05:30
Alexander Alekhin
fa25faa2d2
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-08-06 14:15:52 +00:00
Liubov Batanina
6226ea0085
Fix bug in ONNX Gather op
2020-08-06 15:47:34 +03:00
Vadim Pisarevsky
1537ecd931
* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks
...
* hopefully, eliminated compile warnings, errors, as well as failure in one test
* * fixed a few typos
* decreased buffer size in some cases
* added more optimal im2row branch in the case of 1x1 convolutions
* tuned fastConv to reduce the number of passes over arrays
backport of commit 77b01deb80
2020-08-04 17:34:48 +00:00
Alexander Alekhin
5b5c42d2c7
Merge pull request #18027 from dkurt:dnn_backport_ngraph
2020-08-04 16:24:11 +00:00
Alexander Alekhin
161890dad4
Merge pull request #18017 from danielenricocahall:add-relu-to-darknet
2020-08-04 16:17:07 +00:00
Ilya Churaev
5fd3d36fe8
Removed get_output_as_single_output_node method
2020-08-04 08:18:38 +03:00
danielenricocahall
8457e471fd
add relu as activation option in darknet
...
add relu option
add relu as activation option in darknet
simplify the setParams if-else ladder
add relu as activation option in darknet
correct activation_param type
format
format
add relu as activation option in darknet
spacing
spacing
add relu as activation option in darknet
2020-08-03 19:19:35 -04:00
Ilya Churaev
246de2b7f5
Replaced copy_with_new_args to clone_with_new_inputs
2020-08-03 23:08:29 +03:00
Ilya Churaev
e8c26963e9
Fixed removing is_parameter, is_constant, is_output
2020-08-03 23:08:22 +03:00
Alexander Alekhin
a28533933f
Merge pull request #17998 from dkurt:dnn_fix_ngraph
2020-08-03 18:23:11 +00:00
Liubov Batanina
d695208727
Merge pull request #17967 from l-bat:non_const_weights_for_conv
...
* Supported convolution with non-const weights
* Fix opencl blobs
* Update tests
2020-08-03 18:02:49 +00:00
Alexander Alekhin
1c8ee3f957
Merge pull request #17885 from alalek:dnn_ocl_slice_update
...
DNN: OpenCL/slice update
* dnn(ocl/slice): make slice kernel VTune friendly
- more unique names
- inline code of copy functions
* dnn(ocl/slice): prefer to spawn more work groups
- even in case with 1D copy
- perf improvement up to 2x of kernel time (due to changed configuration 128x1x1 => 128x32x1)
* dnn(ocl/slice): cache kernel exec info
2020-08-03 14:13:34 +00:00
Dmitry Kurtaev
cf8f65d806
Do not use size_t for nGraph layers
2020-08-02 20:50:44 +03:00
YashasSamaga
a3106d424b
add MVNOp
2020-08-02 12:44:35 +05:30
Vadim Pisarevsky
77b01deb80
Merge pull request #17858 from vpisarev:dnn_depthwise_conv
...
* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks
* hopefully, eliminated compile warnings, errors, as well as failure in one test
* * fixed a few typos
* decreased buffer size in some cases
* added more optimal im2row branch in the case of 1x1 convolutions
* tuned fastConv to reduce the number of passes over arrays
2020-08-01 15:05:05 +03:00
Yashas Samaga B L
f53f491cd2
Merge pull request #17939 from YashasSamaga:cuda4dnn-fix-eltwise-fusion
...
* fix eltwise fusion segfault, more eltwise fusions, fix power fusion
* add assertion
2020-08-01 15:03:07 +03:00
YashasSamaga
ae293f27cf
add DetectionOutputOp
2020-07-29 12:28:00 +05:30
Alexander Alekhin
afe9993376
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-28 17:25:20 +00:00
Sinitsina
0ac2f0e04c
mish_functor_update
2020-07-23 09:02:00 +03:00
Maksim Shabunin
b698d0a6ee
Merge pull request #17894 from ilyachur:feature/ichuraev/remove_copy_with_new_args
2020-07-21 08:19:42 +00:00
Ilya Churaev
aaf65bb3af
Fixed removing is_parameter, is_constant, is_output
2020-07-21 08:45:16 +03:00
Ilya Churaev
0291dd3ef8
Replaced copy_with_new_args to clone_with_new_inputs
2020-07-20 16:05:01 +03:00
Liubov Batanina
a35d4f9029
Support Gather for variable inputs
2020-07-20 14:02:45 +03:00
Ilya Churaev
a46ca98c57
Revert "Fixed removing is_parameter, is_constant, is_output"
2020-07-20 07:00:09 +03:00
Ilya Churaev
33db2ac928
Fixed removing is_parameter, is_constant, is_output
2020-07-17 14:45:11 +03:00
Alexander Alekhin
9b7b22ee0e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-16 20:13:27 +00:00
Alexander Alekhin
b2ebd37ee2
Merge pull request #17856 from alalek:dnn_openvino_2020.4.0
2020-07-16 20:08:00 +00:00
Alexander Alekhin
09f24a851e
Merge pull request #17764 from alalek:issue_17762
2020-07-16 15:27:21 +00:00
Alexander Alekhin
81e027eef7
dnn: fix OpenCL implementation of Slice layer
2020-07-16 04:33:52 +00:00
Alexander Alekhin
55e8549839
dnn: eliminate IE deprecation warning
2020-07-15 23:39:06 +00:00
Alexander Alekhin
435b6df989
dnn: use OpenVINO 2020.4 defines
...
original commit: 2813aa7eb9
2020-07-15 20:13:40 +00:00
Alexander Alekhin
cd0f0384ef
Merge pull request #17788 from YashasSamaga:cuda4dnn-nice-build
2020-07-15 12:25:23 +00:00
Dmitry Kurtaev
cc584760d3
Fix TensorFlow->ONNX imports
2020-07-15 14:36:13 +03:00
Alexander Alekhin
e5e767abc1
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-13 19:54:49 +00:00
YashasSamaga
1949056423
improved diagnostics for build issues
2020-07-13 21:09:38 +05:30
Alexander Alekhin
5cb8619eca
dnn(ie): enable KEY_CPU_THREADS_NUM for Windows
2020-07-10 14:29:21 +00:00
YashasSamaga
37e2afb8f6
fix concat fusion for cuda4dnn
2020-07-09 19:40:44 +05:30
Yashas Samaga B L
d0e6d2438c
Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2
...
cuda4dnn(conv): fuse eltwise with convolutions
* fuse eltwise with convolutions
* manually rebase to avoid bad git merge
2020-07-09 16:02:21 +03:00
Alexander Alekhin
44d473fba0
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-08 21:03:43 +00:00
Alexander Alekhin
988bc804bf
Merge pull request #17748 from YashasSamaga:cuda4dnn-data-parallel
2020-07-08 20:20:19 +00:00
Alexander Alekhin
950a916952
Merge pull request #17752 from YashasSamaga:generalize-concat-fusion-3.4
2020-07-07 10:36:02 +00:00
Alexander Alekhin
524a2fffe9
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-07-06 23:05:04 +00:00
Alexander Alekhin
6781ca7d55
Merge pull request #17685 from YashasSamaga:cuda4dnn-cudnn8-support
2020-07-06 22:48:07 +00:00
Alexander Alekhin
d62e0a3695
Merge pull request #17743 from alalek:test_17666
2020-07-06 22:36:35 +00:00
Alexander Alekhin
d5713c657b
dnn(slice): disable buggy OCV/OCL implementation
2020-07-06 21:36:19 +00:00
Alexander Alekhin
0970ef57bd
Merge pull request #17754 from YashasSamaga:cuda4dnn-slice-concat-copy-reduction
2020-07-06 20:00:52 +00:00
Ilya Churaev
d69a7a3bbf
Fixed header paths for some nGraph ops
...
* Added dependency on IE version
backport of commit: 992c908b56
2020-07-06 19:51:57 +00:00
Ilya Churaev
992c908b56
Merge pull request #17756 from ilyachur:feature/ichuraev/fix_ngraph_headers
...
* Fixed header paths for some nGraph ops
* Added dependency on IE version
2020-07-06 08:11:59 +00:00
YashasSamaga
cbdaa93e54
reduce slice, concat to copy; enable more concat fusions
2020-07-05 20:52:35 +05:30
YashasSamaga
b7eec216e9
generalize axis for concat fusion
2020-07-04 18:57:28 +05:30
YashasSamaga
4988e131fd
transfer output blobs in background
2020-07-04 12:55:12 +05:30
Liubov Batanina
65dbbf712d
Merge pull request #17733 from l-bat:tiny_yolov4
...
* Supported yolov4-tiny
* Added comments
2020-07-03 18:07:08 +00:00
YashasSamaga
62a63021c7
add cuDNN 8 support
2020-06-30 21:51:23 +05:30
Ilya Lavrenov
e58ce89b10
Conditional compilation for IR v7 support
...
backported commit 86905754e4
2020-06-25 10:05:14 +00:00
Ilya Lavrenov
86905754e4
Conditional compilation for IR v7 support
2020-06-25 11:32:10 +03:00
Alexander Alekhin
c81d785ada
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-06-23 19:15:47 +00:00
Alexander Alekhin
6fb46bb34b
Merge pull request #17621 from YashasSamaga:cuda4dnn-mish-half
2020-06-23 18:44:50 +00:00
Alexander Alekhin
2e165053ef
Merge pull request #17624 from dkurt:dnn_optimize_mish
2020-06-23 18:43:51 +00:00
Dmitry Kurtaev
3bdd7bfbea
Remove deprecated Inference Engine CPU extensions
2020-06-23 14:33:22 +03:00
Dmitry Kurtaev
8e3f5fb209
Remove deprecated Inference Engine CPU extensions
2020-06-23 10:10:29 +03:00
Dmitry Kurtaev
1491934d17
Optimize Mish for CPU backend
2020-06-22 23:27:47 +03:00
YashasSamaga
6573b9ace0
use fp32 mish for fp16 mish
2020-06-22 19:09:36 +05:30
Liubov Batanina
85c0c8c7ed
Disabling dafault NMS in yolo layer
2020-06-19 14:34:13 +03:00
Alexander Alekhin
e3d502310f
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
2020-06-19 07:33:50 +00:00
Yashas Samaga B L
9ba5581d17
Merge pull request #17534 from YashasSamaga:cuda4dnn-remove-unused-funcs
...
cuda4dnn: reduce CUDA version requirements to at least CUDA 9.2
* remove half2 specializations
* do not remove atomicAdd for half in CUDA 10 and below
* remove fp16.hpp
2020-06-17 09:07:52 +00:00
Liubov Batanina
d93b6be3cc
Changed StridedSlice to VariadicSplit in Region layer
2020-06-17 10:02:53 +03:00
Ilya Lavrenov
676b818d6a
Removed plugin dispatcher
...
backport of commit 74113737f3
2020-06-15 18:03:14 +00:00
Ilya Lavrenov
74113737f3
Removed plugin dispatcher
2020-06-15 16:02:52 +03:00