opencv

mirror of https://github.com/opencv/opencv.git synced 2025-06-12 12:22:51 +08:00

Author	SHA1	Message	Date
fengyuentau	2959286eb5	tengine: supports conv with asymmetric padding	2022-08-29 02:51:26 +00:00
fengyuentau	0cdff46725	tune for opencl	2022-08-14 17:47:48 +08:00
fengyuentau	e7e814fa8c	remove asymmetric padding checks	2022-08-10 19:52:44 +08:00
Zihao Mu	a80fcacd90	Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor Add per_tensor_quantize to int8 quantize * add per_tensor_quantize to dnn int8 module. * change api flag from perTensor to perChannel, and recognize quantize type and onnx importer. * change the default to hpp	2022-07-05 19:14:42 +03:00
Zihao Mu	59b870a87a	Merge pull request #21910 from zihaomu:fast_conv_ARM DNN: Accelerating convolution * Fast Conv of ARM, X86 and universal intrinsics. * improve code style. * error fixed. * improve the License * optimize memory allocated and Adjust the threshold. * change FasterRCNN_vgg16 to 2GB memory.	2022-07-01 13:03:15 +03:00
rogday	9cd5a0a1e6	Merge pull request #21884 from rogday:cuda_cleanup Fix CUDA compilation issues and adjust thresholds. * Fix CUDA compilation issues and adjust thresholds. * add conformance tests to denylist	2022-04-19 16:40:25 +00:00
Zihao Mu	7b582b71ba	Merge pull request #21036 from fengyuentau:timvx_backend_support dnn: TIM-VX NPU backend support * Add TimVX NPU backend for DNN module. * use official branch from tim-vx repo; fix detecting viv sdk Co-authored-by: fytao <yuantao.feng@outlook.com>	2022-03-31 21:42:11 +00:00
Alexander Alekhin	19926e2979	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-11 17:32:37 +00:00
Alexander Alekhin	effce0573b	dnn: drop legacy Inference Engine NN builder API	2022-02-10 11:55:24 +00:00
Alexander Alekhin	d573472a86	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-31 12:53:45 +00:00
Alexander Alekhin	70b0274c8e	dnn: apply hint to ignore denormals processing	2022-01-26 11:28:35 +00:00
Alexander Alekhin	aebb65e983	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-12 13:26:10 +00:00
Alexander Alekhin	80d9f624d0	dnn: don't use aligned load without alignment checks - weights are unaligned in dasiamprn sample (comes from numpy)	2022-01-12 05:11:18 +00:00
Alexander Alekhin	6d677bbd63	dnn(test): update ONNX conformance filters (4.x)	2021-12-16 12:09:31 +00:00
Hanxi Guo	1fcf7ba5bc	Merge pull request #20406 from MarkGHX:gsoc_2021_webnn [GSoC] OpenCV.js: Accelerate OpenCV.js DNN via WebNN * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Fix the build issue * Update concat_layer.cpp Still have some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Delete bib19450.aux * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Update dnn.cpp * Fix Error in dnn.cpp * Resolve duplication in conditions in convolution_layer.cpp * Fixed the issues in the comments * Fix building issue * Update tutorial * Fixed comments * Address the comments * Update CMakeLists.txt * Offer more accurate perf test on native * Add better perf tests for both native and web * Modify per tests for better results * Use more latest version of Electron * Support latest WebNN Clamp op * Add definition of HAVE_WEBNN macro * Support group convolution * Implement Scale_layer using WebNN * Add Softmax option for native classification example * Fix comments * Fix comments	2021-11-23 21:15:31 +00:00
Alexander Alekhin	cca4c47781	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-08 11:05:45 +00:00
Alexander Alekhin	724e04e979	dnn(ocl4dnn): add extra checks to convolution layer - prevent running code over unsupported/non-tested configurations - prevent integer div by zero	2021-10-07 23:18:32 +00:00
SamFC10	fa90e14b06	int8 layers and 8-bit quantization support	2021-08-19 09:56:47 +05:30
HAN Liutong	aaca4987c9	Merge pull request #20287 from hanliutong:dev-rvv-0.10 Optimization of DNN using native RISC-V vector intrinsics. * Use RVV to optimize fastGEMM (FP32) in DNN. * Use RVV to optimize fastGEMM1T in DNN. * Use RVV to optimize fastConv in DNN. * Use RVV to optimize fastDepthwiseConv in DNN. * Vectorize tails using vl. * Use "vl" instead of scalar to handle small block in fastConv. * Fix memory access out of bound in "fastGEMM1T". * Remove setvl. * Remove useless initialization. * Use loop unrolling to handle tail part instead of switch.	2021-08-11 01:16:03 +03:00
Alexander Alekhin	3e1673e8b2	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-04-01 09:54:57 +00:00
Vitaly Tuzov	aab62aa6dd	Merge pull request #18952 from terfendail:wui_doc * Updated UI documentation to address WUI * Added documentation for vx_ calls * Removed vx_store operation overload * Doxyfile updated to enable wide UI * Enable doxygen documentation for vx_ WUI functions * Wide intrinsics definition rework * core: fix SIMD C++ emulator build (supports 128-bit only)	2021-03-30 16:18:03 +00:00
Alexander Alekhin	ca8c3dd9b5	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-03-22 12:05:23 +00:00
Liubov Batanina	c0dd82fb53	Merge pull request #19632 from l-bat:lb/ie_arm_target Added OpenVINO ARM target * Added IE ARM target * Added OpenVINO ARM target * Delete ARM target * Detect ARM platform * Changed device name in ArmPlugin * Change ARM detection	2021-03-20 11:20:02 +00:00
Sergey Slashchinin	e2949c7d0a	Align 3.4 branch with master	2021-01-29 23:48:08 +03:00
Sergei Slashchinin	ea41f89b40	Merge pull request #19058 from sl-sergei:cuda_1d Conv1D and Pool1D for CUDA backend * CUDA-independent changes * Add Conv1D and Pool1D for CUDA backend * CUDA-independent changes * Fix typo * fix comment * Update fix * make changes more correct for pooling layer * Minor fixes for review * Split skip blocks	2021-01-21 22:16:56 +00:00
Alexander Alekhin	624d532000	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-12-17 21:05:34 +00:00
Alexander Alekhin	c240355cc6	dnn(ocl): avoid mess FP16/FP32 in convolution layer	2020-12-15 08:51:24 +00:00
Alexander Alekhin	de385009ae	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-12-09 18:09:00 +00:00
Alexander Alekhin	00f36a3149	dnn: prefer to use v_fma() instead of v_c += v_a * v_b	2020-12-05 11:51:03 +00:00
Omar Alzaibaq	a316b11aaa	Merge pull request #18220 from Omar-AE:hddl-supported * added HDDL VPU support * changed to return True in one line if any device connected * dnn: use releaseHDDLPlugin() * dnn(hddl): fix conditions	2020-11-17 19:47:24 +00:00
Alexander Alekhin	a7c150ec66	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-11-13 22:29:14 +00:00
Sergei Slashchinin	61144f935e	Merge pull request #18783 from sl-sergei:fix_conv1d Add support for Conv1D on OpenCV backend * Add support for Conv1D on OpenCV backend * disable tests on other targets/backends * Fix formatting * Restore comment * Remove unnecessary flag and fix test logic * Fix perf test * fix braces * Fix indentation, assert check and remove unnecessary condition * Remove unnecessary changes * Add test cases for variable weights and bias * dnn(conv): fallback on OpenCV+CPU instead of failures * coding style	2020-11-13 22:22:10 +00:00
Alexander Alekhin	1b443219ed	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-10-09 20:09:26 +00:00
Alexander Alekhin	cdcf7e62f3	dnn(opencl): bypass unsupported fusion cases 2	2020-10-09 18:59:08 +00:00
Alexander Alekhin	718dd9f170	dnn(opencl): bypass unsupported fusion cases	2020-10-09 12:33:06 +00:00
NesQl	3fc1487cc9	Merge pull request #18323 from liqi-c:tengine-lite-update Tengine lite update * update tengine * Modify for arm32 build. * format optimization * add teng_ befor some tengine api * update graph_t to teng_graph_t * update graph_t to teng_graph_t * Code structure optimization * optimization * optimization * remove space * update tengine url Co-authored-by: liqi <qli@openailab.com>	2020-09-23 09:34:29 +00:00
Alexander Alekhin	1f2c83845d	backport: checks and fixes from static code analyzers results original commit: `71f665bd8c`	2020-09-02 19:05:47 +00:00
Alexander Alekhin	71f665bd8c	checks and fixes from static code analyzers results	2020-09-02 21:59:34 +03:00
Alexander Alekhin	fa25faa2d2	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-08-06 14:15:52 +00:00
Vadim Pisarevsky	1537ecd931	* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks * hopefully, eliminated compile warnings, errors, as well as failure in one test * * fixed a few typos * decreased buffer size in some cases * added more optimal im2row branch in the case of 1x1 convolutions * tuned fastConv to reduce the number of passes over arrays backport of commit `77b01deb80`	2020-08-04 17:34:48 +00:00
Liubov Batanina	d695208727	Merge pull request #17967 from l-bat:non_const_weights_for_conv * Supported convolution with non-const weights * Fix opencl blobs * Update tests	2020-08-03 18:02:49 +00:00
Vadim Pisarevsky	77b01deb80	Merge pull request #17858 from vpisarev:dnn_depthwise_conv * added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks * hopefully, eliminated compile warnings, errors, as well as failure in one test * * fixed a few typos * decreased buffer size in some cases * added more optimal im2row branch in the case of 1x1 convolutions * tuned fastConv to reduce the number of passes over arrays	2020-08-01 15:05:05 +03:00
Yashas Samaga B L	f53f491cd2	Merge pull request #17939 from YashasSamaga:cuda4dnn-fix-eltwise-fusion * fix eltwise fusion segfault, more eltwise fusions, fix power fusion * add assertion	2020-08-01 15:03:07 +03:00
Yashas Samaga B L	d0e6d2438c	Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 cuda4dnn(conv): fuse eltwise with convolutions * fuse eltwise with convolutions * manually rebase to avoid bad git merge	2020-07-09 16:02:21 +03:00
Alexander Alekhin	88d8a48b09	Merge pull request #17374 from alalek:dnn_fix_build	2020-05-25 18:46:15 +00:00
Alexander Alekhin	73aa5f567b	dnn: *_DENORMALS_ZERO_MODE is defined for SSE3	2020-05-25 17:55:36 +00:00
Alexander Alekhin	21e28adb87	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-05-22 19:50:14 +00:00
Dmitry Kurtaev	68d59a2913	Flush to zero Convolution denormal weights	2020-05-15 23:44:34 +03:00
Yashas Samaga B L	d981d04c76	Merge pull request #17200 from YashasSamaga:cuda4dnn-general-opt1 cuda4dnn: optimizations for swish, mish, sigmoid, region, resize based ops, transpose, identity-conv fusion * bunch of optimizations * more accurate implementation for mish	2020-05-09 17:20:30 +00:00
Alexander Alekhin	9b3be01b83	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-03-09 20:27:34 +00:00

1 2 3 4

177 Commits