opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-01 23:30:06 +08:00

Author	SHA1	Message	Date
Alexander Alekhin	347246901e	Merge pull request #21745 from alalek:dnn_plugin_openvino	2022-10-08 22:32:25 +00:00
Alexander Alekhin	43b2bb2c25	dnn: plugin support for OpenVINO	2022-10-07 16:57:31 +00:00
wxsheng	4154bd0667	Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add resize.lasx.cpp for Loongson SIMD acceleration * Add imgwarp.lasx.cpp for Loongson SIMD acceleration * Add LASX acceleration support for dnn/conv * Add CV_PAUSE(v) for Loongarch * Set LASX by default on Loongarch64 * LoongArch: tune test threshold for Core/HAL.mat_decomp/15 Co-authored-by: shengwenxue <shengwenxue@loongson.cn>	2022-09-10 09:39:43 +03:00
Zihao Mu	7b582b71ba	Merge pull request #21036 from fengyuentau:timvx_backend_support dnn: TIM-VX NPU backend support * Add TimVX NPU backend for DNN module. * use official branch from tim-vx repo; fix detecting viv sdk Co-authored-by: fytao <yuantao.feng@outlook.com>	2022-03-31 21:42:11 +00:00
Alexander Alekhin	19926e2979	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-11 17:32:37 +00:00
Alexander Alekhin	effce0573b	dnn: drop legacy Inference Engine NN builder API	2022-02-10 11:55:24 +00:00
Maksim Shabunin	d1e76a34a0	3.4: Use modern OpenVINO package interface original commit: `437af37b13`	2022-02-02 09:04:03 +00:00
Maksim Shabunin	437af37b13	Use modern OpenVINO package interface	2022-02-01 16:52:17 +00:00
Alexander Alekhin	d9e7c1626a	Merge pull request #21153 from alalek:build_warnings_msvs2017	2021-12-01 12:49:28 +00:00
Alexander Alekhin	66b2140892	build: eliminate C4309 warning from protobuf files with MSVS2017	2021-11-30 04:27:39 +00:00
Hanxi Guo	1fcf7ba5bc	Merge pull request #20406 from MarkGHX:gsoc_2021_webnn [GSoC] OpenCV.js: Accelerate OpenCV.js DNN via WebNN * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Fix the build issue * Update concat_layer.cpp Still have some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Delete bib19450.aux * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Update dnn.cpp * Fix Error in dnn.cpp * Resolve duplication in conditions in convolution_layer.cpp * Fixed the issues in the comments * Fix building issue * Update tutorial * Fixed comments * Address the comments * Update CMakeLists.txt * Offer more accurate perf test on native * Add better perf tests for both native and web * Modify per tests for better results * Use more latest version of Electron * Support latest WebNN Clamp op * Add definition of HAVE_WEBNN macro * Support group convolution * Implement Scale_layer using WebNN * Add Softmax option for native classification example * Fix comments * Fix comments	2021-11-23 21:15:31 +00:00
Alexander Alekhin	d934bb15b0	Merge pull request #20998 from alalek:update_protobuf_3.19.1 3rdparty(protobuf): upgrade 3.5.2 => 3.19.1 * 3rdparty(protobuf): upgrade 3.5.2 => 3.19.1 * dnn: update protobuf files (3.19.1) * 3rdparty(protobuf): re-apply OpenCV patch for custom fields (3.19.1) * protobuf: suppress new build warnings * protobuf: remove unused files	2021-11-10 12:03:45 +00:00
Alexander Alekhin	7842181b47	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-11-05 09:27:46 +00:00
Alexander Alekhin	c1d61c88e9	dnn(cmake): don't hijack OpenCL options with Tengine	2021-11-04 09:59:19 +00:00
SamFC10	fa90e14b06	int8 layers and 8-bit quantization support	2021-08-19 09:56:47 +05:30
HAN Liutong	aaca4987c9	Merge pull request #20287 from hanliutong:dev-rvv-0.10 Optimization of DNN using native RISC-V vector intrinsics. * Use RVV to optimize fastGEMM (FP32) in DNN. * Use RVV to optimize fastGEMM1T in DNN. * Use RVV to optimize fastConv in DNN. * Use RVV to optimize fastDepthwiseConv in DNN. * Vectorize tails using vl. * Use "vl" instead of scalar to handle small block in fastConv. * Fix memory access out of bound in "fastGEMM1T". * Remove setvl. * Remove useless initialization. * Use loop unrolling to handle tail part instead of switch.	2021-08-11 01:16:03 +03:00
Alexander Alekhin	170bf6d7af	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-05-01 09:44:24 +00:00
Suleyman TURKMEN	159534313e	Update CMakeLists.txt	2021-04-26 22:43:04 +03:00
NesQl	3fc1487cc9	Merge pull request #18323 from liqi-c:tengine-lite-update Tengine lite update * update tengine * Modify for arm32 build. * format optimization * add teng_ befor some tengine api * update graph_t to teng_graph_t * update graph_t to teng_graph_t * Code structure optimization * optimization * optimization * remove space * update tengine url Co-authored-by: liqi <qli@openailab.com>	2020-09-23 09:34:29 +00:00
YashasSamaga	ead1dcf308	error if cuda4dnn depends are not resolved	2020-07-11 21:37:51 +05:30
YashasSamaga	62a63021c7	add cuDNN 8 support	2020-06-30 21:51:23 +05:30
cyy	206c843f36	Merge pull request #17499 from cyyever:fix_CUDA11 Fix cuda11 * use cudnn_version.h to detect version when it is available * remove nppi from CUDA11 * use ocv_list_filterout * dnn(cuda): temporary disable CUDNN 8.0	2020-06-27 20:34:44 +00:00
Giles Payne	02385472b6	Merge pull request #17165 from komakai:objc-binding Objc binding * Initial work on Objective-C wrapper * Objective-C generator script; update manually generated wrappers * Add Mat tests * Core Tests * Imgproc wrapper generation and tests * Fixes for Imgcodecs wrapper * Miscellaneous fixes. Swift build support * Objective-C wrapper build/install * Add Swift wrappers for videoio/objdetect/feature2d * Framework build;iOS support * Fix toArray functions;Use enum types whenever possible * Use enum types where possible;prepare test build * Update test * Add test runner scripts for iOS and macOS * Add test scripts and samples * Build fixes * Fix build (cmake 3.17.x compatibility) * Fix warnings * Fix enum name conflicting handling * Add support for document generation with Jazzy * Swift/Native fast accessor functions * Add Objective-C wrapper for calib3d, dnn, ml, photo and video modules * Remove IntOut/FloatOut/DoubleOut classes * Fix iOS default test platform value * Fix samples * Revert default framework name to opencv2 * Add converter util functions * Fix failing test * Fix whitespace * Add handling for deprecated methods;fix warnings;define __OPENCV_BUILD * Suppress cmake warnings * Reduce severity of "jazzy not found" log message * Fix incorrect #include of compatibility header in ios.h * Use explicit returns in subscript/get implementation * Reduce minimum required cmake version to 3.15 for Objective-C/Swift binding	2020-06-08 18:32:53 +00:00
Alexander Alekhin	c722625f28	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-04-28 16:53:19 +00:00
Alexander Alekhin	9181ecfc7b	cmake: fix protobuf handling	2020-04-27 02:11:19 +00:00
Alexander Alekhin	2cef100303	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-04-16 18:28:27 +00:00
Ilya Lavrenov	91b0100287	Fixed compilation when NN builder is not built	2020-04-14 15:05:01 +03:00
Alexander Alekhin	e661ad2a67	eliminate build warnings	2020-03-27 11:39:07 +00:00
Alexander Alekhin	b4b4d21212	eliminate build warnings	2020-03-26 19:18:09 +00:00
Alexander Alekhin	d00e58cdb0	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-03-10 22:49:51 +00:00
Alexander Alekhin	510a8520c7	Merge pull request #16746 from alalek:dnn_switch_ie_backend_ngraph	2020-03-10 13:52:33 +00:00
Alexander Alekhin	db95aec4a7	dnn(ie): switch to nGraph backend by default	2020-03-10 14:33:22 +03:00
Alexander Alekhin	9b3be01b83	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-03-09 20:27:34 +00:00
NesQl	0bcdf7d03e	Merge pull request #16724 from liqi-c:3.4-tengine * Add Tengine support . * Modify printf to CV_LOG_WARNING * a few minor fixes in the code * Renew Tengine version * Add header file for CV_LOG_WARNING * Add #ifdef HAVE_TENGINE in tengine_graph_convolution.cpp * remove trailing whitespace * Remove trailing whitespace * Modify for compile problem * Modify some code style error * remove whitespace * Move some code style problem * test * add ios limit and build problem * Modified as alalek suggested * Add cmake 2.8 support * modify cmake 3.5.1 problem * test and set BUILD_ANDROID_PROJECTS OFF * remove some compile error * remove some extra code in tengine * close test. * Test again * disable android. * delete ndk version judgement * Remove setenv() call . and add License information * Set tengine default OFF. Close test . Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>	2020-03-09 14:59:23 +00:00
Alexander Alekhin	124bf8339f	dnn(IE): use HAVE_DNN_IE_NN_BUILDER_2019 for NN Builder API code - CMake option: OPENCV_DNN_IE_NN_BUILDER_2019	2020-03-03 08:07:54 +00:00
Alexander Alekhin	29d214474f	dnn(IE): use HAVE_DNN_IE_NN_BUILDER_2019 for NN Builder API code - CMake option: OPENCV_DNN_IE_NN_BUILDER_2019	2020-03-03 07:45:09 +00:00
Julien	4e2ef8c8f5	Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus Enable cuda4dnn on hardware without support for __half * Enable cuda4dnn on hardware without support for half (ie. compute capability < 5.3) Update CMakeLists.txt Lowered minimum CC to 3.0 * UPD: added ifdef on new copy kernel * added fp16 support detection at runtime * Clarified #if condition on atomicAdd definition * More explicit CMake error message	2020-01-15 18:28:37 +03:00
Alexander Alekhin	92b9888837	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-12-12 13:02:19 +03:00
Alexander Alekhin	5ee7abbe3c	Merge pull request #16088 from alalek:dnn_eltwise_layer_different_src_channels dnn(eltwise): fix handling of different number of channels * dnn(test): reproducer for Eltwise layer issue from PR16063 * dnn(eltwise): rework support for inputs with different channels * dnn(eltwise): get rid of finalize(), variableChannels * dnn(eltwise): update input sorting by number of channels - do not swap inputs if number of channels are same after truncation * dnn(test): skip "shortcut" with batch size 2 on MYRIAD targets	2019-12-11 20:16:58 +03:00
Alexander Alekhin	4b0132ed7a	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-12-02 16:26:52 +03:00
Lubov Batanina	7523c777c5	Merge pull request #15537 from l-bat:ngraph * Support nGraph * Fix resize	2019-12-02 16:16:06 +03:00
Yashas Samaga B L	613c12e590	Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low CUDA backend for the DNN module * stub cuda4dnn design * minor fixes for tests and doxygen * add csl public api directory to module headers * add low-level CSL components * add high-level CSL components * integrate csl::Tensor into backbone code * switch to CPU iff unsupported; otherwise, fail on error * add fully connected layer * add softmax layer * add activation layers * support arbitary rank TensorDescriptor * pass input wrappers to `initCUDA()` * add 1d/2d/3d-convolution * add pooling layer * reorganize and refactor code * fixes for gcc, clang and doxygen; remove cxx14/17 code * add blank_layer * add LRN layer * add rounding modes for pooling layer * split tensor.hpp into tensor.hpp and tensor_ops.hpp * add concat layer * add scale layer * add batch normalization layer * split math.cu into activations.cu and math.hpp * add eltwise layer * add flatten layer * add tensor transform api * add asymmetric padding support for convolution layer * add reshape layer * fix rebase issues * add permute layer * add padding support for concat layer * refactor and reorganize code * add normalize layer * optimize bias addition in scale layer * add prior box layer * fix and optimize normalize layer * add asymmetric padding support for pooling layer * add event API * improve pooling performance for some padding scenarios * avoid over-allocation of compute resources to kernels * improve prior box performance * enable layer fusion * add const layer * add resize layer * add slice layer * add padding layer * add deconvolution layer * fix channelwise ReLU initialization * add vector traits * add vectorized versions of relu, clipped_relu, power * add vectorized concat kernels * improve concat_with_offsets performance * vectorize scale and bias kernels * add support for multi-billion element tensors * vectorize prior box kernels * fix address alignment check * improve bias addition performance of conv/deconv/fc layers * restructure code for supporting multiple targets * add DNN_TARGET_CUDA_FP64 * add DNN_TARGET_FP16 * improve vectorization * add region layer * improve tensor API, add dynamic ranks 1. use ManagedPtr instead of a Tensor in backend wrapper 2. add new methods to tensor classes - size_range: computes the combined size of for a given axis range - tensor span/view can be constructed from a raw pointer and shape 3. the tensor classes can change their rank at runtime (previously rank was fixed at compile-time) 4. remove device code from tensor classes (as they are unused) 5. enforce strict conditions on tensor class APIs to improve debugging ability * fix parametric relu activation * add squeeze/unsqueeze tensor API * add reorg layer * optimize permute and enable 2d permute * enable 1d and 2d slice * add split layer * add shuffle channel layer * allow tensors of different ranks in reshape primitive * patch SliceOp to allow Crop Layer * allow extra shape inputs in reshape layer * use `std::move_backward` instead of `std::move` for insert in resizable_static_array * improve workspace management * add spatial LRN * add nms (cpu) to region layer * add max pooling with argmax ( and a fix to limits.hpp) * add max unpooling layer * rename DNN_TARGET_CUDA_FP32 to DNN_TARGET_CUDA * update supportBackend to be more rigorous * remove stray include from preventing non-cuda build * include op_cuda.hpp outside condition #if * refactoring, fixes and many optimizations * drop DNN_TARGET_CUDA_FP64 * fix gcc errors * increase max. tensor rank limit to six * add Interp layer * drop custom layers; use BackendNode * vectorize activation kernels * fixes for gcc * remove wrong assertion * fix broken assertion in unpooling primitive * fix build errors in non-CUDA build * completely remove workspace from public API * fix permute layer * enable accuracy and perf. tests for DNN_TARGET_CUDA * add asynchronous forward * vectorize eltwise ops * vectorize fill kernel * fixes for gcc * remove CSL headers from public API * remove csl header source group from cmake * update min. cudnn version in cmake * add numerically stable FP32 log1pexp * refactor code * add FP16 specialization to cudnn based tensor addition * vectorize scale1 and bias1 + minor refactoring * fix doxygen build * fix invalid alignment assertion * clear backend wrappers before allocateLayers * ignore memory lock failures * do not allocate internal blobs * integrate NVTX * add numerically stable half precision log1pexp * fix indentation, following coding style, improve docs * remove accidental modification of IE code * Revert "add asynchronous forward" This reverts commit 1154b9da9da07e9b52f8a81bdcea48cf31c56f70. * [cmake] throw error for unsupported CC versions * fix rebase issues * add more docs, refactor code, fix bugs * minor refactoring and fixes * resolve warnings/errors from clang * remove haveCUDA() checks from supportBackend() * remove NVTX integration * changes based on review comments * avoid exception when no CUDA device is present * add color code for CUDA in Net::dump	2019-10-21 14:28:00 +03:00
Alexander Alekhin	2ad0487cec	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-08-13 18:32:29 +00:00
Tomoaki Teshima	40c71a2463	suppress noisy warning * add -Wno-psabi when using GCC 6 * add -Wundef for CUDA 10 * add -Wdeprecated-declarations when using GCC 7 * add -Wstrict-aliasing and -Wtautological-compare for GCC 7 * replace cudaThreadSynchronize with cudaDeviceSynchronize	2019-08-08 21:49:32 +09:00
Yashas Samaga B L	ae279966c2	Merge pull request #14660 from YashasSamaga:dnn-cuda-build add cuDNN dependency and setup build for cuda4dnn (#14660) * update cmake for cuda4dnn - Adds FindCUDNN - Adds new options: * WITH_CUDA * OPENCV_DNN_CUDA - Adds CUDA4DNN preprocessor symbol for the DNN module * FIX: append EXCLUDE_CUDA instead of overwrite * remove cuDNN dependency for user apps * fix unused variable warning	2019-06-02 14:47:15 +03:00
Alexander Alekhin	fcb07c64f3	cmake: fix build of dnn tests with shared common code - don't share .cpp files (PCH support is broken)	2019-03-31 08:52:25 +00:00
Sayed Adel	de22442046	dnn:perf add missing definition __OPENCV_TEST to fix pch	2019-03-31 03:28:33 +02:00
Lubov Batanina	7d3d6bc4e2	Merge pull request #13932 from l-bat:MyriadX_master_dldt * Fix precision in tests for MyriadX * Fix ONNX tests * Add output range in ONNX tests * Skip tests on Myriad OpenVINO 2018R5 * Add detect MyriadX * Add detect MyriadX on OpenVINO R5 * Skip tests on Myriad next version of OpenVINO * dnn(ie): VPU type from environment variable * dnn(test): validate VPU type * dnn(test): update DLIE test skip conditions	2019-03-29 16:42:58 +03:00
Alexander Alekhin	96c71dd3d2	dnn: reduce set of ignored warnings	2018-11-15 13:15:59 +03:00
Dmitry Kurtaev	c8f3579f93	Fix #12542 (#12603 ) * Fix #12542 * Remove ignore of non-virtual-dtor error	2018-09-26 16:08:51 +03:00

1 2

82 Commits