opencv

mirror of https://github.com/opencv/opencv.git synced 2025-01-09 21:27:59 +08:00

Author	SHA1	Message	Date
Alexander Smorkalov	6a8300c1a0	Merge pull request #26385 from vpisarev:fix_bfloat_test fixed typo in bfloat<=>float conversion test	2024-10-30 16:01:46 +03:00
Maksim Shabunin	e44e3ab0a7	build: raise min cmake version to 3.13 in other places	2024-10-30 14:39:04 +03:00
Vadim Pisarevsky	299aa14c4b	fixed typo in bfloat<=>float conversion test	2024-10-29 20:06:11 +03:00
Maksim Shabunin	7654d06b83	WinRT/UWP build: fix more warnings in media part	2024-10-29 19:19:09 +03:00
Alexander Smorkalov	41489f983d	Merge pull request #26381 from dkurt:dk/hotfix_dnn_debug Hotfix ie_ngraph.cpp in Debug	2024-10-29 12:33:08 +03:00
Dmitry Kurtaev	0e80a97f87	Hotfix ie_ngraph.cpp in Debug	2024-10-29 10:20:51 +03:00
Oちゃん	8791cd147c	Merge pull request #26374 from OrkWard:fix-js-build-script Fix incorrect string format in js build script #26374 I accidentally met this small problem mentioned in https://github.com/opencv/opencv/pull/25084#discussion_r1710838120 when play with wasm build. It seems https://github.com/EDVTAZ didn't fix it yet, so I create this tiny pr. Additionally, I remove a redundant argument in `add_argument` call. `'store_true'` already set the default, see https://docs.python.org/3/library/argparse.html#action. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-28 17:07:15 +03:00
WU Jia	66a29b422c	Merge pull request #25708 from kaingwade:flann2annoy Add interface to Annoy which will replace the FLANN #25708 This PR is to add interface to [Annoy](https://github.com/spotify/annoy) which will replace the FLANN, part of one of the cleanup work of OpenCV 5.0: #24998. After it, there will be consecutive patches: - [ ] Add Annoy based DescriptorMatcher - [ ] Replace FLANN based code with Annoy and remove FLANN completely ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-28 17:04:02 +03:00
Junyan721113	bf7ab8eebd	feat: medianBlur & bilateralFilter	2024-10-28 17:54:45 +08:00
alexlyulkov	a2fa1d49a4	Merge pull request #26208 from alexlyulkov:al/new-engine-caffe-parser Modified Caffe parser to support the new dnn engine #26208 Now the Caffe parser supports both the old and the new engine. It can be selected using newEngine argument in PopulateNet. All cpu Caffe tests work fine except: - Test_Caffe_nets.Colorization - Test_Caffe_layers.FasterRCNN_Proposal Both these tests doesn't work because of the bug in the new net.forward function. The function takes the name of the desired target last layer, but uses this name as the name of the desired output tensor. Also Colorization test contains a strange model with a Silence layer in the end, so it doesn't have outputs. The old parser just ignored it. I think, the proper solution is to run this model until the (number_of_layers - 2) layer using proper net.forward arguments in the test. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-28 11:32:07 +03:00
Gursimar Singh	7b0a082dd4	Merge pull request #26326 from gursimarsingh:object_detection_fixed [BUG FIX] Object detection sample preprocessing #26326 PR resloves #26315 related to incorrect preprocessing for 'Image2BlobParams' in object detection sample. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-28 11:25:45 +03:00
Gursimar Singh	f217656916	Merge pull request #25349 from gursimarsingh:videocapture_samples_cpp combined videocapture and videowriter samples for cleanup	2024-10-28 09:57:54 +03:00
Gursimar Singh	331d327760	Merge pull request #26336 from gursimarsingh:person_reid_bug_fix [BUG FIX] Fix issues in Person ReID C++ sample #26336 This PR fixes multiple issues in the Person ReID C++ sample that were causing incorrect outputs. It addresses improper matrix initialization, adds a missing return statement, and ensures that vectors are properly cleared before reuse. These changes correct the output of the sample. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-28 09:49:33 +03:00
Kumataro	3b01a4d4e9	Merge pull request #26373 from Kumataro:fix26372 doc: fix doxygen errors at Algorithm and QRCodeEncoder #26373 Close https://github.com/opencv/opencv/issues/26372 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-28 09:20:04 +03:00
Alexander Smorkalov	8a5ec4bf7b	Merge pull request #26287 from mshabunin:cpp-17 build: transition to C++17, minor changes in documentation	2024-10-26 19:59:48 +03:00
Wanli	29e712ed93	Merge pull request #26369 from WanliZhong:5x_fix_hfloat_vfunc Fix hfloat conflicts of v_func in merging 4.x to 5.x #26369 This PR solves the conflicts in merging 4.x to 5.x https://github.com/opencv/opencv/pull/26358 1. Explicitly convert the inputs number for `v_setall_` to hfloat number 2. Loosens the threshold for `v_sincos` test. (related issue: https://github.com/opencv/opencv/issues/26362) 3. Remove the new but temp api `template <> inline v_float16x8 v_setall_(float v) { return v_setall_f16((hfloat)v); }` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-26 19:54:13 +03:00
Alexander Smorkalov	dd08328228	Merge pull request #26368 from hanliutong:rvv-hal-license Add the missing license header in hal_rvv.	2024-10-26 09:36:07 +03:00
Alexander Smorkalov	24a497acd8	Merge pull request #26370 from mshabunin:fix-winrt-warnings WinRT/UWP build: fix some specific warnings	2024-10-26 09:34:00 +03:00
Maksim Shabunin	52100328d8	WinRT/UWP build: fix some specific warnings	2024-10-25 22:32:44 +03:00
Alexander Smorkalov	05e7988e9c	Merge pull request #26367 from alexlyulkovЖal/forward-to-layer-assert Added exception when calling forward to specified layer with the new dnn engine	2024-10-25 15:22:09 +03:00
Maksim Shabunin	d223e796f5	build: transition to C++17, minor changes in documentation	2024-10-25 15:05:14 +03:00
Liutong HAN	515b4a2689	Add the missing license description.	2024-10-25 11:37:07 +00:00
Alexander Lyulkov	3a4c88c33e	Added exception when calling forward to specified layer with the new dnn engine	2024-10-25 13:00:15 +03:00
Alexander Smorkalov	8e55659afe	Merge branch 4.x	2024-10-24 15:10:43 +03:00
Alexander Smorkalov	e4bcd46f64	Merge pull request #26356 from hardikkamboj:4.x Update py_thresholding.markdown	2024-10-24 12:39:43 +03:00
Liutong HAN	35571be570	Merge pull request #26318 from hanliutong:rvv-intrin-m2 Use LMUL=2 in the RISC-V Vector (RVV) backend of Universal Intrinsic. #26318 The modification of this patch involves the RVV backend of Universal Intrinsic, replacing `LMUL=1` with `LMUL=2`. Now each Universal Intrinsic type actually corresponds to two RVV vector registers, and each Intrinsic function also operates two vector registers. Considering that algorithms written using Universal Intrinsic usually do not use the maximum number of registers, this can help the RVV backend utilize more register resources without modifying the algorithm implementation This patch is generally beneficial in performance. We compiled OpenCV with `Clang-19.1.1` and `GCC-14.2.0` , ran it on `CanMV-k230` and `Banana-Pi F3`. Then we have four scenarios on combinations of compilers and devices. In `opencv_perf_core`, there are 3363 cases, of which: - 901 (26.8%) cases achieved more than `5%` performance improvement in all four scenarios, and the average speedup of these test cases (compared to scalar) increased from `3.35x` to `4.35x` - 75 (2.2%) cases had more than `5%` performance loss in all four scenarios, indicating that these cases are better with `LMUL=1` instead of `LMUL=2`. This involves `Mat_Transform`, `hasNonZero`, `KMeans`, `meanStdDev`, `merge` and `norm2`. Among them, `Mat_Transform` only has performance degradation in a few cases (`8UC3`), and the actual execution time of `hasNonZero` is so short that it can be ignored. For `KMeans`, `meanStdDev`, `merge` and `norm2`, we should be able to use the HAL to optimize/restore their performance. (In fact, we have already done this for `merge` #26216 ) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-24 10:08:43 +03:00
Alexander Smorkalov	331412dfad	Merge pull request #26357 from dkurt:dkurt/ov_out_names_from_graph OpenVINO friendly output names from non-compiled Model	2024-10-23 13:42:01 +03:00
Dmitry Kurtaev	d193554a5f	OpenVINO friendly output names from non-compiled Model	2024-10-23 09:29:05 +03:00
Alexander Smorkalov	898a2a3811	Merge pull request #26353 from asmorkalov:as/ade_1.2e ADE update to 0.1.2e	2024-10-23 08:10:16 +03:00
Hardik Kamboj	9fc7ca8ed1	Update py_thresholding.markdown Changed "If the pixel value is smaller than the threshold" to "If the pixel value is smaller than or equal to the threshold" to make the line align with the working of the code.	2024-10-23 09:49:23 +05:30
Alexander Smorkalov	983086411f	ADE update to 0.1.2e	2024-10-22 17:45:00 +03:00
Alexander Smorkalov	9f0c3f5b2b	Merge pull request #26327 from asmorkalov:as/drop_convertFp16 Finally dropped convertFp16 function in favor of cv::Mat::convertTo() #26327 Partially address https://github.com/opencv/opencv/issues/24909 Related PR to contrib: https://github.com/opencv/opencv_contrib/pull/3812 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-22 15:17:24 +03:00
Alexander Smorkalov	57ccbee25d	Merge pull request #26245 from cudawarped:cuda_update_to_npp_stream_ctx cuda - update npp calls to use the new NppStreamContext API if available	2024-10-22 14:44:42 +03:00
Kumataro	4398e0b62b	Merge pull request #26340 from Kumataro:wa26339 doc: fix the position of toggle button #26340 Close #26339 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-22 11:57:14 +03:00
alexlyulkov	a40ceff215	Merge pull request #26330 from alexlyulkov:al/new-engine-tflite-parser2 Modified TFLite parser for the new dnn engine #26330 The new dnn graph is creating just by defining input and output names of each layer. Some TFLite layers has fused activation, which doesn't have layer name and input and output names. Also some layers require additional preprocessing layers (e.g. NHWC -> NCHW). All these layers should be added to the graph with some unique layer and input and output names. I solve this problem by adding additionalPreLayer and additionalPostLayer layers. If a layer has a fused activation, I add additionalPostLayer and change input and output names this way: original: conv_relu(conv123, conv123_input, conv123_output) new: conv(conv123, conv123_input, conv123_output_additional_post_layer) + relu(conv123_relu, conv1_output_additional_post_layer, conv123_output) If a layer has additional preprocessing layer, I change input and output names this way: original: permute_reshape(reshape345, reshape345_input, reshape345_output) new: permute(reshape345_permute, reshape345_input, reshape345_input_additional_pre_layer) + reshape(reshape345, reshape345_input_additional_pre_layer, reshape345_output) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-22 09:05:58 +03:00
Alexander Smorkalov	6648482b69	Merge pull request #26324 from asmorkalov:as/model_diagnostics_engine Added DNN engine selector to model diagnostics tool.	2024-10-21 15:51:53 +03:00
Alexander Smorkalov	94d5ad09ff	Merge pull request #26284 from fzuuzf:enum_arithmetic_fixes_for_c++26 C++26 Deprecated Enum Arithmetic Conversion: Fix core/mat.inl.hpp	2024-10-21 15:47:53 +03:00
Alexander Smorkalov	e026a5ad8a	Merge pull request #26281 from kallaballa:clgl_device_discovery Rewrote OpenCL-OpenGL-interop device discovery routine without extensions and with Apple support	2024-10-18 15:52:17 +03:00
Alexander Smorkalov	c79b72a838	Merge pull request #26335 from migueldaipre:4.x fix: performance typo	2024-10-18 15:44:32 +03:00
Vadim Pisarevsky	6e3c5db1c6	Merge pull request #26333 from vpisarev:fix_26322 Fix #26322: construction of another Mat header for empty matrix #26333 The PR fixes #26322 - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-18 14:50:27 +03:00
Vadim Pisarevsky	2f35847960	Merge pull request #26321 from vpisarev:better_bfloat 2x more accurate float => bfloat conversion #26321 There is a magic trick to make float => bfloat conversion more accurate (_original reference needed, is it done this way in PyTorch?_). In simplified form it looks like: ``` uint16_t f2bf(float x) { union { unsigned u; float f; } u; u.f = x; // return (uint16_t)(u.u >> 16); <== the old method before this patch return (uint16_t)((u.u + 0x8000) >> 16); } ``` it works correctly for almost all valid floating-point values, positive, zero or negative, and even for some extreme cases, like `+/-inf`, `nan` etc. The addition of `0x8000` to integer representation of 32-bit float before retrieving the highest 16 bits reduces the rounding error by ~2x. The slight problem with this improved method is that the numbers very close to or equal to `+/-FLT_MAX` are mistakenly converted to `+/-inf`, respectively. This patch implements improved algorithm for `float => bfloat` conversion in scalar and vector form; it fixes the above-mentioned problem using some extra bit magic, i.e. 0x8000 is not added to very big (by absolute value) numbers: ``` // the actual implementation is more efficient, // without conditions or floating-point operations, see the source code return (uint16_t)(u.u + (fabsf(x) <= big_threshold ? 0x8000 : 0)) >> 16); ``` The corresponding test has been added as well and this is output from the test: ``` [----------] 1 test from Core_BFloat [ RUN ] Core_BFloat.convert maxerr0 = 0.00774842, mean0 = 0.00190643, stddev0 = 0.00186063 maxerr1 = 0.00389057, mean1 = 0.000952614, stddev1 = 0.000931268 [ OK ] Core_BFloat.convert (7 ms) ``` Here `maxerr0, mean0, stddev0` are for the original method and `maxerr1, mean1, stddev1` are for the new method. As you can see, there is a significant improvement in accuracy. Note: _Actually, on ~32,000,000 random FP32 numbers with uniformly distributed sign, exponent and mantissa the new method is always at least as accurate as the old one._ The test also checks all the corner cases, where we see no degradation either vs the original method. - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-18 14:46:40 +03:00
Kumataro	35dbf32227	Merge pull request #26211 from Kumataro:fix26207 imgcodecs: implement imencodemulti() #26211 Close #26207 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-18 14:44:55 +03:00
Miguel Daipré	888469a842	fix: performance typo	2024-10-18 08:37:32 -03:00
Alexander Smorkalov	9773694527	Added DNN engine selector to model diagnostics tool.	2024-10-17 15:09:13 +03:00
Gursimar Singh	1696819abb	Merge pull request #25667 from gursimarsingh:improved_person_reid_python Improved person reid cpp and python sample #25667 #25006 This sample has been rewritten to track a selected target in a video or camera stream. Person detection has been integrated using yolov8 and the user can provide a target image via command line or interactively select the target at start of the execution ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-17 10:46:53 +03:00
Septimiu Neaga	3919f33e21	Merge pull request #26293 from SeptimiuIoachimNeagaIntel:EISW-140103_optimization_flag G-API: Introduce level optimization flag for ONNXRT backend #26293 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-10-17 10:22:08 +03:00
FantasqueX	489df18a13	Merge pull request #26313 from FantasqueX:ipp-warp-affine-border-value Use border value in ipp version of warp affine #26313 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-17 08:50:30 +03:00
Alexander Smorkalov	d20c456ab7	Merge pull request #26320 from mshabunin:fix-cmake-in-list build: set cmake policy for if(IN_LIST) support	2024-10-17 07:36:02 +03:00
Maksim Shabunin	8ba76e65e9	build: set cmake policy for if(IN_LIST) support	2024-10-16 22:40:47 +03:00
Vadim Pisarevsky	3cd57ea09e	Merge pull request #26056 from vpisarev:new_dnn_engine New dnn engine #26056 This is the 1st PR with the new engine; CI is green and PR is ready to be merged, I think. Merge together with https://github.com/opencv/opencv_contrib/pull/3794 --- Known limitations: * [solved] OpenVINO is temporarily disabled, but is probably easy to restore (it's not a deal breaker to merge this PR, I guess) * The new engine does not support any backends nor any targets except for the default CPU implementation. But it's possible to choose the old engine when loading a model, then all the functionality is available. * [Caffe patch is here: #26208] The new engine only supports ONNX. When a model is constructed manually or is loaded from a file of different format (.tf, .tflite, .caffe, .darknet), the old engine is used. * Even in the case of ONNX some layers are not supported by the new engine, such as all quantized layers (including DequantizeLinear, QuantizeLinear, QLinearConv etc.), LSTM, GRU, .... It's planned, of course, to have full support for ONNX by OpenCV 5.0 gold release. When a loaded model contains unsupported layers, we switch to the old engine automatically (at ONNX parsing time, not at `forward()` time). * Some layers , e.g. Expat, are only partially supported by the new engine. In the case of unsupported flavours it switches to the old engine automatically (at ONNX parsing time, not at `forward()` time). * 'Concat' graph optimization is disabled. The optimization eliminates Concat layer and instead makes the layers that generate tensors to be concatenated to write the outputs to the final destination. Of course, it's only possible when `axis=0` or `axis=N=1`. The optimization is not compatible with dynamic shapes since we need to know in advance where to store the tensors. Because some of the layer implementations have been modified to become more compatible with the new engine, the feature appears to be broken even when the old engine is used. * Some `dnn::Net` API is not available with the new engine. Also, shape inference may return false if some of the output or intermediate tensors' shapes cannot be inferred without running the model. Probably this can be fixed by a dummy run of the model with zero inputs. * Some overloads of `dnn::Net::getFLOPs()` and `dnn::Net::getMemoryConsumption()` are not exposed any longer in wrapper generators; but the most useful overloads are exposed (and checked by Java tests). * [in progress] A few Einsum tests related to empty shapes have been disabled due to crashes in the tests and in Einsum implementations. The code and the tests need to be repaired. * OpenCL implementation of Deconvolution is disabled. It's very bad and very slow anyway; need to be completely revised. * Deconvolution3D test is now skipped, because it was only supported by CUDA and OpenVINO backends, both of which are not supported by the new engine. * Some tests, such as FastNeuralStyle, checked that the in the case of CUDA backend there is no fallback to CPU. Currently all layers in the new engine are processed on CPU, so there are many fallbacks. The checks, therefore, have been temporarily disabled. --- - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-16 15:28:19 +03:00

... 2 3 4 5 6 ...

35261 Commits