opencv

mirror of https://github.com/opencv/opencv.git synced 2024-11-24 19:20:28 +08:00

Author	SHA1	Message	Date
Abduragim Shtanchaev	9d0c8a9edb	Merge pull request #24445 from Abdurrahheem:ash/dev_einsum_pref Einsum Layer Performance Test #24445 ## This PR adds performance tests for Einsum Layer. See below results of performance test on different inputs Notation: - WX: windows10_x64 - MX: macos_x64 - MA: macos_arm64 - UX: ubuntu_x64 - UA: ubuntu_arm64 All data in ms (milliseconds). Gemm is backend for matrix multiplication --- Benchmarks: \| Equation \| Inputs Mat Dims \| UX (ms) \| UA (ms) \| MX (ms) \| MA (ms) \| WX (ms) \| \|-------------------------\|-----------------------------------\|----------------\|---------\|---------\|---------\|---------\| \| "ij, jk -> ik" \| [2, 3], [3,2] \| 0.04 ± 0.00 \| - \| - \| - \| - \| \| "ij, jk -> ik" \| [20, 30], [30,20] \| 0.08 ± 0.00 \| - \| - \| - \| - \| \| "ij, jk -> ik" \| [113, 127], [127,113] \| 2.41 ± 0.05 \| - \| - \| - \| - \| \| "imkj, injs -> imnks" \| [1, 4, 7, 9], [1, 5, 9, 8] \| 0.11 ± 0.00 \| - \| - \| - \| - \| \| "imkj, injs -> imnks" \| [1, 4, 70, 90], [1, 5, 90, 80] \| 15.49 ± 0.46 \| - \| - \| - \| - \| \| "imkj, injs -> imnks" \| [1, 4, 73, 91], [1, 5, 91, 57] \| 11.53 ± 0.06 \| - \| - \| - \| - \| \| "ij -> i" \| [30, 40] \| 0.03 ± 0.00 \| - \| - \| - \| - \| \| "ij -> i" \| [113, 374] \| 0.13 ± 0.00 \| - \| - \| - \| - \| \| "...ij -> ...i" \| [30, 40] \| 0.03 ± 0.00 \| - \| - \| - \| - \| \| "...ij -> ...i" \| [113, 374] \| 0.13 ± 0.00 \| - \| - \| - \| - \| \| "...ij, ...jk -> ...ik" \| [40, 50], [50,80] \| 0.37 ± 0.01 \| - \| - \| - \| - \| \| "...ij, ...jk -> ...ik" \| [47, 51], [51, 83] \| 0.43 ± 0.01 \| - \| - \| - \| - \| ----- ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. 
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-08 11:56:21 +03:00
Alexander Smorkalov	81907af74c	Merge pull request #24477 from asmorkalov:as/SimpleBlobDetector_js Add JavaScript bindings for SimpleBlobDetector	2023-11-08 08:37:12 +03:00
huafengchun	fb352e3098	Link lib_acl_op_compiler when compile with CANN	2023-11-08 10:42:28 +08:00
Alexander Smorkalov	26f3514992	Merge pull request #24474 from asmorkalov:as/BOWImgDescriptorExtractoor_java_ctor Added Java bindings for BOWImgDescriptorExtractor constructor	2023-11-07 22:00:32 +03:00
Yuantao Feng	6079e22523	Merge pull request #24500 from fengyuentau:test_layer_fusion dnn (onnx): add subgraph fusion tests #24500 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-07 17:40:31 +03:00
alexlyulkov	30549d65c2	Merge pull request #24456 from alexlyulkov:al/aar Added scripts for creating an AAR package and a local Maven repository with OpenCV library #24456 Added scripts for creating an AAR package and a local Maven repository with OpenCV library. The build_java_shared_aar.py script creates AAR with Java + C++ shared libraries. The build_static_aar.py script creates AAR with static C++ libraries. The scripts use an Android project template. The project is almost a default Android AAR library project with empty Java code and one empty C++ library. Only build.gradle.template and CMakeLists.txt.template files contain significant changes. See README.md for more information.	2023-11-07 14:23:33 +03:00
Yuantao Feng	ee0822dc4d	Merge pull request #24378 from fengyuentau:instance_norm dnn onnx: add instance norm layer #24378 Resolves https://github.com/opencv/opencv/issues/24377 Relates https://github.com/opencv/opencv/pull/24092#discussion_r1349841644 \| Perf \| multi-thread \| single-thread \| \| - \| - \| - \| \| x: [2, 64, 180, 240] \| 3.95ms \| 11.12ms \| Todo: - [x] speed up by multi-threading - [x] add perf - [x] add backend: OpenVINO - [x] add backend: CUDA - [x] add backend: OpenCL (no fp16) - [ ] add backend: CANN (will be done via https://github.com/opencv/opencv/pull/24462) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake ``` force_builders=Linux OpenCL,Win64 OpenCL,Custom buildworker:Custom=linux-4 build_image:Custom=ubuntu:18.04 modules_filter:Custom=none disable_ipp:Custom=ON ```	2023-11-07 12:59:10 +03:00
Vincent Rabaud	832f738db0	Merge pull request #24495 from vrabaud:fast_math_compile Get the SSE2 condition match the emmintrin.h inclusion condition. #24495 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-07 09:06:28 +03:00
Wanli	ed52f7feea	Improve and refactor softmax layer (#24466 ) * improve and refactor softmax layer * fix building error * compatible region layer * fix axisStep when disable SIMD * fix dynamic array * try to fix error * use nlanes from VTraits * move axisBias to srcOffset * fix bug caused by axisBias * remove macro * replace #ifdef with #if for CV_SIMD	2023-11-06 04:48:32 +03:00
richard28039	e95c0055af	Merge pull request #24397 from richard28039:add_fcnresnet101_to_dnn_sample Added PyTorch fcnresnet101 segmentation conversion cases #24397 We wrote sample code that converts PyTorch fcnresnet101 to ONNX and runs it in OpenCV. We took the input source image ourselves. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [X] I agree to contribute to the project under Apache 2 License. - [X] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2023-11-03 15:42:43 +03:00
Alexander Smorkalov	6a65678592	Use video stream fps first in FFmpeg backend for VideoCapture.	2023-11-03 15:39:37 +03:00
Dmitry Kurtaev	fa56623458	Merge pull request #24463 from dkurt:dnn_shared_nodes_fusion DNN graph fusion with shared nodes #24463 ### Pull Request Readiness Checklist For now, nodes from matched pattern are removed during the matching process so if nodes are used in similar subgraph, they cannot be found. required for https://github.com/opencv/opencv/pull/24397 Merge with extra: https://github.com/opencv/opencv_extra/pull/1115 A part from [model_name ](https://github.com/onnx/models/blob/main/vision/object_detection_segmentation/fcn/model/fcn-resnet101-11.onnx) with two Resize subgraphs with shared nodes: ![image](https://github.com/opencv/opencv/assets/25801568/611d89d9-12fb-4add-9218-13b10d2c086a) See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-03 12:34:09 +03:00
Alexander Smorkalov	fe4d518d85	Merge pull request #24485 from hanliutong:rvv-opt Optimize the Implementation of RVV Universal Intrinsic.	2023-11-03 12:31:10 +03:00
Rostislav Vasilikhin	ea47cb3ffe	Merge pull request #24480 from savuor:backport_patch_nans Backport to 4.x: patchNaNs() SIMD acceleration #24480 backport from #23098 connected PR in extra: [#1118@extra](https://github.com/opencv/opencv_extra/pull/1118) ### This PR contains: * new SIMD code for `patchNaNs()` * CPU perf test <details> <summary>Performance comparison</summary> Geometric mean (ms) \|Name of Test\|noopt\|sse2\|avx2\|sse2 vs noopt (x-factor)\|avx2 vs noopt (x-factor)\| \|---\|:-:\|:-:\|:-:\|:-:\|:-:\| \|PatchNaNs::OCL_PatchNaNsFixture::(640x480, 32FC1)\|0.019\|0.017\|0.018\|1.11\|1.07\| \|PatchNaNs::OCL_PatchNaNsFixture::(640x480, 32FC4)\|0.037\|0.037\|0.033\|1.00\|1.10\| \|PatchNaNs::OCL_PatchNaNsFixture::(1280x720, 32FC1)\|0.032\|0.032\|0.033\|0.99\|0.98\| \|PatchNaNs::OCL_PatchNaNsFixture::(1280x720, 32FC4)\|0.072\|0.072\|0.070\|1.00\|1.03\| \|PatchNaNs::OCL_PatchNaNsFixture::(1920x1080, 32FC1)\|0.051\|0.051\|0.050\|1.00\|1.01\| \|PatchNaNs::OCL_PatchNaNsFixture::(1920x1080, 32FC4)\|0.137\|0.138\|0.128\|0.99\|1.06\| \|PatchNaNs::OCL_PatchNaNsFixture::(3840x2160, 32FC1)\|0.137\|0.128\|0.129\|1.07\|1.06\| \|PatchNaNs::OCL_PatchNaNsFixture::(3840x2160, 32FC4)\|0.450\|0.450\|0.448\|1.00\|1.01\| \|PatchNaNs::PatchNaNsFixture::(640x480, 32FC1)\|0.149\|0.029\|0.020\|5.13\|7.44\| \|PatchNaNs::PatchNaNsFixture::(640x480, 32FC2)\|0.304\|0.058\|0.040\|5.25\|7.65\| \|PatchNaNs::PatchNaNsFixture::(640x480, 32FC3)\|0.448\|0.086\|0.059\|5.22\|7.55\| \|PatchNaNs::PatchNaNsFixture::(640x480, 32FC4)\|0.601\|0.133\|0.083\|4.51\|7.23\| \|PatchNaNs::PatchNaNsFixture::(1280x720, 32FC1)\|0.451\|0.093\|0.060\|4.83\|7.52\| \|PatchNaNs::PatchNaNsFixture::(1280x720, 32FC2)\|0.892\|0.184\|0.126\|4.85\|7.06\| \|PatchNaNs::PatchNaNsFixture::(1280x720, 32FC3)\|1.345\|0.311\|0.230\|4.32\|5.84\| \|PatchNaNs::PatchNaNsFixture::(1280x720, 32FC4)\|1.831\|0.546\|0.436\|3.35\|4.20\| \|PatchNaNs::PatchNaNsFixture::(1920x1080, 32FC1)\|1.017\|0.250\|0.160\|4.06\|6.35\| 
\|PatchNaNs::PatchNaNsFixture::(1920x1080, 32FC2)\|2.077\|0.646\|0.605\|3.21\|3.43\| \|PatchNaNs::PatchNaNsFixture::(1920x1080, 32FC3)\|3.134\|1.053\|0.961\|2.97\|3.26\| \|PatchNaNs::PatchNaNsFixture::(1920x1080, 32FC4)\|4.222\|1.436\|1.288\|2.94\|3.28\| \|PatchNaNs::PatchNaNsFixture::(3840x2160, 32FC1)\|4.225\|1.401\|1.277\|3.01\|3.31\| \|PatchNaNs::PatchNaNsFixture::(3840x2160, 32FC2)\|8.310\|2.953\|2.635\|2.81\|3.15\| \|PatchNaNs::PatchNaNsFixture::(3840x2160, 32FC3)\|12.396\|4.455\|4.252\|2.78\|2.92\| \|PatchNaNs::PatchNaNsFixture::(3840x2160, 32FC4)\|17.174\|5.831\|5.824\|2.95\|2.95\| </details> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-03 08:58:07 +03:00
Liutong HAN	451ee3991e	Use local variable.	2023-11-03 10:21:13 +08:00
Alexander Smorkalov	7c9231ffba	Merge pull request #24478 from CCInc:mingw_fix Fix MinGW build issue due to obsensor	2023-11-02 15:26:43 +03:00
Alexander Smorkalov	2e49bf311a	Merge pull request #24468 from asmorkalov:as/python_ctor_docs Fixed Python signatures in Doxygen documentation.	2023-11-02 08:53:20 +03:00
Chris Lee	f530a24544	Fix MinGW build issue due to obsensor	2023-11-01 12:18:09 -06:00
Alexander Smorkalov	abc4eeb9a7	Add JavaScript bindings for SimpleBlobDetector.	2023-11-01 19:29:34 +03:00
Yuantao Feng	c91af16fa7	Merge pull request #24409 from fengyuentau:norm_kernel dnn: add shared fastNorm kernel for mvn, instance norm and layer norm #24409 Relates https://github.com/opencv/opencv/pull/24378#issuecomment-1756906570 TODO: - [x] add fastNorm - [x] refactor layer norm with fastNorm - [x] refactor mvn with fastNorm - [ ] add onnx mvn in importer (in a new PR?) - [ ] refactor instance norm with fastNorm (in another PR https://github.com/opencv/opencv/pull/24378, need to merge this one first though) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-11-01 14:33:57 +03:00
Alexander Alekhin	e202116b56	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2023-10-31 14:52:49 +00:00
Alexander Smorkalov	bd565df379	Added Java bindings for BOWImgDescriptorExtractor constructor.	2023-10-31 11:23:47 +03:00
Alexander Smorkalov	a3ebc0ae7f	Fixed Python signatures in Doxygen documentation.	2023-10-30 17:28:03 +03:00
Marek Kochanczyk	e9e6b1e22c	Merge pull request #24405 from kochanczyk:4.x Extend the signature of imdecodemulti() #24405 (Edited after addressing Reviewers' comments.) Add an argument to `imdecodemulti()` to enable optional selection of pages of multi-page images. By default, all pages are decoded. If used, the additional argument may specify a continuous selection of pages to decode. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] I agree to contribute to the project under Apache 2 License. - [X] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2023-10-30 11:58:08 +03:00
Giles Payne	617d7ff575	Merge pull request #24454 from komakai:refactorObjcRange Refactor ObjectiveC Range class #24454 ### Pull Request Readiness Checklist - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch Fix for build issue in #24405	2023-10-27 14:31:41 +03:00
Yuantao Feng	77a0ffc71d	Merge pull request #24461 from fengyuentau:tracker_vit_backend_target Video tracking (dnn): set backend and target for TrackerVit #24461 Resolves #24460 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-27 14:12:44 +03:00
Alexander Alekhin	52c33f4af3	Merge pull request #24451 from eplankin:3.4	2023-10-27 10:57:03 +00:00
Kumataro	1911c63826	fix: supress GCC13 warnings (#24434 ) * fix: supress GCC13 warnings * fix for review and compile-warning on MacOS	2023-10-26 09:00:58 +03:00
eplankin	cac1695099	Update IPPICV binaries (20230919)	2023-10-25 10:01:19 -07:00
cudawarped	38bc519e4a	Merge pull request #24363 from cudawarped:videoio_ffmpeg_add_stream_encapsulation videoio: Add raw encoded video stream muxing to cv::VideoWriter with CAP_FFMPEG #24363 Allow raw encoded video streams (e.g. h264[5]) to be encapsulated by `cv::VideoWriter` to video containers (e.g. mp4/mkv). Operates in a similar way to https://github.com/opencv/opencv/pull/15290 where encapsulation is enabled by setting the `VideoWriterProperties::VIDEOWRITER_PROP_RAW_VIDEO` flag when constructing `cv::VideoWriter` e.g. ``` VideoWriter container(fileNameOut, api, fourcc, fps, { width, height }, { VideoWriterProperties::VIDEOWRITER_PROP_RAW_VIDEO, 1 }); ``` and each raw encoded frame is passed as single row of a `CV_8U` `cv::Mat`. The main reason for this PR is to allow `cudacodec::VideoWriter` to output its encoded streams to a suitable container, see https://github.com/opencv/opencv_contrib/pull/3569. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-25 13:21:01 +03:00
Abduragim Shtanchaev	a3b3a589f9	Merge pull request #24322 from Abdurrahheem:ash/dev_einsum_ellips Ellipses supported added for Einsum Layer #24322 This PR addresses issues not covered in #24037. The test case for this patch is in PR [#1106](https://github.com/opencv/opencv_extra/pull/1106) in opencv_extra. Added: - [x] Broadcasting reduction "...ii ->...I" - [x] Add lazy shape deduction. "...ij, ...jk->...ik" Features to add: - [ ] Add implicit output computation support. "bij,bjk ->" (output subscripts should be "bik") - [ ] Add support for CUDA backend - [ ] BatchWiseMultiply optimize - [ ] Performance test ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-24 16:47:00 +03:00
Alexander Smorkalov	1fe0fc224c	Merge pull request #24441 from vrabaud:avif_check Make sure AVIF decoder is destroyed in case of failure	2023-10-24 13:18:03 +03:00
Vincent Rabaud	44c254c09d	Make sure AVIF decoder is destroyed in case of failure	2023-10-24 10:09:40 +02:00
Alexander Smorkalov	8b47361873	Merge pull request #24440 from COOLIRON2311:4.x tutorial_py_fourier_transform wrong division operator fix	2023-10-24 09:05:04 +03:00
Alexander Smorkalov	3429c27477	Merge pull request #24438 from vrabaud:avif_check Check the return value of avifDecoderSetIOMemory.	2023-10-24 08:56:55 +03:00
COOLIRON2311	099e002667	Fixed wrong division operator in py_tutorials doc	2023-10-23 19:41:29 +03:00
Vincent Rabaud	3c9c964630	Check the return value of avifDecoderSetIOMemory. The API will soon be made no_discard.	2023-10-23 14:56:24 +02:00
Amir Hassan	c2f909fc86	Merge pull request #23894 from kallaballa:blobFromImagesWithParams Pertaining Issue: https://github.com/opencv/opencv/issues/5697 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-20 14:27:40 +03:00
CNClareChen	d142a796d8	Merge pull request #23929 from CNClareChen:4.x * Optimize some function with lasx. Optimize some function with lasx. #23929 This patch optimizes some lasx functions and reduces the runtime of opencv_test_core from 662,238 ms to 633,603 ms on the 3A5000 platform. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-20 14:20:09 +03:00
Yuantao Feng	996b6c37c7	Merge pull request #24425 from fengyuentau:fix_timvx_test dnn: fix HAVE_TIMVX macro definition in dnn test #24425 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-20 14:16:51 +03:00
Alexander Smorkalov	1c0ca41b6e	Merge pull request #24371 from hanliutong:clean-up Clean up the obsolete API of Universal Intrinsic	2023-10-20 12:50:26 +03:00
andrewerf	b44cb33d2f	Merge pull request #21066 from andrewerf:21052-openvino-native-onnx Native ONNX to Inference Engine backend #21066 Resolves #21052 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV - [x] The PR is proposed to proper branch - [x] There is reference to original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable - [ ] The feature is well documented and sample code can be built with the project CMake	2023-10-20 11:49:27 +03:00
Alexander Smorkalov	1aa4621777	Merge pull request #24429 from vrabaud:inter_area1 Unconditionally create SuperScale in BarcodeDetector to avoid null deref	2023-10-20 11:47:03 +03:00
Vincent Rabaud	fcdaaabf7c	Unconditionally create SuperScale in BarcodeDetector to avoid null deref This pointer is called unconditionally in BarcodeImpl::initDecode assuming the size of the image is outside the specified bounds. This seems to not cause problems on optimized builds, I assume because the optimizer sees through the processImageScale call to see that it can be reduced to a resize call. Leaving it as is relies on undefined behavior. This was the least invasive change I could make, however, it might be worthwhile to pull up the logic for a resize so that a SuperScale does not need to be allocated, which seems to be the most common case.	2023-10-19 22:42:11 +02:00
Vincent Rabaud	c96f48e7c9	Merge pull request #24412 from vrabaud:inter_area1 Speed up line merging in INTER_AREA #24412 This provides a 10 to 20% speed-up. Related perf test fix: https://github.com/opencv/opencv/pull/24417 This is a split of https://github.com/opencv/opencv/pull/23525 that will be updated to only deal with column merging. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-19 14:06:50 +03:00
Alexander Smorkalov	a9664abb57	Merge pull request #24427 from fengyuentau:gather_elements_fp16 dnn: fp16 support for gather elements	2023-10-19 14:05:17 +03:00
Stefan Isak	5bffcdf7e8	Merge pull request #24382 from sisakat:cuda-compile-multicore Enable multicore CUDA compilation #24382 CUDA source files are compiled single threaded. The option `--threads` was introduced in NVCC 11.2. The option specifies the number of threads to be used for compilation (see [NVIDIA NVCC Documentation](https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#threads-number-t)). With CMake 3.12 the environment variable `CMAKE_BUILD_PARALLEL_LEVEL` was introduced (see [CMake Documentation](https://cmake.org/cmake/help/latest/envvar/CMAKE_BUILD_PARALLEL_LEVEL.html)). This variable is used to set the NVCC `--threads` option. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2023-10-19 13:13:21 +03:00
fengyuentau	f2ef81a179	fp16 support for gather elements	2023-10-19 14:44:12 +08:00
Kumataro	6e4280ea81	Merge pull request #24372 from Kumataro:fix24369 Supporting protobuf v22 and later(with abseil-cpp/C++17) #24372 fix https://github.com/opencv/opencv/issues/24369 related https://github.com/opencv/opencv/issues/23791 1. This patch supports external protobuf v22 and later, which require abseil-cpp and C++17. Even if the built-in protobuf is upgraded to v22 or later, the dependency on abseil-cpp and the requirement for C++17 will continue. 2. Some Caffe tests require a patched protobuf, so this patch disables them. This patch was tested with the following libraries. - Protobuf: /usr/local/lib/libprotobuf.so (4.24.4) - abseil-cpp: YES (20230125) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-10-19 08:45:08 +03:00
Vadim Pisarevsky	ba4d6c859d	added detection & dispatching of some modern NEON instructions (NEON_FP16, NEON_BF16) (#24420 ) * added more or less cross-platform (based on POSIX signal() semantics) method to detect various NEON extensions, such as FP16 SIMD arithmetics, BF16 SIMD arithmetics, SIMD dotprod etc. It could be propagated to other instruction sets if necessary. * hopefully fixed compile errors * continue to fix CI * another attempt to fix build on Linux aarch64 * * reverted to the original method to detect special arm neon instructions without signal() * renamed FP16_SIMD & BF16_SIMD to NEON_FP16 and NEON_BF16, respectively * removed extra whitespaces	2023-10-18 22:06:20 +03:00


33688 Commits
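
Note on the Einsum equations benchmarked in commit 9d0c8a9edb: the sketch below spells out, in plain Python, what two of the listed equations compute. It is only an illustration of the subscript notation, not OpenCV's Einsum layer; the function names `einsum_ij_jk_ik` and `einsum_ij_i` are hypothetical and the inputs are made-up nested lists.

```python
# Illustrative only: plain-Python versions of two Einsum equations from the
# benchmark table. Function names are hypothetical, not OpenCV API.

def einsum_ij_jk_ik(a, b):
    """'ij, jk -> ik': sum over the shared index j (matrix multiplication)."""
    rows, inner, cols = len(a), len(b), len(b[0])
    assert all(len(row) == inner for row in a), "inner dimensions must match"
    return [[sum(a[i][j] * b[j][k] for j in range(inner)) for k in range(cols)]
            for i in range(rows)]

def einsum_ij_i(a):
    """'ij -> i': j is absent from the output, so it is summed away."""
    return [sum(row) for row in a]

a = [[1, 2, 3], [4, 5, 6]]        # shape [2, 3]
b = [[1, 0], [0, 1], [1, 1]]      # shape [3, 2]
product = einsum_ij_jk_ik(a, b)   # shape [2, 2]
row_sums = einsum_ij_i(a)         # shape [2]
```

The ellipsis forms in the table ("...ij -> ...i", "...ij, ...jk -> ...ik") apply the same rules to the trailing axes while leaving any leading axes untouched.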