opencv

mirror of https://github.com/opencv/opencv.git synced 2025-07-20 19:17:36 +08:00

Author	SHA1	Message	Date
Alexander Smorkalov	b129abfdaa	Merge pull request #27055 from hanliutong:UI-loop-condition Fix some vectorized loop conditions.	2025-03-13 14:12:30 +03:00
Vincent Rabaud	186537a315	Move the CV_Assert above the << operation to not trigger the fuzzer	2025-03-13 10:00:49 +01:00
Liutong HAN	fd62bd0991	Relax the loop condition to process the final batch.	2025-03-13 07:54:41 +00:00
GenshinImpactStarts	e30697fd42	Merge pull request #27002 from GenshinImpactStarts:magnitude [HAL RVV] impl magnitude \| add perf test #27002 Implement through the existing `cv_hal_magnitude32f` and `cv_hal_magnitude64f` interfaces. UPDATE: UI is enabled. The only difference between UI and HAL now is HAL use a approximate `sqrt`. Perf test done on MUSE-PI. ```sh $ opencv_test_core --gtest_filter="Magnitude" $ opencv_perf_core --gtest_filter="Magnitude" --perf_min_samples=300 --perf_force_samples=300 ``` Test result between enabled UI and HAL: ``` Name of Test ui rvv rvv vs ui (x-factor) Magnitude::MagnitudeFixture::(127x61, 32FC1) 0.029 0.016 1.75 Magnitude::MagnitudeFixture::(127x61, 64FC1) 0.057 0.036 1.57 Magnitude::MagnitudeFixture::(640x480, 32FC1) 1.063 0.648 1.64 Magnitude::MagnitudeFixture::(640x480, 64FC1) 2.261 1.530 1.48 Magnitude::MagnitudeFixture::(1280x720, 32FC1) 3.261 2.118 1.54 Magnitude::MagnitudeFixture::(1280x720, 64FC1) 6.802 4.682 1.45 Magnitude::MagnitudeFixture::(1920x1080, 32FC1) 7.287 4.738 1.54 Magnitude::MagnitudeFixture::(1920x1080, 64FC1) 15.226 10.334 1.47 ``` Test result before and after enabling UI: ``` Name of Test orig pr pr vs orig (x-factor) Magnitude::MagnitudeFixture::(127x61, 32FC1) 0.032 0.029 1.11 Magnitude::MagnitudeFixture::(127x61, 64FC1) 0.067 0.057 1.17 Magnitude::MagnitudeFixture::(640x480, 32FC1) 1.228 1.063 1.16 Magnitude::MagnitudeFixture::(640x480, 64FC1) 2.786 2.261 1.23 Magnitude::MagnitudeFixture::(1280x720, 32FC1) 3.762 3.261 1.15 Magnitude::MagnitudeFixture::(1280x720, 64FC1) 8.549 6.802 1.26 Magnitude::MagnitudeFixture::(1920x1080, 32FC1) 8.408 7.287 1.15 Magnitude::MagnitudeFixture::(1920x1080, 64FC1) 18.884 15.226 1.24 ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-13 08:34:11 +03:00
Vincent Rabaud	71fe903121	Merge pull request #27040 from vrabaud:png_leak Make sure there are enough channels to check for opacity #27040 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 21:06:01 +03:00
Alexander Smorkalov	7481cb50b5	Merge pull request #27013 from asmorkalov:as/imencode_animation Test for in-memory animation encoding and decoding #27013 Tests for https://github.com/opencv/opencv/pull/26964 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 18:10:06 +03:00
Alexander Smorkalov	bbcdbca872	Merge pull request #27041 from asmorkalov:as/decolor_opt Local decolor pipeline optimization	2025-03-12 18:03:13 +03:00
Pierre Chatelier	d83df66ff0	Merge pull request #26834 from chacha21:findContours_speedup Find contours speedup #26834 It is an attempt, as suggested by #26775, to restore lost speed when migrating `findContours()` implementation from C to C++ The patch adds an "Arena" (a pool) of pre-allocated memory so that contours points (and TreeNodes) can be picked from the Arena. The code of `findContours()` is mostly unchanged, the arena usage being implicit through a utility class Arena::Item that provides C++ overloaded operators and construct/destruct logic. As mentioned in #26775, the contour points are allocated and released in order, and can be represented by ranges of indices in their arena. No range subset will be released and drill a hole, that's why the internal representation as a range of indices makes sense. The TreeNodes use another Arena class that does not comply to that range logic. Currently, there is a significant improvement of the run-time on the test mentioned in #26775, but it is still far from the `findContours_legacy()` performance. - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [X] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 18:00:01 +03:00
Pierre Chatelier	0db6a496ba	Merge pull request #26842 from chacha21:threshold_with_mask Added optional mask to cv::threshold #26842 Proposal for #26777 To avoid code duplication, and keep performance when no mask is used, inner implementation always propagate the const cv::Mat& mask, but they use a template<bool useMask> parameter that let the compiler optimize out unnecessary tests when the mask is not to be used. See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [X] There is a reference to the original bug report and related work - [X] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 17:55:07 +03:00
Alexander Smorkalov	49ab8121b7	Merge pull request #27050 from hanliutong:rvv-fix-27003 RISC-V: Fix #27003.	2025-03-12 17:32:19 +03:00
Yuantao Feng	eefa327f30	Merge pull request #27042 from fengyuentau:4x/core/normDiff_simd core: vectorize normDiff with universal intrinsics #27042 Merge with https://github.com/opencv/opencv_extra/pull/1242. Performance results on Desktop Intel i7-12700K, Apple M2, Jetson Orin and SpaceMIT K1: [perf-normDiff.zip](https://github.com/user-attachments/files/19178689/perf-normDiff.zip) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 16:43:10 +03:00
Liutong HAN	2969b67bd7	Fix 27003.	2025-03-12 12:15:05 +00:00
Maxim Smolskiy	46dbc57a86	Merge pull request #26968 from MaximSmolskiy:fix-Aruco-marker-incorrect-detection-near-image-edge Fix Aruco marker incorrect detection near image edge #26968 ### Pull Request Readiness Checklist Fix #26922 As I understood the algorithm, at the first stage we search for the contours of the marker several times (adaptive threshold with different windows sizes). Therefore, for the same marker, we get several contours (inner and outer with different sizes due to the different windows sizes). In the second stage, we group the contours for the same marker into one group, from which we take the largest contour as the best candidate (which should best match the border of the marker). The problem is that using the `minDistanceToBorder` parameter, we discard contours at the first stage. Thus, we discard the best candidates most appropriate to the marker border, and inner contours may remain, representing a significantly smaller marker border (which we observe in the issue). But if we use the `minDistanceToBorder` parameter to discard the best candidate of the group at the second stage, then there will be no such problems and we will completely discard markers located too close to the border of the image. See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2025-03-12 09:47:49 +03:00
GenshinImpactStarts	60de3ff24f	Merge pull request #27015 from GenshinImpactStarts:sqrt [HAL RVV] impl sqrt and invSqrt #27015 Implement through the existing interfaces `cv_hal_sqrt32f`, `cv_hal_sqrt64f`, `cv_hal_invSqrt32f`, `cv_hal_invSqrt64f`. Perf test done on MUSE-PI and CanMV K230. Because the performance of scalar is much worse than universal intrinsic, only ui and hal rvv is compared. In RVV's UI, `invSqrt` is computed using `1 / sqrt()`. This patch first uses `frsqrt` and then applies the Newton-Raphson method to achieve higher precision. For the initial value, I tried using the famous [fast inverse square root algorithm](https://en.wikipedia.org/wiki/Fast_inverse_square_root), which involves one bit shift and one subtraction. However, on both MUSE-PI and CanMV K230, the performance was slightly lower (about 3%), so I chose to use `frsqrt` for the initial value instead. BTW, I think this patch can directly replace RVV's UI. UPDATE: Due to strange vector registers allocation strategy in clang, for `invSqrt`, clang use LMUL m4 while gcc use LMUL m8, which leads to some performance loss in clang. So the test for clang is appended. ```sh $ opencv_test_core --gtest_filter="Core_HAL/mathfuncs." $ opencv_perf_core --gtest_filter="SqrtFixture." --perf_min_samples=300 --perf_force_samples=300 ``` CanMV K230: ``` Name of Test ui rvv rvv vs ui (x-factor) Sqrt::SqrtFixture::(127x61, 5, false) 0.052 0.027 1.96 Sqrt::SqrtFixture::(127x61, 5, true) 0.101 0.026 3.80 Sqrt::SqrtFixture::(127x61, 6, false) 0.106 0.059 1.79 Sqrt::SqrtFixture::(127x61, 6, true) 0.207 0.058 3.55 Sqrt::SqrtFixture::(640x480, 5, false) 1.988 0.956 2.08 Sqrt::SqrtFixture::(640x480, 5, true) 3.920 0.948 4.13 Sqrt::SqrtFixture::(640x480, 6, false) 4.179 2.342 1.78 Sqrt::SqrtFixture::(640x480, 6, true) 8.220 2.290 3.59 Sqrt::SqrtFixture::(1280x720, 5, false) 5.969 2.881 2.07 Sqrt::SqrtFixture::(1280x720, 5, true) 11.731 2.857 4.11 Sqrt::SqrtFixture::(1280x720, 6, false) 12.533 7.031 1.78 Sqrt::SqrtFixture::(1280x720, 6, true) 24.643 6.917 3.56 Sqrt::SqrtFixture::(1920x1080, 5, false) 13.423 6.483 2.07 Sqrt::SqrtFixture::(1920x1080, 5, true) 26.379 6.436 4.10 Sqrt::SqrtFixture::(1920x1080, 6, false) 28.200 15.833 1.78 Sqrt::SqrtFixture::(1920x1080, 6, true) 55.434 15.565 3.56 ``` MUSE-PI: ``` GCC \| clang Name of Test ui rvv rvv \| ui rvv rvv vs \| vs ui \| ui (x-factor) \| (x-factor) Sqrt::SqrtFixture::(127x61, 5, false) 0.027 0.018 1.46 \| 0.027 0.016 1.65 Sqrt::SqrtFixture::(127x61, 5, true) 0.050 0.017 2.98 \| 0.050 0.017 2.99 Sqrt::SqrtFixture::(127x61, 6, false) 0.053 0.031 1.72 \| 0.052 0.032 1.64 Sqrt::SqrtFixture::(127x61, 6, true) 0.100 0.030 3.31 \| 0.101 0.035 2.86 Sqrt::SqrtFixture::(640x480, 5, false) 0.955 0.483 1.98 \| 0.959 0.499 1.92 Sqrt::SqrtFixture::(640x480, 5, true) 1.873 0.489 3.83 \| 1.873 0.520 3.60 Sqrt::SqrtFixture::(640x480, 6, false) 2.027 1.163 1.74 \| 2.037 1.218 1.67 Sqrt::SqrtFixture::(640x480, 6, true) 3.961 1.153 3.44 \| 3.961 1.341 2.95 Sqrt::SqrtFixture::(1280x720, 5, false) 2.916 1.538 1.90 \| 2.912 1.598 1.82 Sqrt::SqrtFixture::(1280x720, 5, true) 5.735 1.534 3.74 \| 5.726 1.661 3.45 Sqrt::SqrtFixture::(1280x720, 6, false) 6.121 3.585 1.71 \| 6.109 3.725 1.64 Sqrt::SqrtFixture::(1280x720, 6, true) 12.059 3.501 3.44 \| 12.053 4.080 2.95 Sqrt::SqrtFixture::(1920x1080, 5, false) 6.540 3.535 1.85 \| 6.540 3.643 1.80 Sqrt::SqrtFixture::(1920x1080, 5, true) 12.943 3.445 3.76 \| 12.908 3.706 3.48 Sqrt::SqrtFixture::(1920x1080, 6, false) 13.714 8.062 1.70 \| 13.711 8.376 1.64 Sqrt::SqrtFixture::(1920x1080, 6, true) 27.011 7.989 3.38 \| 27.115 9.245 2.93 ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-12 08:34:27 +03:00
Suleyman TURKMEN	656038346b	Merge pull request #26441 from sturkmen72:upd_tutorials Update tutorials #26441 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-11 16:17:21 +03:00
Alexander Smorkalov	1f63b986a1	Merge pull request #26976 from MaximSmolskiy/refactor-ArucoDetector-ArucoDetectorImpl-filterTooCloseCandidates Refactor ArucoDetector::ArucoDetectorImpl::filterTooCloseCandidates	2025-03-11 16:10:48 +03:00
Alexander Smorkalov	a48e78cdfc	Merge pull request #27026 from amane-ame/filter_hal_rvv Add RISC-V HAL implementation for cv::filter series	2025-03-11 16:09:45 +03:00
Alexander Smorkalov	d9956fc24f	Merge pull request #26934 from BenjaminKnecht/new_4.x Extend ArUcoDetector to run multiple dictionaries in an efficient manner.	2025-03-11 14:37:00 +03:00
amane-ame	2dd72201af	Remove CV_ASSERT. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-11 18:37:58 +08:00
Alexander Smorkalov	6fb082ae7f	Merge pull request #27001 from DanBmh/opt_newoptcm Optimize camera matrix undistortion	2025-03-11 12:47:35 +03:00
amane-ame	d9ec808b15	Use the macro from interface.h. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-11 17:44:55 +08:00
Alexander Smorkalov	fa092b4597	Merge pull request #27043 from asmorkalov/as/debayer_warn_fix Warning fix on Windows.	2025-03-11 12:07:57 +03:00
Alexander Smorkalov	f833519506	Warning fix on Windows.	2025-03-11 11:17:20 +03:00
Alexander Smorkalov	4be88e934f	Merge pull request #27010 from GenshinImpactStarts/exp_log [HAL RVV] impl exp and log \| add log perf test	2025-03-11 10:51:03 +03:00
Alexander Smorkalov	e342d2f339	Local decolor pipeline optimization.	2025-03-11 10:16:01 +03:00
Alexander Smorkalov	4bb57ceb73	Merge pull request #26868 from FantasqueX/bayer2gray-simd-2 Use universal intrinsics in bayer2gray	2025-03-11 09:55:09 +03:00
Alexander Smorkalov	2fbb310265	Merge pull request #27037 from sturkmen72/ImageCollection_animations Add a test to ensure ImageCollection class works good with animations	2025-03-11 08:18:22 +03:00
Suleyman TURKMEN	6004badce2	ImageCollection animations	2025-03-10 21:02:43 +03:00
Pierre Chatelier	e813326c17	Merge pull request #27039 from chacha21:threshold_otsu_doc_update Threshold otsu doc update #27039 PR for #27038 (I had already done that, but encounters git madness after branch renaming) - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [X] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-10 19:40:45 +03:00
Alexander Smorkalov	3236436892	Merge pull request #27036 from CodeLinaro:xuezha_3rdPost Fix gaussianBlur5x5 performance regression	2025-03-10 18:21:20 +03:00
Xue Zhang	accebdecf7	Fix gaussianBlur5x5 performance regression	2025-03-10 16:16:56 +05:30
Alexander Smorkalov	316b5d7b08	Merge pull request #27031 from sturkmen72:libjpeg-turbo_ver_3.1.0 Libjpeg-turbo update to version 3.1.0	2025-03-10 13:44:00 +03:00
Daniel	f4a2c35c73	Small updates.	2025-03-10 11:22:24 +01:00
amane-ame	54da5c3e77	Add some algorithm comments. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-10 16:42:58 +08:00
GenshinImpactStarts	830d031213	Merge pull request #26977 from GenshinImpactStarts:helper_hal_rvv [Refactor](HAL RVV): Consolidate Helpers for Code Reusability #26977 This PR introduces a new helper file with utility types and templates to standardize function interfaces. This refactor allows us to avoid duplicate code when types differ but logic remains the same. The `flip` and `minmax` implementations have been updated to use the new generic helpers, replacing the previously defined, redundant classes. Due to the large number of functions, not all interfaces are unified yet. Future development can extend the types as needed. While the usage of function templates is currently limited, this will ease future development. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-10 10:36:48 +03:00
amane-ame	02253dd76b	Copy cv::borderInterpolate from core. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-10 15:26:41 +08:00
quic-xuezha	797068853f	Merge pull request #27033 from CodeLinaro:xuezha_3rdPost Fix assert failure in Sobel test when enable FastCV #27033 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-10 10:24:28 +03:00
Suleyman TURKMEN	6d161c25ef	Update libjpeg-turbo version:3.1.0	2025-03-09 00:02:20 +03:00
GenshinImpactStarts	0fed1fa184	fix exp, log \| enable ui for log \| strengthen test Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-07 17:11:26 +00:00
GenshinImpactStarts	524d8ae01c	impl exp and log \| add log perf test Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-07 17:11:26 +00:00
amane-ame	e06502a254	Add Morph for MORPH_ERODE and MORPH_DILATE. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-08 00:35:50 +08:00
Alexander Smorkalov	40843d06ab	Disable CV_SIMD_SCALABLE for demosaicing as the implementation is not efficient on RISC-V RVV.	2025-03-07 16:24:20 +03:00
amane-ame	a2d784b6f5	Add sepFilter. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-07 20:56:04 +08:00
Alexander Smorkalov	12d182bf9e	Merge pull request #27025 from shyama7004:link fix the not working link	2025-03-07 15:55:59 +03:00
Alexander Smorkalov	648424eaf2	Code review fixes.	2025-03-07 15:33:54 +03:00
shyama7004	a9b2467868	fix the not working link	2025-03-07 17:39:49 +05:30
Alexander Smorkalov	fbffaa5276	Warning fix.	2025-03-07 11:56:26 +03:00
天音あめ	e89e2fd7ea	Merge pull request #27007 from amane-ame:color_hal_rvv Add RISC-V HAL implementation for cv::cvtColor #27007 This patch implements the following functions in RVV_HAL using native intrinsics, optimizing the performance of `cv::cvtColor` for all possible data types and modes (except for `COLOR_Bayer`, `COLOR_YUV2GRAY_420` and `COLOR_mRGBA`, as these modes have no HAL interface): ``` cv_hal_cvtBGRtoBGR cv_hal_cvtBGRtoBGR5x5 cv_hal_cvtBGR5x5toBGR cv_hal_cvtBGRtoGray cv_hal_cvtGraytoBGR cv_hal_cvtBGR5x5toGray cv_hal_cvtGraytoBGR5x5 cv_hal_cvtBGRtoYUV cv_hal_cvtYUVtoBGR cv_hal_cvtBGRtoXYZ cv_hal_cvtXYZtoBGR cv_hal_cvtBGRtoHSV cv_hal_cvtHSVtoBGR cv_hal_cvtBGRtoLab cv_hal_cvtLabtoBGR cv_hal_cvtTwoPlaneYUVtoBGR cv_hal_cvtBGRtoTwoPlaneYUV cv_hal_cvtThreePlaneYUVtoBGR cv_hal_cvtBGRtoThreePlaneYUV cv_hal_cvtOnePlaneYUVtoBGR cv_hal_cvtOnePlaneBGRtoYUV ``` Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ ./opencv_test_imgproc --gtest_filter="Color-Bayer" $ ./opencv_perf_imgproc --gtest_filter="Color-Bayer" --gtest_also_run_disabled_tests --perf_min_samples=100 --perf_force_samples=100 ``` View the full perf table here: [hal_rvv_color.pdf](https://github.com/user-attachments/files/19055417/hal_rvv_color.pdf) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable	2025-03-07 11:24:48 +03:00
天音あめ	00956d5c15	Merge pull request #26892 from amane-ame:solve_hal_rvv Add RISC-V HAL implementation for cv::solve #26892 This patch implements `cv_hal_LU/cv_hal_Cholesky/cv_hal_SVD/cv_hal_QR` function in RVV_HAL using native intrinsics, optimizing the performance for `cv::solve` with method `DECOMP_LU/DECOMP_SVD/DECOMP_CHOLESKY/DECOMP_QR` and data types `32FC1/64FC1`. Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ ./opencv_test_core --gtest_filter="Solve:SVD:Cholesky" $ ./opencv_perf_core --gtest_filter="SolveTest" --perf_min_samples=100 --perf_force_samples=100 ``` The tail of the perf table is shown below since the table is too long. View the full perf table here: [hal_rvv_solve.pdf](https://github.com/user-attachments/files/18725067/hal_rvv_solve.pdf) <img width="1078" alt="Untitled" src="https://github.com/user-attachments/assets/c01d849c-f000-4bcc-bfe0-a302d6605d9e" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-07 11:14:09 +03:00
天音あめ	bb525fe91d	Merge pull request #26865 from amane-ame:dxt_hal_rvv Add RISC-V HAL implementation for cv::dft and cv::dct #26865 This patch implements `static cv::DFT` function in RVV_HAL using native intrinsic, optimizing the performance for `cv::dft` and `cv::dct` with data types `32FC1/64FC1/32FC2/64FC2`. The reason I chose to create a new `cv_hal_dftOcv` interface is that if I were to use the existing interfaces (`cv_hal_dftInit1D` and `cv_hal_dft1D`), it would require handling and parsing the dft flags within HAL, as well as performing preprocessing operations such as handling unit roots. Since these operations are not performance hotspots and do not require optimization, reusing the existing interfaces would result in copying approximately 300 lines of code from `core/src/dxt.cpp` into HAL, which I believe is unnecessary. Moreover, if I insert the new interface into `static cv::DFT`, both `static cv::RealDFT` and `static cv::DCT` can be optimized as well. The processing performed before and after calling `static cv::DFT` in these functions is also not a performance hotspot. Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ opencv_test_core --gtest_filter="DFT" $ opencv_perf_core --gtest_filter="dft:dct" --perf_min_samples=30 --perf_force_samples=30 ``` The head of the perf table is shown below since the table is too long. View the full perf table here: [hal_rvv_dxt.pdf](https://github.com/user-attachments/files/18622645/hal_rvv_dxt.pdf) <img width="1017" alt="Untitled" src="https://github.com/user-attachments/assets/609856e7-9c7d-4a95-9923-45c1b77eb3a2" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-07 11:08:41 +03:00

1 2 3 4 5 ...

35174 Commits