opencv

mirror of https://github.com/opencv/opencv.git synced 2025-07-24 14:06:27 +08:00

Author	SHA1	Message	Date
amane-ame	a2d784b6f5	Add sepFilter. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-07 20:56:04 +08:00
天音あめ	e89e2fd7ea	Merge pull request #27007 from amane-ame:color_hal_rvv Add RISC-V HAL implementation for cv::cvtColor #27007 This patch implements the following functions in RVV_HAL using native intrinsics, optimizing the performance of `cv::cvtColor` for all possible data types and modes (except for `COLOR_Bayer`, `COLOR_YUV2GRAY_420` and `COLOR_mRGBA`, as these modes have no HAL interface): ``` cv_hal_cvtBGRtoBGR cv_hal_cvtBGRtoBGR5x5 cv_hal_cvtBGR5x5toBGR cv_hal_cvtBGRtoGray cv_hal_cvtGraytoBGR cv_hal_cvtBGR5x5toGray cv_hal_cvtGraytoBGR5x5 cv_hal_cvtBGRtoYUV cv_hal_cvtYUVtoBGR cv_hal_cvtBGRtoXYZ cv_hal_cvtXYZtoBGR cv_hal_cvtBGRtoHSV cv_hal_cvtHSVtoBGR cv_hal_cvtBGRtoLab cv_hal_cvtLabtoBGR cv_hal_cvtTwoPlaneYUVtoBGR cv_hal_cvtBGRtoTwoPlaneYUV cv_hal_cvtThreePlaneYUVtoBGR cv_hal_cvtBGRtoThreePlaneYUV cv_hal_cvtOnePlaneYUVtoBGR cv_hal_cvtOnePlaneBGRtoYUV ``` Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ ./opencv_test_imgproc --gtest_filter="Color-Bayer" $ ./opencv_perf_imgproc --gtest_filter="Color-Bayer" --gtest_also_run_disabled_tests --perf_min_samples=100 --perf_force_samples=100 ``` View the full perf table here: [hal_rvv_color.pdf](https://github.com/user-attachments/files/19055417/hal_rvv_color.pdf) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable	2025-03-07 11:24:48 +03:00
天音あめ	00956d5c15	Merge pull request #26892 from amane-ame:solve_hal_rvv Add RISC-V HAL implementation for cv::solve #26892 This patch implements `cv_hal_LU/cv_hal_Cholesky/cv_hal_SVD/cv_hal_QR` function in RVV_HAL using native intrinsics, optimizing the performance for `cv::solve` with method `DECOMP_LU/DECOMP_SVD/DECOMP_CHOLESKY/DECOMP_QR` and data types `32FC1/64FC1`. Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ ./opencv_test_core --gtest_filter="Solve:SVD:Cholesky" $ ./opencv_perf_core --gtest_filter="SolveTest" --perf_min_samples=100 --perf_force_samples=100 ``` The tail of the perf table is shown below since the table is too long. View the full perf table here: [hal_rvv_solve.pdf](https://github.com/user-attachments/files/18725067/hal_rvv_solve.pdf) <img width="1078" alt="Untitled" src="https://github.com/user-attachments/assets/c01d849c-f000-4bcc-bfe0-a302d6605d9e" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-07 11:14:09 +03:00
天音あめ	bb525fe91d	Merge pull request #26865 from amane-ame:dxt_hal_rvv Add RISC-V HAL implementation for cv::dft and cv::dct #26865 This patch implements `static cv::DFT` function in RVV_HAL using native intrinsic, optimizing the performance for `cv::dft` and `cv::dct` with data types `32FC1/64FC1/32FC2/64FC2`. The reason I chose to create a new `cv_hal_dftOcv` interface is that if I were to use the existing interfaces (`cv_hal_dftInit1D` and `cv_hal_dft1D`), it would require handling and parsing the dft flags within HAL, as well as performing preprocessing operations such as handling unit roots. Since these operations are not performance hotspots and do not require optimization, reusing the existing interfaces would result in copying approximately 300 lines of code from `core/src/dxt.cpp` into HAL, which I believe is unnecessary. Moreover, if I insert the new interface into `static cv::DFT`, both `static cv::RealDFT` and `static cv::DCT` can be optimized as well. The processing performed before and after calling `static cv::DFT` in these functions is also not a performance hotspot. Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ opencv_test_core --gtest_filter="DFT" $ opencv_perf_core --gtest_filter="dft:dct" --perf_min_samples=30 --perf_force_samples=30 ``` The head of the perf table is shown below since the table is too long. View the full perf table here: [hal_rvv_dxt.pdf](https://github.com/user-attachments/files/18622645/hal_rvv_dxt.pdf) <img width="1017" alt="Untitled" src="https://github.com/user-attachments/assets/609856e7-9c7d-4a95-9923-45c1b77eb3a2" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-07 11:08:41 +03:00
GenshinImpactStarts	57a78cb9df	Merge pull request #26941 from GenshinImpactStarts:lut_hal_rvv Impl hal_rvv LUT \| Add more LUT test #26941 Implement through the existing `cv_hal_lut` interfaces. Add more LUT accuracy and performance tests: - Accuracy test: Multi-channel table tests are added, and the boundary of `randu` used for generating test data is broadened to make the test more robust. - Performance test: Multi-channel input and multi-channel table tests are added. Perf test done on - MUSE-PI (vlen=256) - Compiler: gcc 14.2 (riscv-collab/riscv-gnu-toolchain Nightly: December 16, 2024) ```sh $ opencv_test_core --gtest_filter="Core_LUT" $ opencv_perf_core --gtest_filter="SizePrm_LUT" --perf_min_samples=300 --perf_force_samples=300 ``` ```sh Geometric mean (ms) Name of Test scalar ui rvv ui rvv vs vs scalar scalar (x-factor) (x-factor) LUT::SizePrm::320x240 0.248 0.249 0.052 1.00 4.74 LUT::SizePrm::640x480 0.277 0.275 0.085 1.01 3.28 LUT::SizePrm::1920x1080 0.950 0.947 0.634 1.00 1.50 LUT_multi2::SizePrm::320x240 2.051 2.045 2.049 1.00 1.00 LUT_multi2::SizePrm::640x480 2.128 2.134 2.125 1.00 1.00 LUT_multi2::SizePrm::1920x1080 7.397 7.380 7.390 1.00 1.00 LUT_multi::SizePrm::320x240 0.715 0.747 0.154 0.96 4.64 LUT_multi::SizePrm::640x480 0.741 0.766 0.257 0.97 2.88 LUT_multi::SizePrm::1920x1080 2.766 2.765 1.925 1.00 1.44 ``` This optimization is achieved by loading the entire lookup table into vector registers. Due to register size limitations, the optimization is only effective under the following conditions: - For the U8C1 table type, the optimization works when `vlen >= 256` - For U16C1, it works when `vlen >= 512` - For U32C1, it works when `vlen >= 1024` Since I don’t have real hardware with `vlen > 256`, the corresponding accuracy tests were conducted on QEMU built from the `riscv-collab/riscv-gnu-toolchain`. This patch does not implement optimizations for multi-channel tables. Previous attempts: 1. For the U8C1 table type, when `vlen = 128`, it is possible to use four `u8m4` vectors to load the entire table, perform gathering, and merge the results. However, the performance is almost the same as the scalar version. 2. Loading part of the table and repeatedly loading the source data is faster for small sizes. But as the table size grows, the performance quickly degrades compared to the scalar version. 3. Using `vluxei8` as a general solution does not show any performance improvement. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-06 11:17:00 +03:00
amane-ame	83104bed32	Add Filter2D. Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-03-06 14:10:06 +08:00
天音あめ	cbcfd772ce	Merge pull request #26958 from amane-ame:pyramids_hal_rvv Add RISC-V HAL implementation for cv::pyrDown and cv::pyrUp #26958 This patch implements `cv_hal_pyrdown/cv_hal_pyrup` function in RVV_HAL using native intrinsics, optimizing the performance for `cv::pyrDown`, `cv::pyrUp` and `cv::buildPyramids` with data types `{8U,16S,32F} x {C1,C2,C3,C4,Cn}`. Tested on MUSE-PI (Spacemit X60) for both gcc 14.2 and clang 20.0. ``` $ ./opencv_test_imgproc --gtest_filter="pyr:Pyr" $ ./opencv_perf_imgproc --gtest_filter="pyr:Pyr" --perf_min_samples=300 --perf_force_samples=300 ``` <img width="1112" alt="Untitled" src="https://github.com/user-attachments/assets/235a9fba-0d29-434e-8a10-498212bac657" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-03-04 15:41:15 +03:00
GenshinImpactStarts	33d632f85e	impl hal_rvv norm_hamming Co-authored-by: Liutong HAN <liutong2020@iscas.ac.cn>	2025-02-25 02:31:02 +00:00
GenshinImpactStarts	6a6a5a765d	Merge pull request #26943 from GenshinImpactStarts:flip_hal_rvv Impl RISC-V HAL for cv::flip \| Add perf test for flip #26943 Implement through the existing `cv_hal_flip` interfaces. Add perf test for `cv::flip`. The reason why select these args for testing: - size: copied from perf_lut - type: - U8C1: basic situation - U8C3: unaligned element size - U8C4: large element size Tested on - MUSE-PI (vlen=256) - Compiler: gcc 14.2 (riscv-collab/riscv-gnu-toolchain Nightly: December 16, 2024) ```sh $ opencv_test_core --gtest_filter="Core_Flip/ElemWiseTest." $ opencv_perf_core --gtest_filter="Size_MatType_FlipCode" --perf_min_samples=300 --perf_force_samples=300 ``` ``` Geometric mean (ms) Name of Test scalar ui rvv ui rvv vs vs scalar scalar (x-factor) (x-factor) flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_X) 0.026 0.033 0.031 0.81 0.84 flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_XY) 0.206 0.212 0.091 0.97 2.26 flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_Y) 0.185 0.189 0.082 0.98 2.25 flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_X) 0.070 0.084 0.084 0.83 0.83 flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_XY) 0.616 0.612 0.235 1.01 2.62 flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_Y) 0.587 0.603 0.204 0.97 2.88 flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_X) 0.263 0.110 0.109 2.40 2.41 flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_XY) 0.930 0.831 0.316 1.12 2.95 flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_Y) 1.175 1.129 0.313 1.04 3.75 flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_X) 0.303 0.118 0.111 2.57 2.73 flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_XY) 0.949 0.836 0.405 1.14 2.34 flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_Y) 0.784 0.783 0.409 1.00 1.92 flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_X) 1.084 0.360 0.355 3.01 3.06 flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_XY) 3.768 3.348 1.364 1.13 2.76 flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_Y) 4.361 4.473 1.296 0.97 3.37 flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_X) 1.252 0.469 0.451 2.67 2.78 flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_XY) 5.732 5.220 1.303 1.10 4.40 flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_Y) 5.041 5.105 1.203 0.99 4.19 flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_X) 2.382 0.903 0.903 2.64 2.64 flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_XY) 8.606 7.508 2.581 1.15 3.33 flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_Y) 8.421 8.535 2.219 0.99 3.80 flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_X) 6.312 2.416 2.429 2.61 2.60 flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_XY) 29.174 26.055 12.761 1.12 2.29 flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_Y) 25.373 25.500 13.382 1.00 1.90 flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_X) 7.620 3.204 3.115 2.38 2.45 flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_XY) 32.876 29.310 12.976 1.12 2.53 flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_Y) 28.831 29.094 14.919 0.99 1.93 ``` The optimization for vlen <= 256 and > 256 are different, but I have no real hardware with vlen > 256. So accuracy tests for that like 512 and 1024 are conducted on QEMU built from the `riscv-collab/riscv-gnu-toolchain`. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-02-24 08:56:23 +03:00
Dmitry Kurtaev	7a2b048c92	Merge pull request #26923 from dkurt:merge_rvv_opt Further optimization of cv::merge RVV HAL for 8U and 16S #26923 ### Pull Request Readiness Checklist * Banana Pi BF3 (SpacemiT K1) RISC-V * Compiler: Syntacore Clang 18.1.4 (build 2024.12) ``` Geometric mean (ms) Name of Test baseline pr pr merge vs baseline merge (x-factor) merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 2) 0.013 0.003 3.76 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 3) 0.020 0.006 3.46 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 4) 0.026 0.010 2.61 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 5) 0.043 0.028 1.56 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 6) 0.054 0.035 1.53 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 7) 0.065 0.050 1.30 merge::Size_SrcDepth_DstChannels::(127x61, 8UC1, 8) 0.070 0.036 1.95 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 2) 0.015 0.008 1.82 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 3) 0.022 0.015 1.48 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 4) 0.029 0.018 1.63 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 5) 0.067 0.044 1.54 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 6) 0.088 0.056 1.58 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 7) 0.104 0.076 1.38 merge::Size_SrcDepth_DstChannels::(127x61, 16SC1, 8) 0.116 0.065 1.79 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 2) 0.421 0.176 2.39 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 3) 0.792 0.284 2.79 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 4) 1.090 0.370 2.95 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 5) 1.835 1.399 1.31 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 6) 2.389 1.776 1.35 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 7) 3.000 2.471 1.21 merge::Size_SrcDepth_DstChannels::(640x480, 8UC1, 8) 3.178 2.104 1.51 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 2) 0.490 0.377 1.30 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 3) 1.348 0.602 2.24 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 4) 1.827 0.813 2.25 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 5) 3.283 2.692 1.22 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 6) 4.922 3.334 1.48 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 7) 5.725 4.399 1.30 merge::Size_SrcDepth_DstChannels::(640x480, 16SC1, 8) 6.278 4.748 1.32 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 2) 1.267 0.603 2.10 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 3) 2.394 0.934 2.56 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 4) 3.236 1.434 2.26 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 5) 5.398 4.345 1.24 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 6) 7.127 5.459 1.31 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 7) 8.590 7.298 1.18 merge::Size_SrcDepth_DstChannels::(1280x720, 8UC1, 8) 9.360 6.152 1.52 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 2) 1.482 1.242 1.19 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 3) 4.008 1.817 2.21 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 4) 6.079 2.468 2.46 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 5) 11.300 8.644 1.31 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 6) 15.125 12.126 1.25 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 7) 17.555 14.804 1.19 merge::Size_SrcDepth_DstChannels::(1280x720, 16SC1, 8) 18.890 14.163 1.33 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 2) 2.910 1.326 2.19 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 3) 5.351 1.997 2.68 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 4) 7.290 2.629 2.77 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 5) 12.426 9.611 1.29 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 6) 16.453 12.162 1.35 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 7) 19.420 16.190 1.20 merge::Size_SrcDepth_DstChannels::(1920x1080, 8UC1, 8) 20.588 13.699 1.50 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 2) 3.400 2.640 1.29 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 3) 8.986 3.952 2.27 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 4) 11.972 5.273 2.27 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 5) 20.544 17.996 1.14 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 6) 28.677 22.086 1.30 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 7) 32.958 27.713 1.19 merge::Size_SrcDepth_DstChannels::(1920x1080, 16SC1, 8) 36.499 27.439 1.33 ``` See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2025-02-20 17:28:28 +03:00
Vincent Rabaud	a6bfd87943	Bump openjp2 to v2.5.3 This should quiet some fuzzer bugs	2025-02-20 10:46:33 +03:00
Alexander Smorkalov	acc9084044	Move OpenVX integrations to imgproc to OpenVX HAL Covered functions: - medianBlur - Sobel - Canny - pyrDown - BoxFilter - equalizeHist - GaussianBlur - remap - threshold	2025-02-15 09:55:37 +03:00
Alexander Smorkalov	1de6e20463	Move OpenVX implementation for FAST to HAL.	2025-02-14 17:47:48 +03:00
Alexander Smorkalov	58e557d059	Merge pull request #26903 from asmorkalov:as/openvx_hal Migrate remaning OpenVX integrations to OpenVX HAL (core) #26903 Tested with OpenVX 1.2 & 1.3 sample implementation. Steps to build and test: ``` git clone git@github.com:KhronosGroup/OpenVX-sample-impl.git cd OpenVX-sample-impl python3 Build.py --os=Linux --conf=Release cd .. mkdir build cmake -DWITH_OPENVX=ON -DOPENVX_ROOT=/mnt/Projects/Projects/OpenVX-sample-impl/install/Linux/x64/Release/ ../opencv make -j8 ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-02-14 11:55:20 +03:00
Alexander Smorkalov	5921aae2b3	Switch to static instance of FastCV on Linux.	2025-02-13 15:58:25 +03:00
lve-gh	d8c2f0bcdf	Merge pull request #26884 from lve-gh:split8u_rvv_hal [HAL] split8u RVV 1.0 #26884 ### Pull Request Readiness Checklist * Banana Pi BF3 (SpacemiT K1) * Compiler: Syntacore Clang 18.1.4 (build 2024.12) ``` Geometric mean (ms) Name of Test baseline hal hal ui vs baseline ui (x-factor) split::Size_Depth_Channels::(127x61, 8UC1, 2) 0.012 0.004 3.12 split::Size_Depth_Channels::(127x61, 8UC1, 3) 0.019 0.006 2.91 split::Size_Depth_Channels::(127x61, 8UC1, 4) 0.028 0.011 2.64 split::Size_Depth_Channels::(127x61, 8UC1, 5) 0.067 0.033 2.02 split::Size_Depth_Channels::(127x61, 8UC1, 6) 0.084 0.040 2.11 split::Size_Depth_Channels::(127x61, 8UC1, 7) 0.103 0.055 1.88 split::Size_Depth_Channels::(127x61, 8UC1, 8) 0.113 0.032 3.50 split::Size_Depth_Channels::(640x480, 8UC1, 2) 0.454 0.179 2.54 split::Size_Depth_Channels::(640x480, 8UC1, 3) 0.677 0.298 2.27 split::Size_Depth_Channels::(640x480, 8UC1, 4) 0.901 0.410 2.20 split::Size_Depth_Channels::(640x480, 8UC1, 5) 3.781 3.010 1.26 split::Size_Depth_Channels::(640x480, 8UC1, 6) 4.886 4.009 1.22 split::Size_Depth_Channels::(640x480, 8UC1, 7) 5.777 4.770 1.21 split::Size_Depth_Channels::(640x480, 8UC1, 8) 4.596 1.330 3.46 split::Size_Depth_Channels::(1280x720, 8UC1, 2) 1.377 0.709 1.94 split::Size_Depth_Channels::(1280x720, 8UC1, 3) 2.091 1.034 2.02 split::Size_Depth_Channels::(1280x720, 8UC1, 4) 2.744 1.573 1.74 split::Size_Depth_Channels::(1280x720, 8UC1, 5) 9.542 6.284 1.52 split::Size_Depth_Channels::(1280x720, 8UC1, 6) 11.114 7.850 1.42 split::Size_Depth_Channels::(1280x720, 8UC1, 7) 14.083 11.879 1.19 split::Size_Depth_Channels::(1280x720, 8UC1, 8) 13.524 3.865 3.50 split::Size_Depth_Channels::(1920x1080, 8UC1, 2) 3.108 1.395 2.23 split::Size_Depth_Channels::(1920x1080, 8UC1, 3) 4.659 2.128 2.19 split::Size_Depth_Channels::(1920x1080, 8UC1, 4) 6.127 2.818 2.17 split::Size_Depth_Channels::(1920x1080, 8UC1, 5) 26.733 16.625 1.61 split::Size_Depth_Channels::(1920x1080, 8UC1, 6) 31.242 22.414 1.39 split::Size_Depth_Channels::(1920x1080, 8UC1, 7) 35.968 27.658 1.30 split::Size_Depth_Channels::(1920x1080, 8UC1, 8) 29.997 8.655 3.47 ``` See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2025-02-11 17:57:05 +03:00
天音あめ	2e909c38dc	Merge pull request #26804 from amane-ame:norm_hal_rvv Add RISC-V HAL implementation for cv::norm and cv::normalize #26804 This patch implements `cv::norm` with norm types `NORM_INF/NORM_L1/NORM_L2/NORM_L2SQR` and `Mat::convertTo` function in RVV_HAL using native intrinsic, optimizing the performance for `cv::norm(src)`, `cv::norm(src1, src2)`, and `cv::normalize(src)` with data types `8UC1/8UC4/32FC1`. `cv::normalize` also calls `minMaxIdx`, #26789 implements RVV_HAL for this. Tested on MUSE-PI for both gcc 14.2 and clang 20.0. ``` $ opencv_test_core --gtest_filter="Norm" $ opencv_perf_core --gtest_filter="norm" --perf_min_samples=300 --perf_force_samples=300 ``` The head of the perf table is shown below since the table is too long. View the full perf table here: [hal_rvv_norm.pdf](https://github.com/user-attachments/files/18468255/hal_rvv_norm.pdf) <img width="1304" alt="Untitled" src="https://github.com/user-attachments/assets/3550b671-6d96-4db3-8b5b-d4cb241da650" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-02-06 19:34:54 +03:00
Alexander Smorkalov	b7663086fb	Do not rely on cv namespace in HAL.	2025-02-06 10:00:28 +03:00
天音あめ	13b2caffe0	Merge pull request #26789 from amane-ame:minmax_hal_rvv Add RISC-V HAL implementation for minMaxIdx #26789 On the RISC-V platform, `minMaxIdx` cannot benefit from Universal Intrinsics because the UI-optimized `minMaxIdx` only supports `CV_SIMD128` (and does not accept `CV_SIMD_SCALABLE` for RVV). `1d701d1690/modules/core/src/minmax.cpp (L209-L214)` This patch implements `minMaxIdx` function in RVV_HAL using native intrinsic, optimizing the performance for all data types with one channel. Tested on MUSE-PI for both gcc 14.2 and clang 20.0. ``` $ opencv_test_core --gtest_filter="MinMaxLoc" $ opencv_perf_core --gtest_filter="minMaxLoc" ``` <img width="1122" alt="Untitled" src="https://github.com/user-attachments/assets/6a246852-87af-42c5-a50b-c349c2765f3f" /> ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-01-31 14:26:49 +03:00
Horror Proton	86241653a7	Add RISC-V HAL implementation for cv::phase	2025-01-29 12:07:59 +08:00
eplankin	ae57c54d83	Merge pull request #26463 from eplankin:icv_update_2022.0.0 Update IPP integration #26463 Please merge together with https://github.com/opencv/opencv_3rdparty/pull/88 Supported IPP version was updated to IPP 2022.0.0 for Linux and Windows. 32-bit binaries are dropped since this release. Previous update: https://github.com/opencv/opencv/pull/25935	2025-01-27 17:02:36 +03:00
Kumataro	3e1fafefbe	Merge pull request #26802 from Kumataro:fix26801 3rdparty:ittnotify: update to v3.25.4 #26802 Close https://github.com/opencv/opencv/issues/26801 See https://github.com/opencv/opencv/pull/26797 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2025-01-20 10:54:13 +03:00
Alexander Smorkalov	9c33baebbd	Merge pull request #26675 from hanliutong:rvv-hal-fix Add test cases and fix bugs in the RISC-V Vector HAL.	2024-12-29 18:09:21 +03:00
Alexander Alekhin	1b48eafe48	Merge pull request #26672 from opencv-pushbot:gitee/alalek/update_ffmpeg_4.x	2024-12-29 01:42:29 +00:00
Liutong HAN	b31f7694c5	Add test cases and fix bugs in the RVV HAL.	2024-12-27 08:39:52 +00:00
Alexander Alekhin	c64fe91ff4	ffmpeg/4.x: update FFmpeg wrapper 2024.12	2024-12-26 12:30:48 +00:00
Alexander Smorkalov	745a12c03b	Sevral fixes for FastCV handling.	2024-12-26 09:54:09 +03:00
quic-apreetam	d037b40faa	Merge pull request #26621 from CodeLinaro:apreetam_2ndPost FastCV-based HAL for OpenCV acceleration 2ndpost-3 #26621 ### Detailed description: - Add cv_hal_canny for Canny API Requires binary from [opencv/opencv_3rdparty#90](https://github.com/opencv/opencv_3rdparty/pull/90) Depends on: [opencv/opencv#26617](https://github.com/opencv/opencv/pull/26617) Depends on: [opencv/opencv#26619](https://github.com/opencv/opencv/pull/26619) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-12-19 13:31:26 +03:00
adsha-quic	59f762b2f0	Merge pull request #26619 from CodeLinaro:adsha_2ndPost FastCV-based HAL for OpenCV acceleration 2ndpost-2 #26619 ### Detailed description: - Add support for multiply 8u, 16s and 32f - Add support for cv_hal_pyrdown 8u - Add support for cv_hal_cvtBGRtoHSV and cv_hal_cvtBGRtoYUVApprox 8u Requires binary from [opencv/opencv_3rdparty#90](https://github.com/opencv/opencv_3rdparty/pull/90) Depends on: [opencv/opencv#26617](https://github.com/opencv/opencv/pull/26617) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-12-19 08:28:24 +03:00
Liutong HAN	3fbaad36d7	Merge pull request #26624 from hanliutong:rvv-mean Add RISC-V HAL implementation for meanStdDev #26624 `meanStdDev` benefits from the Universal Intrinsic backend of RVV, but we also found that the performance on the `8UC4` type is worse than the scalar version when there is a mask, and there is no optimization implementation on `32FC1`. This patch implements `meanStdDev` function in RVV_HAL using native intrinsic, significantly optimizing the performance for `8UC1`, `8UC4` and `32FC1`. This patch is tested on BPI-F3 for both gcc 14.2 and clang 19.1. ``` $ opencv_test_core --gtest_filter="MeanStdDev" $ opencv_perf_core --gtest_filter="Size_MatType_meanStdDev* ``` ![1734077611879](https://github.com/user-attachments/assets/71c85c9d-1db1-470d-81d1-bf546e27ad86) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-12-18 22:19:02 +03:00
quic-xuezha	1c28a98b34	Merge pull request #26617 from CodeLinaro:xuezha_2ndPost FastCV-based HAL for OpenCV acceleration 2ndpost-1 #26617 ### Detailed description: - Add parallel support for cv_hal_sobel - Add cv_hal_gaussianBlurBinomial and parallel support. - Add cv_hal_addWeighted8u and parallel support - Add cv_hal_warpPerspective and parallel support Requires binary from [opencv/opencv_3rdparty#90](https://github.com/opencv/opencv_3rdparty/pull/90) Related patch to opencv_contrib: [opencv/opencv_contrib#3844](https://github.com/opencv/opencv_contrib/pull/3844) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-12-18 09:34:13 +03:00
Alexander Smorkalov	a4c8d318e6	Update KleidiCV to version 0.3.	2024-12-16 15:32:44 +03:00
Alexander Smorkalov	5f1b05af0e	Merge pull request #26556 from asmorkalov:FastcvHAL_1stPost Added Fastcv HAL changes in the 3rdparty folder. Code Changes includes HAL code , Fastcv libs and Headers Change-Id: I2f0ddb1f57515c82ae86ba8c2a82965b1a9626ec Requires binaries from https://github.com/opencv/opencv_3rdparty/pull/86. Related patch to opencv_contrib: https://github.com/opencv/opencv_contrib/pull/3811 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-12-02 10:50:38 +03:00
Alexander Smorkalov	1a775198ce	Skip KleidiCV in offline build.	2024-11-13 15:13:19 +03:00
Alexander Smorkalov	5817b562b3	Merge pull request #26364 from plctlab:rvp_pt2 3rdparty: NDSRVP - Part 2.1: Filter-Related Functions	2024-11-05 18:53:00 +03:00
Alexander Smorkalov	cf87380fad	Disable SME2 branches in KleidiCV as it's incompatible with some CLang versions, e.g. NDK 28b1.	2024-10-31 08:14:30 +03:00
Junyan721113	bf7ab8eebd	feat: medianBlur & bilateralFilter	2024-10-28 17:54:45 +08:00
Liutong HAN	515b4a2689	Add the missing license description.	2024-10-25 11:37:07 +00:00
Alexander Smorkalov	69803e7b99	Merge pull request #26216 from hanliutong:rvv-hal-merge Add the HAL implementation for the merge function on RISC-V Vector.	2024-10-09 17:07:57 +03:00
Alexander Smorkalov	3901426d85	Merge pull request #26241 from asmorkalov:as/kelidicv-0.2 Updated KleidiCV HAL to version 0.2. #26241 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-10-03 15:04:25 +03:00
Liutong HAN	8a36f119ce	Add the HAL implementation for the merge function on RISC-V Vector	2024-09-29 13:39:53 +00:00
Alexander Smorkalov	a6ec12f58b	Merge pull request #26163 from asmorkalov:as/HAL_schaar_deriv HAL interface for Sharr derivatives needed for Lukas-Kanade algorithm #26163 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-09-23 08:44:22 +03:00
Alexander Smorkalov	881440c6c6	Merge pull request #26143 from asmorkalov:as/HAL_opticalFlowLK Added HAL interface for Lukas-Kanade optical flow #26143 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-09-16 17:07:06 +03:00
FantasqueX	85923c8f30	Merge pull request #26113 from FantasqueX:zlib-ng-2-2-1 Update zlib-ng to 2.2.1 #26113 Release: https://github.com/zlib-ng/zlib-ng/releases/tag/2.2.1 ARM diagnostics patch: https://github.com/zlib-ng/zlib-ng/pull/1774 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-09-12 16:05:24 +03:00
Alexander Smorkalov	7de3a8e960	Merge pull request #26088 from plctlab:rvp_pt2 3rdparty: NDSRVP - Part 2: Filter	2024-09-11 12:18:42 +03:00
Alexander Smorkalov	a905526f71	Got rid of CAROTENE_NEON_ARCH and use standard __ARM_ARCH check.	2024-08-30 12:09:04 +03:00
llh721113	e087cc8fd1	feat: NDSRVP Filter	2024-08-30 07:59:51 +08:00
Letu Ren	c340b3b466	Fix typo in openvx readme	2024-08-15 16:37:40 +08:00
Junyan721113	35463e079c	feat: Part 1.5 - New Interfaces	2024-08-02 13:47:45 +08:00
eplankin	a5dacb5bed	Update IPP integration	2024-07-18 08:16:19 -07:00
Alexander Smorkalov	2799c74d50	Use Carotene implementation of TEGRA_GaussianBlurBinomial 3x3 and 5x5 on ARM.	2024-07-02 12:50:09 +03:00
Thirumalai Nagalingam	ce1d840adf	Fix: compilation Issue on ARM64 (msys2 clangarm) Added a definition for M_PI in the code to resolve a compilation error encountered when building OpenCV on the MSYS2 environment. The M_PI constant was not defined, causing the compilation to fail.	2024-06-30 18:58:56 +00:00
Maxim Milashchenko	786726719f	Merge pull request #25793 from MaximMilashchenko:hal_rvv Fixed build error hal_rvv_071 #25793 Fixed bug with enabling vector header when vector extension is disabled (RVV=OFF) in hal_rvv_071	2024-06-28 09:00:16 +03:00
eplankin	860b688cdd	Enable build with both old and new layouts of IPP	2024-06-17 03:26:01 -07:00
Maxim Milashchenko	adcb070396	Merge pull request #25307 from MaximMilashchenko:halrvv071 * added hal for cv_hal_cvtBGRtoBGR rvv 0.7.1	2024-06-06 15:31:59 +03:00
Junyan721113	d9421ac148	Merge pull request #25167 from plctlab:rvp_3rdparty 3rdparty: NDSRVP - A New 3rdparty Library with Optimizations Based on RISC-V P Extension v0.5.2 - Part 1: Basic Functions #25167 # Summary ### Previous context From PR #24556: >> * As you wrote, the P-extension differs from RVV thus can not be easily implemented via Universal Intrinsics mechanism, but there is another HAL mechanism for lower-level CPU optimizations which is used by the [Carotene](https://github.com/opencv/opencv/tree/4.x/3rdparty/carotene) library on ARM platforms. I suggest moving all non-dnn code to similar third-party component. For example, FAST algorithm should allow such optimization-shortcut: see https://github.com/opencv/opencv/blob/4.x/modules/features2d/src/hal_replacement.hpp >> Reference documentation is here: >> >> * https://docs.opencv.org/4.x/d1/d1b/group__core__hal__interface.html >> * https://docs.opencv.org/4.x/dd/d8b/group__imgproc__hal__interface.html >> * https://docs.opencv.org/4.x/db/d47/group__features2d__hal__interface.html >> * Carotene library is turned on here: `8bbf08f0de/CMakeLists.txt (L906-L911)` > As a test outside of this PR, A 3rdparty component called ndsrvp is created, containing one of the non-dnn code (integral_SIMD), and it works very well. > All the non-dnn code in this PR have been removed, currently this PR can be focused on dnn optinizations. > This HAL mechanism is quite suitable for rvp optimizations, all the non-dnn code is expected to be moved into ndsrvp soon. ### Progress #### Part 1 (This PR) - [Core](https://docs.opencv.org/4.x/d1/d1b/group__core__hal__interface.html) - [x] Element-wise add and subtract - [x] Element-wise minimum or maximum - [x] Element-wise absolute difference - [x] Bitwise logical operations - [x] Element-wise compare - [ImgProc](https://docs.opencv.org/4.x/dd/d8b/group__imgproc__hal__interface.html) - [x] Integral - [x] Threshold - [x] WarpAffine - [x] WarpPerspective - [Features2D](https://docs.opencv.org/4.x/db/d47/group__features2d__hal__interface.html) #### Part 2 (Next PR) Rough Estimate. Todo List May Change. - [Core](https://docs.opencv.org/4.x/d1/d1b/group__core__hal__interface.html) - [ImgProc](https://docs.opencv.org/4.x/dd/d8b/group__imgproc__hal__interface.html) - smaller remap HAL interface - AdaptiveThreshold - BoxFilter - Canny - Convert - Filter - GaussianBlur - MedianBlur - Morph - Pyrdown - Resize - Scharr - SepFilter - Sobel - [Features2D](https://docs.opencv.org/4.x/db/d47/group__features2d__hal__interface.html) - FAST ### Performance Tests The optimization does not contain floating point opreations. Absolute Difference Geometric mean (ms) \|Name of Test\|opencv perf core Absdiff\|opencv perf core Absdiff\|opencv perf core Absdiff vs opencv perf core Absdiff (x-factor)\| \|---\|:-:\|:-:\|:-:\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 8UC1)\|23.104\|5.972\|3.87\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 32FC1)\|39.500\|40.830\|0.97\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 8UC3)\|69.155\|15.051\|4.59\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 32FC3)\|118.715\|120.509\|0.99\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 8UC4)\|93.001\|19.770\|4.70\| \|Absdiff::OCL_AbsDiffFixture::(640x480, 32FC4)\|161.136\|160.791\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 8UC1)\|69.211\|15.140\|4.57\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 32FC1)\|118.762\|119.263\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 8UC3)\|212.414\|44.692\|4.75\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 32FC3)\|367.512\|366.569\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 8UC4)\|285.337\|59.708\|4.78\| \|Absdiff::OCL_AbsDiffFixture::(1280x720, 32FC4)\|490.395\|491.118\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 8UC1)\|158.827\|33.462\|4.75\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 32FC1)\|273.503\|273.668\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 8UC3)\|484.175\|100.520\|4.82\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 32FC3)\|828.758\|829.689\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 8UC4)\|648.592\|137.195\|4.73\| \|Absdiff::OCL_AbsDiffFixture::(1920x1080, 32FC4)\|1116.755\|1109.587\|1.01\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 8UC1)\|648.715\|134.875\|4.81\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 32FC1)\|1115.939\|1113.818\|1.00\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 8UC3)\|1944.791\|413.420\|4.70\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 32FC3)\|3354.193\|3324.672\|1.01\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 8UC4)\|2594.585\|553.486\|4.69\| \|Absdiff::OCL_AbsDiffFixture::(3840x2160, 32FC4)\|4473.543\|4438.453\|1.01\| Bitwise Operation Geometric mean (ms) \|Name of Test\|opencv perf core Bit\|opencv perf core Bit\|opencv perf core Bit vs opencv perf core Bit (x-factor)\| \|---\|:-:\|:-:\|:-:\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 8UC1)\|22.542\|4.971\|4.53\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 32FC1)\|90.210\|19.917\|4.53\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 8UC3)\|68.429\|15.037\|4.55\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 32FC3)\|280.168\|59.239\|4.73\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 8UC4)\|90.565\|19.735\|4.59\| \|Bitwise_and::OCL_BitwiseAndFixture::(640x480, 32FC4)\|374.695\|79.257\|4.73\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 8UC1)\|67.824\|14.873\|4.56\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 32FC1)\|279.514\|59.232\|4.72\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 8UC3)\|208.337\|44.234\|4.71\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 32FC3)\|851.211\|182.522\|4.66\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 8UC4)\|279.529\|59.095\|4.73\| \|Bitwise_and::OCL_BitwiseAndFixture::(1280x720, 32FC4)\|1132.065\|244.877\|4.62\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 8UC1)\|155.685\|33.078\|4.71\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 32FC1)\|635.253\|137.482\|4.62\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 8UC3)\|474.494\|100.166\|4.74\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 32FC3)\|1907.340\|412.841\|4.62\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 8UC4)\|635.538\|134.544\|4.72\| \|Bitwise_and::OCL_BitwiseAndFixture::(1920x1080, 32FC4)\|2552.666\|556.397\|4.59\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 8UC1)\|634.736\|136.355\|4.66\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 32FC1)\|2548.283\|561.827\|4.54\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 8UC3)\|1911.454\|421.571\|4.53\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 32FC3)\|7663.803\|1677.289\|4.57\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 8UC4)\|2543.983\|562.780\|4.52\| \|Bitwise_and::OCL_BitwiseAndFixture::(3840x2160, 32FC4)\|10211.693\|2237.393\|4.56\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 8UC1)\|22.341\|4.811\|4.64\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 32FC1)\|89.975\|19.288\|4.66\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 8UC3)\|67.237\|14.643\|4.59\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 32FC3)\|276.324\|58.609\|4.71\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 8UC4)\|89.587\|19.554\|4.58\| \|Bitwise_not::OCL_BitwiseNotFixture::(640x480, 32FC4)\|370.986\|77.136\|4.81\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 8UC1)\|67.227\|14.541\|4.62\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 32FC1)\|276.357\|58.076\|4.76\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 8UC3)\|206.752\|43.376\|4.77\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 32FC3)\|841.638\|177.787\|4.73\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 8UC4)\|276.773\|57.784\|4.79\| \|Bitwise_not::OCL_BitwiseNotFixture::(1280x720, 32FC4)\|1127.740\|237.472\|4.75\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 8UC1)\|153.808\|32.531\|4.73\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 32FC1)\|627.765\|129.990\|4.83\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 8UC3)\|469.799\|98.249\|4.78\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 32FC3)\|1893.591\|403.694\|4.69\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 8UC4)\|627.724\|129.962\|4.83\| \|Bitwise_not::OCL_BitwiseNotFixture::(1920x1080, 32FC4)\|2529.967\|540.744\|4.68\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 8UC1)\|628.089\|130.277\|4.82\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 32FC1)\|2521.817\|540.146\|4.67\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 8UC3)\|1905.004\|404.704\|4.71\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 32FC3)\|7567.971\|1627.898\|4.65\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 8UC4)\|2531.476\|540.181\|4.69\| \|Bitwise_not::OCL_BitwiseNotFixture::(3840x2160, 32FC4)\|10075.594\|2181.654\|4.62\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 8UC1)\|22.566\|5.076\|4.45\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 32FC1)\|90.391\|19.928\|4.54\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 8UC3)\|67.758\|14.740\|4.60\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 32FC3)\|279.253\|59.844\|4.67\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 8UC4)\|90.296\|19.802\|4.56\| \|Bitwise_or::OCL_BitwiseOrFixture::(640x480, 32FC4)\|373.972\|79.815\|4.69\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 8UC1)\|67.815\|14.865\|4.56\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 32FC1)\|279.398\|60.054\|4.65\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 8UC3)\|208.643\|45.043\|4.63\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 32FC3)\|850.042\|180.985\|4.70\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 8UC4)\|279.363\|60.385\|4.63\| \|Bitwise_or::OCL_BitwiseOrFixture::(1280x720, 32FC4)\|1134.858\|243.062\|4.67\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 8UC1)\|155.212\|33.155\|4.68\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 32FC1)\|634.985\|134.911\|4.71\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 8UC3)\|474.648\|100.407\|4.73\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 32FC3)\|1912.049\|414.184\|4.62\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 8UC4)\|635.252\|132.587\|4.79\| \|Bitwise_or::OCL_BitwiseOrFixture::(1920x1080, 32FC4)\|2544.471\|560.737\|4.54\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 8UC1)\|634.574\|134.966\|4.70\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 32FC1)\|2545.129\|561.498\|4.53\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 8UC3)\|1910.900\|419.365\|4.56\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 32FC3)\|7662.603\|1685.812\|4.55\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 8UC4)\|2548.971\|560.787\|4.55\| \|Bitwise_or::OCL_BitwiseOrFixture::(3840x2160, 32FC4)\|10201.407\|2237.552\|4.56\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 8UC1)\|22.718\|4.961\|4.58\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 32FC1)\|91.496\|19.831\|4.61\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 8UC3)\|67.910\|15.151\|4.48\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 32FC3)\|279.612\|59.792\|4.68\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 8UC4)\|91.073\|19.853\|4.59\| \|Bitwise_xor::OCL_BitwiseXorFixture::(640x480, 32FC4)\|374.641\|79.155\|4.73\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 8UC1)\|67.704\|15.008\|4.51\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 32FC1)\|279.229\|60.088\|4.65\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 8UC3)\|208.156\|44.426\|4.69\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 32FC3)\|849.501\|180.848\|4.70\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 8UC4)\|279.642\|59.728\|4.68\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1280x720, 32FC4)\|1129.826\|242.880\|4.65\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 8UC1)\|155.585\|33.354\|4.66\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 32FC1)\|634.090\|134.995\|4.70\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 8UC3)\|474.931\|99.598\|4.77\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 32FC3)\|1910.519\|413.138\|4.62\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 8UC4)\|635.026\|135.155\|4.70\| \|Bitwise_xor::OCL_BitwiseXorFixture::(1920x1080, 32FC4)\|2560.167\|560.838\|4.56\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 8UC1)\|634.893\|134.883\|4.71\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 32FC1)\|2548.166\|560.831\|4.54\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 8UC3)\|1911.392\|419.816\|4.55\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 32FC3)\|7646.634\|1677.988\|4.56\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 8UC4)\|2560.637\|560.805\|4.57\| \|Bitwise_xor::OCL_BitwiseXorFixture::(3840x2160, 32FC4)\|10227.044\|2249.458\|4.55\| ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-28 14:25:53 +03:00
Kumataro	b6593517c4	Merge pull request #25647 from Kumataro:fix25646 imgcodecs: support IMWRITE_JPEG_LUMA/CHROMA_QUALITY with internal libjpeg-turbo #25647 Close #25646 - increase JPEG_LIB_VERSION for internal libjpeg-turbo from 62 to 70 - add log when using IMWRITE_JPEG_LUMA/CHROMA_QUALITY with JPEG_LIB_VERSION<70 - add document IMWRITE_JPEG_LUMA/CHROMA_QUALITY requests JPEG_LIB_VERSION >= 70 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-05-27 17:33:43 +03:00
Suleyman TURKMEN	8955a27577	minor cosmetic changes	2024-05-26 22:50:47 +03:00
Alexander Smorkalov	d97df262f6	Merge pull request #25623 from asmorkalov:as/jpegturbo_3.0.3 Libjpeg-turbo update to version 3.0.3 #25623 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-25 13:03:12 +03:00
Alexander Alekhin	f1aef892bf	Merge pull request #25619 from opencv-pushbot:gitee/alalek/update_ffmpeg_4.x	2024-05-23 16:54:30 +00:00
Alexander Smorkalov	4824354e46	Merge pull request #25631 from asmorkalov:as/png_build_fix Fixed CMake Missing variable is: CMAKE_ASM_COMPILE_OBJECT in PNG build #25631 Error message with `-DBUILD_PNG=ON` on ARM64: ``` -- Configuring done CMake Error: Error required internal CMake variable not set, cmake may not be built correctly. Missing variable is: CMAKE_ASM_COMPILE_OBJECT -- Generating done CMake Generate step failed. Build files cannot be regenerated correctly. ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-23 17:14:01 +03:00
Alexander Smorkalov	16b5096ed0	Merge pull request #25618 from asmorkalov:as/kleidicv_0.1.0 KleidiCV HAL update to version 0.1.0. #25618 Original integration PR: https://github.com/opencv/opencv/pull/25443 Force the library for testing with CI ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-23 16:55:44 +03:00
Suleyman TURKMEN	e765c9f9c8	Merge pull request #25580 from sturkmen72:libpng_1_6_43 3rdparty: update libpng 1.6.43 #25580 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-22 15:45:10 +03:00
Alexander Alekhin	5696413bbc	ffmpeg/4.x: update FFmpeg wrapper 2024.05	2024-05-22 08:53:39 +00:00
Alexander Smorkalov	40faced6c1	OpenJPEG update to 2.5.2.	2024-05-20 13:44:43 +03:00
Alexander Smorkalov	d29ad2fb71	Merge pull request #25443 from asmorkalov:as/kleidicv_hal Integrate ARM KleidiCV as OpenCV HAL #25443 The library source code with license: https://gitlab.arm.com/kleidi/kleidicv/ ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-05-16 15:29:06 +03:00
Alexander Smorkalov	6bf758ecc4	Merge pull request #24782 from FantasqueX:4.x-zlib-ng Add zlib-ng as an alternative zlib implementation	2024-05-03 10:55:25 +03:00
Vincent Rabaud	85673afb47	Bump libwebp to v1.4.0	2024-04-17 14:36:44 +02:00
Alexander Alekhin	4fb0541916	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2024-04-15 11:51:36 +00:00
eplankin	02a74f2e88	Merge pull request #25377 from eplankin:3.4 * Update IPP integration * Updated packages and hashes	2024-04-15 10:03:44 +00:00
Alexander Smorkalov	bbd9059334	Fixed jpeg-turbo diagnostics and build options in default configuration.	2024-04-10 16:19:59 +03:00
daiyin	93800a85b2	fix download file hash value mismatch issue	2024-04-01 10:32:37 +08:00
zzuliys	2b9d1a2ff8	Merge pull request #24877 from zzuliys:feature/mac Orbbec Camera supports MacOS，Gemini2 and Gemini2L support Y16 format #24877 note: 1.Gemini2 and Gemini2L must use the latest firmware -- https://github.com/orbbec/OrbbecFirmware; 2.Administrator privileges are necessary to run on MacOS.	2024-03-29 11:23:41 +03:00
Michael Klatis	f87e1efd2a	Merge pull request #25092 from klatism:libjpeg-upgrade libjpeg upgrade to version 9f #25092 Upgrade libjpeg dependency from version 9d to 9f. - [X] I agree to contribute to the project under Apache 2 License. - [X] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-03-28 17:03:05 +03:00
Kumataro	aae77b65a5	Merge pull request #25257 from Kumataro:fix25256 3rdparty: libtiff: fix for small version expression problems for built-in tiff460 #25257 Close #25256 1. fix to show build-int libtiff version 2. fix to set value of LIBTIFF_VERSION define. (RELEASE-DATE file coms from original libtiff 4.6.0) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-03-25 08:58:30 +03:00
Michael Klatis	52f3f5a3f6	libtiff upgrade to version 4.6.0 (#25096 ) * libtiff upgrade to version 4.6.0 * fix tiffvers.h cmake generation * temp: force build 3rd party deps from source * remove libport.h and spintf.c * cmake fixes * don't use tiff_dummy_namespace on windows * introduce numeric_types namespace alias * include cstdint * uint16_t is not a numeric_types type * fix uint16 and uint32 type defs * use standard c++ types * remove unused files * remove more unused files * revert build 3rd party code from source --------- Co-authored-by: Misha Klatis <misha.klatis@autodesk.com>	2024-03-22 04:08:16 +03:00
Giorgos Karagounis	2fd011b6ff	Fix for jpegturbo Windows build	2024-03-18 16:37:26 +01:00
Michael Klatis	bc82959184	Merge pull request #25123 from klatism:zlib-upgrade Zlib upgrade to version 1.3.1 #25123 Upgrade zlib dependency from 1.3.0 to 1.3.1 - [X] I agree to contribute to the project under Apache 2 License. - [X] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake Co-authored-by: Misha Klatis <misha.klatis@autodesk.com>	2024-03-05 12:40:21 +03:00
Alexander Smorkalov	facb66b8a7	Fixed lcense and readme filenames after TBB update	2024-02-27 12:06:55 +03:00
Alexander Smorkalov	4a9b0f2bf4	Merge pull request #25098 from klatism:tbb-lib-upgrade upgrade tbb to version 2021.11.0	2024-02-26 15:06:49 +03:00
Misha Klatis	b5c6956f31	upgrade tbb to version 2021.11.0	2024-02-25 21:51:56 -08:00
Ryo H	e4ef88c623	Fixed typo.	2024-02-24 18:03:18 +09:00
Alexander Smorkalov	b24e365454	Suppress TBB compilation warnings visible during Android SDK build.	2024-02-13 15:19:25 +03:00
Alexander Alekhin	073c2d9964	Merge remote-tracking branch 'upstream/3.4'	2024-02-12 18:09:16 +00:00
eplankin	3c73557dac	Update IPPICV binaries (20231206)	2024-02-09 04:05:17 -08:00
Letu Ren	0de26fd78e	Add zlib-ng as an alternative zlib implementation Zlib-ng is zlib replacement with optimizations for "next generation" systems. Its optimization may benifits image library decode and encode speed such as libpng. In our tests, if using zlib-ng and libpng combination on a x86_64 machine with AVX2, the time of `imdecode` amd `imencode` will drop 20% approximately. This patch enables zlib-ng's optimization if `CV_DISABLE_OPTIMIZATION` is OFF. Since Zlib-ng can dispatch intrinsics on the fly, port work is much easier. Related discussion: https://github.com/opencv/opencv/issues/22573	2024-01-14 14:58:47 +08:00
Alexander Alekhin	43007af42e	ffmpeg/4.x: update FFmpeg wrapper 2023.12	2023-12-25 16:49:17 +00:00
Kumataro	dba7186378	Merge pull request #24271 from Kumataro:fix24163 Fix to convert float32 to int32/uint32 with rounding to nearest (ties to even). #24271 Fix https://github.com/opencv/opencv/issues/24163 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake (carotene is BSD)	2023-12-25 12:17:17 +03:00
Letu Ren	baf07f9a03	Update 3rdparty zlib to 1.3 The opencv zlib patch is still not merged. So I applied it to zlib-1.3. Since zlib-1.3 didn't modify CMakeLists.txt, opencv's zlib CMakeLists.txt doesn't need to be modified.	2023-12-19 14:48:39 +08:00
Kumataro	ea3164ac77	3rdparty: update libtiff URLs	2023-12-16 10:23:22 +09:00
Alexander Smorkalov	1cca4bef64	Merge pull request #24689 from FantasqueX:update-libspng-0-7-4 Update 3rdparty libspng to 0.7.4	2023-12-15 17:53:42 +03:00
Alexander Alekhin	2e37a697b9	Merge pull request #24693 from FantasqueX:fix-spng-homepage	2023-12-13 12:22:25 +00:00
Letu Ren	992c527c91	Fix homepage of libspng The current one cannot be reached. I replaced it with the one I found in official Github repository https://github.com/randy408/libspng	2023-12-13 11:51:19 +08:00
Letu Ren	46743c1701	Update 3rdparty libspng to 0.7.4 Include three fixes: `e68ba5df57` `6c7c8ce372` `c9451cab66` and three chores: `d86a11d3ab` `bab9f94a11` `fb768002d4` I have reviewed CMakeLists.txt and I think disabling unused-variable warning is not needed.	2023-12-12 23:05:02 +08:00
Peter Johnson	72a987ef5a	Merge pull request #24205 from PeterJohnson:fix-msvc-arm64 ht_dec.c: Improve MSVC arm64 popcount performance #24205 Use NEON instructions for ARM64 (implementation based on https://github.com/microsoft/STL/pull/2127, which is Apache licensed). Godbolt output here: https://godbolt.org/z/q7GPTqT14 Related patch to openjpeg: https://github.com/uclouvain/openjpeg/pull/1479 ### Pull Request Readiness Checklist - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch	2023-12-10 19:59:20 +03:00
Tomoaki Teshima	2146bbb7bd	fix warning in Clang 10	2023-12-07 21:35:44 +09:00
Alexander Alekhin	e202116b56	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2023-10-31 14:52:49 +00:00
eplankin	cac1695099	Update IPPICV binaries (20230919)	2023-10-25 10:01:19 -07:00
Sean McBride	5fb3869775	Merge pull request #23109 from seanm:misc-warnings * Fixed clang -Wnewline-eof warnings * Fixed all trivial clang -Wextra-semi and -Wc++98-compat-extra-semi warnings * Removed trailing semi from various macros * Fixed various -Wunused-macros warnings * Fixed some trivial -Wdocumentation warnings * Fixed some -Wdocumentation-deprecated-sync warnings * Fixed incorrect indentation * Suppressed some clang warnings in 3rd party code * Fixed QRCodeEncoder::Params documentation. --------- Co-authored-by: Alexander Smorkalov <alexander.smorkalov@xperience.ai>	2023-10-06 13:33:21 +03:00
Vincent Rabaud	687fc11626	Merge pull request #24274 from vrabaud:webp_1.3.2 Merge pull request #24274 from vrabaud:webp_1.3.2 Bump libwebp to 1.3.2 #24274 This is version [c1ffd9a](`c1ffd9ac75`) It is 1.3.2 with a few patches that were made right after to help compilation. No need for patches on the OpenCV side! ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch	2023-09-15 14:25:49 +03:00

1 2 3 4 5 ...

926 Commits