opencv

mirror of https://github.com/opencv/opencv.git synced 2025-06-23 20:21:40 +08:00


GenshinImpactStarts 6a6a5a765d Merge pull request #26943 from GenshinImpactStarts:flip_hal_rvv

Impl RISC-V HAL for cv::flip | Add perf test for flip #26943

Implement through the existing `cv_hal_flip` interfaces.

Add perf test for `cv::flip`. Reasons for selecting these test arguments:
- size: copied from perf_lut
- type:
  - U8C1: basic situation
  - U8C3: unaligned element size
  - U8C4: large element size

Tested on
- MUSE-PI (vlen=256)
- Compiler: gcc 14.2 (riscv-collab/riscv-gnu-toolchain Nightly: December 16, 2024)

```sh
$ opencv_test_core --gtest_filter="Core_Flip/ElemWiseTest."
$ opencv_perf_core --gtest_filter="Size_MatType_FlipCode" --perf_min_samples=300 --perf_force_samples=300
```

```
Geometric mean (ms)

                      Name of Test                        scalar      ui     rvv      ui        rvv
                                                                                      vs        vs
                                                                                    scalar    scalar
                                                                                  (x-factor) (x-factor)
flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_X)       0.026   0.033   0.031     0.81      0.84
flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_XY)      0.206   0.212   0.091     0.97      2.26
flip::Size_MatType_FlipCode::(320x240, 8UC1, FLIP_Y)       0.185   0.189   0.082     0.98      2.25
flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_X)       0.070   0.084   0.084     0.83      0.83
flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_XY)      0.616   0.612   0.235     1.01      2.62
flip::Size_MatType_FlipCode::(320x240, 8UC3, FLIP_Y)       0.587   0.603   0.204     0.97      2.88
flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_X)       0.263   0.110   0.109     2.40      2.41
flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_XY)      0.930   0.831   0.316     1.12      2.95
flip::Size_MatType_FlipCode::(320x240, 8UC4, FLIP_Y)       1.175   1.129   0.313     1.04      3.75
flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_X)       0.303   0.118   0.111     2.57      2.73
flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_XY)      0.949   0.836   0.405     1.14      2.34
flip::Size_MatType_FlipCode::(640x480, 8UC1, FLIP_Y)       0.784   0.783   0.409     1.00      1.92
flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_X)       1.084   0.360   0.355     3.01      3.06
flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_XY)      3.768   3.348   1.364     1.13      2.76
flip::Size_MatType_FlipCode::(640x480, 8UC3, FLIP_Y)       4.361   4.473   1.296     0.97      3.37
flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_X)       1.252   0.469   0.451     2.67      2.78
flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_XY)      5.732   5.220   1.303     1.10      4.40
flip::Size_MatType_FlipCode::(640x480, 8UC4, FLIP_Y)       5.041   5.105   1.203     0.99      4.19
flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_X)     2.382   0.903   0.903     2.64      2.64
flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_XY)    8.606   7.508   2.581     1.15      3.33
flip::Size_MatType_FlipCode::(1920x1080, 8UC1, FLIP_Y)     8.421   8.535   2.219     0.99      3.80
flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_X)     6.312   2.416   2.429     2.61      2.60
flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_XY)   29.174  26.055  12.761     1.12      2.29
flip::Size_MatType_FlipCode::(1920x1080, 8UC3, FLIP_Y)    25.373  25.500  13.382     1.00      1.90
flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_X)     7.620   3.204   3.115     2.38      2.45
flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_XY)   32.876  29.310  12.976     1.12      2.53
flip::Size_MatType_FlipCode::(1920x1080, 8UC4, FLIP_Y)    28.831  29.094  14.919     0.99      1.93
```

The optimizations for vlen <= 256 and vlen > 256 differ, but I have no real hardware with vlen > 256, so accuracy tests for larger lengths (such as 512 and 1024) were run on QEMU built from `riscv-collab/riscv-gnu-toolchain`.

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [ ] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
      Patch to opencv_extra has the same branch name.
- [ ] The feature is well documented and sample code can be built with the project CMake

2025-02-24 08:56:23 +03:00
..
cuda	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
opencl	Merge pull request #26115 from savuor:rv/flip_ocl_dtypes	2024-09-06 08:26:00 +03:00
perf_abs.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_addWeighted.cpp	Merge pull request #12411 from vpisarev:wide_convert	2018-09-06 19:36:59 +03:00
perf_allocation.cpp	Merge pull request #23109 from seanm:misc-warnings	2023-10-06 13:33:21 +03:00
perf_arithm.cpp	Merge pull request #26886 from sk1er52:feature/exp64f	2025-02-21 17:36:54 +03:00
perf_bitwise.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_compare.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_convertTo.cpp	Merge pull request #12411 from vpisarev:wide_convert	2018-09-06 19:36:59 +03:00
perf_cvround.cpp	fast_math: add extra perf/unit tests	2019-08-07 14:59:46 -05:00
perf_dft.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_dot.cpp	Merge pull request #15510 from seiko2plus:issue15506	2019-10-07 22:01:35 +03:00
perf_flip.cpp	Merge pull request #26943 from GenshinImpactStarts:flip_hal_rvv	2025-02-24 08:56:23 +03:00
perf_inRange.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_io_base64.cpp	core: disable I/O perf test	2019-02-27 18:07:45 +03:00
perf_lut.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_main.cpp	Merge pull request #11897 from Jakub-Golinowski:hpx_backend	2018-08-31 16:23:26 +03:00
perf_mat.cpp	Utilize CV_UNUSED macro	2018-09-07 20:33:52 +09:00
perf_math.cpp	Merge pull request #25450 from savuor:rv/svd_perf	2024-04-27 14:33:13 +03:00
perf_merge.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_minmaxloc.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_norm.cpp	Merge pull request #13317 from terfendail:norm_wintr	2018-11-29 19:34:14 +03:00
perf_precomp.hpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_reduce.cpp	Remove useless C headers	2025-01-13 16:34:28 +01:00
perf_sort.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00
perf_split.cpp	Merge pull request #12437 from vpisarev:avx2_fixes	2018-09-06 18:56:55 +03:00
perf_stat.cpp	Merge pull request #22947 from chacha21:hasNonZero	2023-06-09 13:37:20 +03:00
perf_umat.cpp	ts: refactor OpenCV tests	2018-02-03 19:39:47 +00:00