opencv/modules/core/perf
Yuantao Feng 603b1cafdf
Merge pull request #26821 from fengyuentau:core/transform_simd
Core: vectorize cv::transform in terms of all data types #26821

## Performance

### i7-12700K

```
Geometric mean (ms)

                      Name of Test                       base  patch   patch
                                                                         vs
                                                                        base
                                                                     (x-factor)
Mat_Transform::Size_MatType::(127x61, 8SC3)              0.017 0.004    4.64
Mat_Transform::Size_MatType::(127x61, 16SC3)             0.015 0.004    3.78
Mat_Transform::Size_MatType::(127x61, 32SC3)             0.015 0.007    2.03
Mat_Transform::Size_MatType::(127x61, 64FC3)             0.007 0.004    1.78
Mat_Transform::Size_MatType::(640x480, 8SC3)             0.673 0.140    4.80
Mat_Transform::Size_MatType::(640x480, 16SC3)            0.618 0.158    3.90
Mat_Transform::Size_MatType::(640x480, 32SC3)            0.579 0.278    2.08
Mat_Transform::Size_MatType::(640x480, 64FC3)            0.290 0.266    1.09
Mat_Transform::Size_MatType::(1280x720, 8SC3)            1.919 0.414    4.63
Mat_Transform::Size_MatType::(1280x720, 16SC3)           1.811 0.488    3.71
Mat_Transform::Size_MatType::(1280x720, 32SC3)           1.736 0.917    1.89
Mat_Transform::Size_MatType::(1280x720, 64FC3)           2.310 2.030    1.14
Mat_Transform::Size_MatType::(1920x1080, 8SC3)           4.339 0.924    4.70
Mat_Transform::Size_MatType::(1920x1080, 16SC3)          4.095 1.288    3.18
Mat_Transform::Size_MatType::(1920x1080, 32SC3)          4.267 3.191    1.34
Mat_Transform::Size_MatType::(1920x1080, 64FC3)          6.641 5.481    1.21
Mat_Transform_Diagonal::Size_MatType::(640x480, 8SC3)    0.415 0.104    3.98
Mat_Transform_Diagonal::Size_MatType::(640x480, 16SC3)   0.385 0.128    3.00
Mat_Transform_Diagonal::Size_MatType::(640x480, 32SC3)   0.389 0.225    1.72
Mat_Transform_Diagonal::Size_MatType::(640x480, 64FC3)   0.279 0.275    1.01
Mat_Transform_Diagonal::Size_MatType::(1280x720, 8SC3)   1.223 0.313    3.91
Mat_Transform_Diagonal::Size_MatType::(1280x720, 16SC3)  1.118 0.387    2.89
Mat_Transform_Diagonal::Size_MatType::(1280x720, 32SC3)  1.215 0.801    1.52
Mat_Transform_Diagonal::Size_MatType::(1280x720, 64FC3)  2.198 1.900    1.16
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 8SC3)  2.772 0.705    3.93
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 16SC3) 2.572 1.134    2.27
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 32SC3) 3.477 3.276    1.06
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 64FC3) 5.984 5.186    1.15
```

### A311D

```
Geometric mean (ms)

                      Name of Test                        base  patch    patch
                                                                           vs
                                                                          base
                                                                       (x-factor)
Mat_Transform::Size_MatType::(127x61, 8SC3)              0.143  0.035     4.05
Mat_Transform::Size_MatType::(127x61, 16SC3)             0.135  0.037     3.67
Mat_Transform::Size_MatType::(127x61, 32SC3)             0.110  0.062     1.77
Mat_Transform::Size_MatType::(127x61, 64FC3)             0.034  0.039     0.89
Mat_Transform::Size_MatType::(640x480, 8SC3)             5.673  1.395     4.07
Mat_Transform::Size_MatType::(640x480, 16SC3)            5.331  1.439     3.70
Mat_Transform::Size_MatType::(640x480, 32SC3)            4.329  2.472     1.75
Mat_Transform::Size_MatType::(640x480, 64FC3)            1.560  2.316     0.67
Mat_Transform::Size_MatType::(1280x720, 8SC3)            17.002 4.139     4.11
Mat_Transform::Size_MatType::(1280x720, 16SC3)           15.996 4.308     3.71
Mat_Transform::Size_MatType::(1280x720, 32SC3)           12.948 7.241     1.79
Mat_Transform::Size_MatType::(1280x720, 64FC3)           4.742  7.376     0.64
Mat_Transform::Size_MatType::(1920x1080, 8SC3)           38.253 9.384     4.08
Mat_Transform::Size_MatType::(1920x1080, 16SC3)          35.913 9.750     3.68
Mat_Transform::Size_MatType::(1920x1080, 32SC3)          29.145 16.528    1.76
Mat_Transform::Size_MatType::(1920x1080, 64FC3)          10.606 20.968    0.51
Mat_Transform_Diagonal::Size_MatType::(640x480, 8SC3)    4.439  1.086     4.09
Mat_Transform_Diagonal::Size_MatType::(640x480, 16SC3)   4.251  1.136     3.74
Mat_Transform_Diagonal::Size_MatType::(640x480, 32SC3)   3.786  1.999     1.89
Mat_Transform_Diagonal::Size_MatType::(640x480, 64FC3)   1.555  1.551     1.00
Mat_Transform_Diagonal::Size_MatType::(1280x720, 8SC3)   13.319 3.243     4.11
Mat_Transform_Diagonal::Size_MatType::(1280x720, 16SC3)  12.828 3.398     3.78
Mat_Transform_Diagonal::Size_MatType::(1280x720, 32SC3)  11.336 5.989     1.89
Mat_Transform_Diagonal::Size_MatType::(1280x720, 64FC3)  4.707  4.690     1.00
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 8SC3)  29.952 7.293     4.11
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 16SC3) 28.817 7.656     3.76
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 32SC3) 25.476 13.388    1.90
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 64FC3) 10.533 10.509    1.00
```

### M2

```
Geometric mean (ms)

                      Name of Test                       base  patch   patch
                                                                         vs
                                                                        base
                                                                     (x-factor)
Mat_Transform::Size_MatType::(127x61, 8SC3)              0.020 0.004    4.45
Mat_Transform::Size_MatType::(127x61, 16SC3)             0.016 0.004    4.48
Mat_Transform::Size_MatType::(127x61, 32SC3)             0.016 0.007    2.23
Mat_Transform::Size_MatType::(127x61, 64FC3)             0.007 0.006    1.20
Mat_Transform::Size_MatType::(640x480, 8SC3)             0.793 0.197    4.03
Mat_Transform::Size_MatType::(640x480, 16SC3)            0.626 0.154    4.08
Mat_Transform::Size_MatType::(640x480, 32SC3)            0.627 0.306    2.05
Mat_Transform::Size_MatType::(640x480, 64FC3)            0.273 0.253    1.08
Mat_Transform::Size_MatType::(1280x720, 8SC3)            2.350 0.540    4.35
Mat_Transform::Size_MatType::(1280x720, 16SC3)           1.875 0.415    4.52
Mat_Transform::Size_MatType::(1280x720, 32SC3)           1.893 0.844    2.24
Mat_Transform::Size_MatType::(1280x720, 64FC3)           0.830 0.808    1.03
Mat_Transform::Size_MatType::(1920x1080, 8SC3)           5.302 1.178    4.50
Mat_Transform::Size_MatType::(1920x1080, 16SC3)          4.475 0.946    4.73
Mat_Transform::Size_MatType::(1920x1080, 32SC3)          4.409 1.864    2.37
Mat_Transform::Size_MatType::(1920x1080, 64FC3)          1.853 1.512    1.23
Mat_Transform_Diagonal::Size_MatType::(640x480, 8SC3)    0.586 0.110    5.32
Mat_Transform_Diagonal::Size_MatType::(640x480, 16SC3)   0.518 0.110    4.69
Mat_Transform_Diagonal::Size_MatType::(640x480, 32SC3)   0.430 0.220    1.95
Mat_Transform_Diagonal::Size_MatType::(640x480, 64FC3)   0.228 0.178    1.28
Mat_Transform_Diagonal::Size_MatType::(1280x720, 8SC3)   1.768 0.336    5.26
Mat_Transform_Diagonal::Size_MatType::(1280x720, 16SC3)  1.514 0.335    4.52
Mat_Transform_Diagonal::Size_MatType::(1280x720, 32SC3)  1.292 0.670    1.93
Mat_Transform_Diagonal::Size_MatType::(1280x720, 64FC3)  0.681 0.579    1.18
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 8SC3)  3.998 0.747    5.35
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 16SC3) 3.392 0.757    4.48
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 32SC3) 2.956 1.491    1.98
Mat_Transform_Diagonal::Size_MatType::(1920x1080, 64FC3) 1.546 1.476    1.05
```


### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
      Patch to opencv_extra has the same branch name.
- [ ] The feature is well documented and sample code can be built with the project CMake
2025-02-06 13:38:16 +03:00
..
cuda ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
opencl Merge pull request #26327 from asmorkalov:as/drop_convertFp16 2024-10-22 15:17:24 +03:00
perf_abs.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_addWeighted.cpp Merge pull request #12411 from vpisarev:wide_convert 2018-09-06 19:36:59 +03:00
perf_allocation.cpp Merge pull request #23109 from seanm:misc-warnings 2023-10-06 13:33:21 +03:00
perf_arithm.cpp Merge branch 4.x 2024-11-28 13:20:39 +03:00
perf_bitwise.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_compare.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_convertTo.cpp Merge pull request #12411 from vpisarev:wide_convert 2018-09-06 19:36:59 +03:00
perf_cvround.cpp fast_math: add extra perf/unit tests 2019-08-07 14:59:46 -05:00
perf_dft.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_dot.cpp Merge pull request #15510 from seiko2plus:issue15506 2019-10-07 22:01:35 +03:00
perf_inRange.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_io_base64.cpp core: disable I/O perf test 2019-02-27 18:07:45 +03:00
perf_lut.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_main.cpp Merge pull request #11897 from Jakub-Golinowski:hpx_backend 2018-08-31 16:23:26 +03:00
perf_mat.cpp Merge pull request #26821 from fengyuentau:core/transform_simd 2025-02-06 13:38:16 +03:00
perf_math.cpp Merge pull request #25450 from savuor:rv/svd_perf 2024-04-27 14:33:13 +03:00
perf_merge.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_minmaxloc.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_norm.cpp Merge pull request #13317 from terfendail:norm_wintr 2018-11-29 19:34:14 +03:00
perf_precomp.hpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_reduce.cpp C-API cleanup: core module tests 2024-11-11 14:53:09 +03:00
perf_sort.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_split.cpp Merge pull request #12437 from vpisarev:avx2_fixes 2018-09-06 18:56:55 +03:00
perf_stat.cpp Merge pull request #22947 from chacha21:hasNonZero 2023-06-09 13:37:20 +03:00
perf_umat.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00