opencv/modules/core/perf
Daniil Anufriev b5f5540e8a
Merge pull request #26886 from sk1er52:feature/exp64f
Enable SIMD_SCALABLE for exp and sqrt #26886

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [x] There is a reference to the original bug report and related work
- [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
      Patch to opencv_extra has the same branch name.
- [x] The feature is well documented and sample code can be built with the project CMake
```
CPU - Banana Pi k1, compiler - clang 18.1.4
```
```
Geometric mean (ms)

              Name of Test               baseline  hal     ui      hal         ui    
                                                                    vs         vs
                                                                 baseline   baseline
                                                                (x-factor) (x-factor)
Exp::ExpFixture::(127x61, 32FC1)          0.358     --   0.033      --       10.70   
Exp::ExpFixture::(640x480, 32FC1)         14.304    --   1.167      --       12.26   
Exp::ExpFixture::(1280x720, 32FC1)        42.785    --   3.538      --       12.09
Exp::ExpFixture::(1920x1080, 32FC1)       96.206    --   7.927      --       12.14   
Exp::ExpFixture::(127x61, 64FC1)          0.433   0.050  0.098     8.59       4.40   
Exp::ExpFixture::(640x480, 64FC1)         17.315  1.935  3.813     8.95       4.54   
Exp::ExpFixture::(1280x720, 64FC1)        52.181  5.877  11.519    8.88       4.53   
Exp::ExpFixture::(1920x1080, 64FC1)      117.082  13.157 25.854    8.90       4.53
```
Additionally, this PR brings Sqrt optimization with UI:
```
Geometric mean (ms)

              Name of Test                     baseline    ui       ui    
                                                                    vs
                                                                 baseline
                                                                (x-factor)
Sqrt::SqrtFixture::(127x61, 5, false)            0.111   0.027     4.11   
Sqrt::SqrtFixture::(127x61, 6, false)            0.149   0.053     2.82   
Sqrt::SqrtFixture::(640x480, 5, false)           4.374   0.967     4.52   
Sqrt::SqrtFixture::(640x480, 6, false)           5.885   2.046     2.88   
Sqrt::SqrtFixture::(1280x720, 5, false)          12.960  2.915     4.45   
Sqrt::SqrtFixture::(1280x720, 6, false)          17.648  6.107     2.89   
Sqrt::SqrtFixture::(1920x1080, 5, false)         29.178  6.524     4.47   
Sqrt::SqrtFixture::(1920x1080, 6, false)         39.709  13.670    2.90   
```

Reference
Muller, J.-M. Elementary Functions: Algorithms and Implementation. 2nd ed. Boston: Birkhäuser, 2006.
https://www.springer.com/gp/book/9780817643720
2025-02-21 17:36:54 +03:00
..
cuda ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
opencl Merge pull request #26115 from savuor:rv/flip_ocl_dtypes 2024-09-06 08:26:00 +03:00
perf_abs.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_addWeighted.cpp Merge pull request #12411 from vpisarev:wide_convert 2018-09-06 19:36:59 +03:00
perf_allocation.cpp Merge pull request #23109 from seanm:misc-warnings 2023-10-06 13:33:21 +03:00
perf_arithm.cpp Merge pull request #26886 from sk1er52:feature/exp64f 2025-02-21 17:36:54 +03:00
perf_bitwise.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_compare.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_convertTo.cpp Merge pull request #12411 from vpisarev:wide_convert 2018-09-06 19:36:59 +03:00
perf_cvround.cpp fast_math: add extra perf/unit tests 2019-08-07 14:59:46 -05:00
perf_dft.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_dot.cpp Merge pull request #15510 from seiko2plus:issue15506 2019-10-07 22:01:35 +03:00
perf_inRange.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_io_base64.cpp core: disable I/O perf test 2019-02-27 18:07:45 +03:00
perf_lut.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_main.cpp Merge pull request #11897 from Jakub-Golinowski:hpx_backend 2018-08-31 16:23:26 +03:00
perf_mat.cpp Utilize CV_UNUSED macro 2018-09-07 20:33:52 +09:00
perf_math.cpp Merge pull request #25450 from savuor:rv/svd_perf 2024-04-27 14:33:13 +03:00
perf_merge.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_minmaxloc.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_norm.cpp Merge pull request #13317 from terfendail:norm_wintr 2018-11-29 19:34:14 +03:00
perf_precomp.hpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_reduce.cpp Remove useless C headers 2025-01-13 16:34:28 +01:00
perf_sort.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00
perf_split.cpp Merge pull request #12437 from vpisarev:avx2_fixes 2018-09-06 18:56:55 +03:00
perf_stat.cpp Merge pull request #22947 from chacha21:hasNonZero 2023-06-09 13:37:20 +03:00
perf_umat.cpp ts: refactor OpenCV tests 2018-02-03 19:39:47 +00:00