mirror of https://github.com/opencv/opencv.git
synced 2024-11-26 20:20:20 +08:00
814550d2a6
Improve performance on Arm64

* Improve performance on Apple silicon

This patch will
- Enable dot product intrinsics for macOS arm64 builds
- Enable for macOS arm64 builds
- Improve HAL primitives
  - reduction (sum, min, max, sad)
  - signmask
  - mul_expand
  - check_any / check_all

Results on an M1 MacBook Pro

* Updates to #20011 based on feedback

- Removes Apple Silicon specific workarounds
- Makes #ifdef sections smaller for v_mul_expand cases
- Moves dot product optimization to compiler optimization check
- Adds 4x4 matrix transpose optimization

* Remove dotprod and fix v_transpose

Based on the latest, we've removed dotprod entirely and will revisit in a future PR.

Added explicit casts with v_transpose4x4()

This should resolve all opens with this PR

* Remove commented out lines

Remove two extraneous comments

Name | ||
---|---|---|
ocl | ||
test_arithm.cpp | ||
test_async.cpp | ||
test_concatenation.cpp | ||
test_conjugate_gradient.cpp | ||
test_countnonzero.cpp | ||
test_downhill_simplex.cpp | ||
test_ds.cpp | ||
test_dxt.cpp | ||
test_eigen.cpp | ||
test_hal_core.cpp | ||
test_intrin128.simd.hpp | ||
test_intrin256.simd.hpp | ||
test_intrin512.simd.hpp | ||
test_intrin_emulator.cpp | ||
test_intrin_utils.hpp | ||
test_intrin.cpp | ||
test_io.cpp | ||
test_lpsolver.cpp | ||
test_main.cpp | ||
test_mat.cpp | ||
test_math.cpp | ||
test_misc.cpp | ||
test_operations.cpp | ||
test_precomp.hpp | ||
test_ptr.cpp | ||
test_rand.cpp | ||
test_rotatedrect.cpp | ||
test_umat.cpp | ||
test_utils_tls.impl.hpp | ||
test_utils.cpp |