opencv/modules
Maksim Shabunin f37924796f
Merge pull request #25364 from mshabunin:fix-unaligned-filter
imgproc: fix unaligned memory access in filters and Gaussian blur #25364

* filter/SIMD: removed parts which casted 8u pointers to int causing unaligned memory access on RISC-V platform.
* GaussianBlur/fixed_point: replaced casts from s16 to u32 with union operations

Performance comparison:
- [x] check performance on x86_64 - (4 threads, `-DCPU_BASELINE=AVX2`, GCC 11.4, Ubuntu 22) - [report_imgproc_x86_64.ods](https://github.com/opencv/opencv/files/14904702/report_x86_64.ods)
- [x] check performance on AArch64 - (4 cores of RK3588, GCC 11.4 aarch64, Raspbian) - [report_imgproc_aarch64.ods](https://github.com/opencv/opencv/files/14908437/report_aarch64.ods)

Note: for some reason my performance results are quite unstable, unaffected functions show speedups and slowdowns in many cases. Filter2D and GaussianBlur seem to be OK.

Slightly related PR: https://github.com/opencv/ci-gha-workflow/pull/165
2024-04-09 17:44:36 +03:00
..
calib3d Merge pull request #25195 from MaximSmolskiy:use-initial-quads-corners-in-ChessBoardDetector-findQuadNeighbors 2024-04-02 13:13:01 +03:00
core Merge pull request #25342 from asmorkalov:as/HAL_transpose 2024-04-09 09:03:13 +03:00
dnn Merge pull request #25361 from hanliutong:rvv-f32 2024-04-09 16:04:02 +03:00
features2d Merge pull request #25042 from mshabunin:doc-upgrade 2024-03-05 16:19:45 +03:00
flann Merge pull request #25024 from vrabaud:neon 2024-02-20 11:29:23 +03:00
gapi Skip InferROI.TestStreamingInfer as unstable as it hangs on CI time-to-time. 2024-03-29 11:33:12 +03:00
highgui Update highgui+HighGui.java 2024-04-03 13:29:10 +03:00
imgcodecs Export TIFF compression options as API and git rid of tiff.h. 2024-04-03 14:30:11 +03:00
imgproc Merge pull request #25364 from mshabunin:fix-unaligned-filter 2024-04-09 17:44:36 +03:00
java doc: fix formulas in JavaDoc broken after Doxygen upgrade 2024-03-11 23:47:23 +03:00
js Merge pull request #25084 from EDVTAZ:emscripten-3.1.54-compat 2024-02-26 10:30:56 +03:00
ml Partially back-port #25075 to 4.x 2024-03-05 12:15:39 +03:00
objc Merge pull request #24136 from komakai:visionos_support 2023-12-20 15:35:10 +03:00
objdetect Resolve valgrind issue at QRCode detector 2024-04-03 16:58:46 +03:00
photo Merge pull request #25042 from mshabunin:doc-upgrade 2024-03-05 16:19:45 +03:00
python Merge pull request #24415 from thewoz:imread 2024-03-29 10:51:19 +03:00
stitching Partially back-port #25075 to 4.x 2024-03-05 12:15:39 +03:00
ts Partially back-port #25075 to 4.x 2024-03-05 12:15:39 +03:00
video Merge pull request #25092 from klatism:libjpeg-upgrade 2024-03-28 17:03:05 +03:00
videoio Added option to dump v4l2 test frame from virtual camera. 2024-04-04 15:50:27 +03:00
world cmake: use /INCREMENTAL:NO with MSVS 2015 2023-12-07 19:46:27 +00:00