opencv

mirror of https://github.com/opencv/opencv.git synced 2025-08-06 14:36:36 +08:00

Open Source Computer Vision Library

c-plus-plus computer-vision deep-learning image-processing opencv

Go to file

Alexander Alekhin 40533dbf69 Merge pull request #24918 from opencv-pushbot:gitee/alalek/core_convertfp16_replacement core(OpenCL): optimize convertTo() with CV_16F (convertFp16() replacement) #24918 relates #24909 relates #24917 relates #24892 Performance changes: - [x] 12700K (1 thread) + Intel iGPU \|Name of Test\|noOCL\|convertFp16\|convertTo BASE\|convertTo PATCH\| \|---\|:-:\|:-:\|:-:\|:-:\| \|ConvertFP16FP32MatMat::OCL_Core\|3.130\|3.152\|3.127\|3.136\| \|ConvertFP16FP32MatUMat::OCL_Core\|3.030\|3.996\|3.007\|2.671\| \|ConvertFP16FP32UMatMat::OCL_Core\|3.010\|3.101\|3.056\|2.854\| \|ConvertFP16FP32UMatUMat::OCL_Core\|3.016\|3.298\|2.072\|2.061\| \|ConvertFP32FP16MatMat::OCL_Core\|2.697\|2.652\|2.723\|2.721\| \|ConvertFP32FP16MatUMat::OCL_Core\|2.752\|4.268\|2.662\|2.947\| \|ConvertFP32FP16UMatMat::OCL_Core\|2.706\|2.601\|2.603\|2.528\| \|ConvertFP32FP16UMatUMat::OCL_Core\|2.704\|3.215\|1.999\|1.988\| Patched version is not worse than convertFp16 and convertTo baseline (except MatUMat 32->16, baseline uses CPU code+dst buffer map). There are still gaps against noOpenCL(CPU only) mode due to T-API implementation issues (unnecessary synchronization). - [x] 12700K + AMD dGPU \|Name of Test\|noOCL\|convertFp16 dGPU\|convertTo BASE dGPU\|convertTo PATCH dGPU\| \|---\|:-:\|:-:\|:-:\|:-:\| \|ConvertFP16FP32MatMat::OCL_Core\|3.130\|3.133\|3.172\|3.087\| \|ConvertFP16FP32MatUMat::OCL_Core\|3.030\|1.713\|9.559\|1.729\| \|ConvertFP16FP32UMatMat::OCL_Core\|3.010\|6.515\|6.309\|4.452\| \|ConvertFP16FP32UMatUMat::OCL_Core\|3.016\|0.242\|23.597\|0.170\| \|ConvertFP32FP16MatMat::OCL_Core\|2.697\|2.641\|2.713\|2.689\| \|ConvertFP32FP16MatUMat::OCL_Core\|2.752\|4.076\|6.483\|4.191\| \|ConvertFP32FP16UMatMat::OCL_Core\|2.706\|9.042\|16.481\|1.834\| \|ConvertFP32FP16UMatUMat::OCL_Core\|2.704\|0.229\|15.730\|0.176\| convertTo-baseline can't compile OpenCL kernel for FP16 properly - FIXED. dGPU has much more power, so results are x16-17 better than single cpu core. Patched version is not worse than convertFp16 and convertTo baseline. There are still gaps against noOpenCL(CPU only) mode due to T-API implementation issues (unnecessary synchronization) and required memory transfers. Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>		2024-01-26 12:56:52 +03:00
.github	CI: enable RISC-V for 4.x branch	2023-10-16 09:28:37 +03:00
3rdparty	ffmpeg/4.x: update FFmpeg wrapper 2023.12	2023-12-25 16:49:17 +00:00
apps	Add missing <sstream> includes	2023-09-05 22:04:26 +03:00
cmake	Downgrade LIMITED_API_VERSION, if python3 is older than 3.6.	2024-01-17 13:09:08 +03:00
data	Merge pull request #22727 from su77ungr:patch-1	2022-11-17 06:54:25 +00:00
doc	Update windows_install.markdown - add set opencv path for vc17	2024-01-20 19:57:23 -06:00
include	exclude opencv_contrib modules	2020-02-26 15:12:45 +03:00
modules	Merge pull request #24918 from opencv-pushbot:gitee/alalek/core_convertfp16_replacement	2024-01-26 12:56:52 +03:00
platforms	RISC-V: updated intrin_rvv071.hpp to work with modern toolchain 2.8.0	2024-01-17 20:34:12 +03:00
samples	Merge pull request #23736 from seanm:c++11-simplifications	2024-01-19 16:53:08 +03:00
.editorconfig	add .editorconfig	2018-10-11 17:57:51 +00:00
.gitattributes	cmake: generate and install ffmpeg-download.ps1	2018-06-09 13:19:48 +03:00
.gitignore	Merge pull request #17165 from komakai:objc-binding	2020-06-08 18:32:53 +00:00
CMakeLists.txt	Manage Python Limited API version externally.	2024-01-10 12:30:29 +03:00
CONTRIBUTING.md	migration: github.com/opencv/opencv	2016-07-12 12:51:12 +03:00
COPYRIGHT	copyright: 2023 (update)	2023-01-09 09:49:22 +00:00
LICENSE	Merge pull request #18073 from vpisarev:apache2_license	2020-08-17 11:49:11 +00:00
README.md	Update README.md - remove IndieGoGo, add Support OpenCV page	2024-01-10 10:18:52 -08:00
SECURITY.md	Updated PGP key for security reports	2023-04-19 19:16:55 +03:00

README.md

OpenCV: Open Source Computer Vision Library

Resources

Homepage: https://opencv.org
- Courses: https://opencv.org/courses
Docs: https://docs.opencv.org/4.x/
Q&A forum: https://forum.opencv.org
- previous forum (read only): http://answers.opencv.org
Issue tracking: https://github.com/opencv/opencv/issues
Additional OpenCV functionality: https://github.com/opencv/opencv_contrib
Donate to OpenCV: https://opencv.org/support/

Contributing

Please read the contribution guidelines before starting work on a pull request.

Summary of the guidelines:

One pull request per issue;
Choose the right base branch;
Include tests and documentation;
Clean up "oops" commits before submitting;
Follow the coding style guide.

Additional Resources

Submit your OpenCV-based project for inclusion in Community Friday on opencv.org
Subscribe to the OpenCV YouTube Channel featuring OpenCV Live, an hour-long streaming show
Follow OpenCV on LinkedIn for daily posts showing the state-of-the-art in computer vision &AI
Apply to be an OpenCV Volunteer to help organize events and online campaigns as well as amplify them
Follow OpenCV on Mastodon in the Fediverse
Follow OpenCV on Twitter
OpenCV.ai: Computer Vision and AI development services from the OpenCV team.