opencv

mirror of https://github.com/opencv/opencv.git synced 2025-06-07 09:25:45 +08:00

Open Source Computer Vision Library

c-plus-plus computer-vision deep-learning image-processing opencv


alexlyulkov 1d1faaabef Merge pull request #24411 from alexlyulkov:al/dnn-type-inference	2024-03-01 17:07:38 +03:00

Added int32, int64 support and type inference to dnn #24411

Added type inference to dnn, similar to the shape inference, and added int32 and int64 support.

- Added a getTypes method for layers that calculates the types of layer outputs and internals from the input types (similar to getMemoryShapes). By default, the output and internal types equal the type of input[0].
- Added a type inference pipeline similar to the shape inference pipeline. The LayersShapes struct (used in the shape inference pipeline) now contains both shapes and types.
- All layer output blobs are now allocated using the types calculated by type inference.
- Inputs and constants with int32 and int64 types are no longer automatically converted to float32.
- Added int32 and int64 support for all layers with indexing and for all layers required in tests.

Added int32 and int64 support for CUDA:

- Added host<->device data moving for int32 and int64.
- Added int32 and int64 support for several layers (just slightly modified CUDA C++ templates).

Passed all the accuracy tests on CPU, OCL, OCL_FP16, CUDA, CUDA_FP16 (except the RAFT model).

CURRENT PROBLEMS:

- The ONNX parser always converts int64 constants and layer attributes to int32, so some models with int64 constants don't work (e.g. RAFT). The solution is to disable the int64->int32 conversion and fix attribute reading in many ONNX layer parsers (https://github.com/opencv/opencv/issues/25102).
- I didn't add type inference and int support to Vulkan, so it doesn't work at all now.
- Some layers don't support int yet, so some unknown models may not work.

CURRENT WORKAROUNDS:

- CPU arg_layer indices are implemented in int32 followed by an int32->int64 conversion (the master branch has the same workaround with an int32->float conversion).
- CPU and OCL pooling_layer indices are implemented in float followed by a float->int64 conversion.
- CPU gather_layer indices are implemented in int32, so int64 indices are converted to int32 (the master branch has the same workaround with a float->int32 conversion).

DISABLED TESTS:

- RAFT model

REMOVED TESTS:

- Greater_input_dtype_int64 (because it doesn't fit ONNX rules; the whole test just compares a float tensor with an int constant)

TODO IN NEXT PULL REQUESTS:

- Add int64 support for the ONNX parser
- Add int support for more layers
- Add int support for OCL (currently int layers just run on CPU)
- Add int tests
- Add int support for other backends
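
The type inference rule described in the commit message can be illustrated with a small standalone sketch. This is not OpenCV's code: `ElemType`, `ToyLayer`, `ToyArgMax`, `inferChainTypes`, and `argMaxWidened` are hypothetical names invented for illustration, and the `getTypes` signature here is simplified rather than copied from the dnn module. The sketch assumes a linear chain of single-input layers, shows the default rule (outputs take the type of input[0]) with one layer overriding it, and mimics the arg_layer workaround of computing an index in int32 and then widening it to int64.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical element types; these are not OpenCV's type constants.
enum class ElemType { Float32, Int32, Int64 };

// Hypothetical layer interface, loosely modelled on the getTypes description.
struct ToyLayer {
    virtual ~ToyLayer() = default;

    // Default rule from the commit message: every output takes the type of input[0].
    virtual std::vector<ElemType> getTypes(const std::vector<ElemType>& inputs,
                                           std::size_t requiredOutputs) const {
        if (inputs.empty())
            throw std::invalid_argument("layer needs at least one input");
        return std::vector<ElemType>(requiredOutputs, inputs[0]);
    }
};

// A layer that produces indices overrides the default and reports int64 outputs.
struct ToyArgMax : ToyLayer {
    std::vector<ElemType> getTypes(const std::vector<ElemType>& inputs,
                                   std::size_t requiredOutputs) const override {
        if (inputs.empty())
            throw std::invalid_argument("layer needs at least one input");
        return std::vector<ElemType>(requiredOutputs, ElemType::Int64);
    }
};

// Propagates types through a linear chain, the way shape inference would
// propagate shapes. Returns the output type of each layer in order.
std::vector<ElemType> inferChainTypes(const std::vector<const ToyLayer*>& chain,
                                      ElemType netInput) {
    std::vector<ElemType> result;
    ElemType current = netInput;
    for (const ToyLayer* layer : chain) {
        current = layer->getTypes({current}, 1).front();
        result.push_back(current);
    }
    return result;
}

// Mirrors the arg_layer workaround: find the index in int32, then widen to int64.
std::int64_t argMaxWidened(const std::vector<float>& values) {
    if (values.empty())
        throw std::invalid_argument("values must not be empty");
    std::int32_t best = 0;
    for (std::int32_t i = 1; i < static_cast<std::int32_t>(values.size()); ++i) {
        if (values[i] > values[best])
            best = i;
    }
    return static_cast<std::int64_t>(best);
}
```

In this toy model, a chain of a default layer, `ToyArgMax`, and another default layer fed with `Float32` yields `Float32`, `Int64`, `Int64`: once a layer changes the type, downstream layers using the default rule inherit it.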
.github	Merge pull request #24411 from alexlyulkov:al/dnn-type-inference	2024-03-01 17:07:38 +03:00
3rdparty	Merge branch 4.x	2024-02-28 18:49:05 +03:00
apps	Dump board poses human-readable and machine-readable format.	2024-02-14 15:17:01 +03:00
cmake	Merge branch 4.x	2024-02-28 18:49:05 +03:00
data	Merge pull request #22727 from su77ungr:patch-1	2022-11-17 06:54:25 +00:00
doc	Merge pull request #25017 from kaingwade:ml_to_contrib	2024-02-27 15:54:08 +03:00
include	calib3d module in opencv is split into 3 modules: 3d, calib and stereo.	2020-12-01 23:42:15 +03:00
modules	Merge pull request #24411 from alexlyulkov:al/dnn-type-inference	2024-03-01 17:07:38 +03:00
platforms	Merge branch 4.x	2024-02-26 18:05:36 +03:00
samples	Merge branch 4.x	2024-02-28 18:49:05 +03:00
.editorconfig	add .editorconfig	2018-10-11 17:57:51 +00:00
.gitattributes	cmake: generate and install ffmpeg-download.ps1	2018-06-09 13:19:48 +03:00
.gitignore	Merge pull request #17165 from komakai:objc-binding	2020-06-08 18:32:53 +00:00
CMakeLists.txt	Merge branch 4.x	2024-02-12 14:20:35 +03:00
CONTRIBUTING.md	migration: github.com/opencv/opencv	2016-07-12 12:51:12 +03:00
COPYRIGHT	copyright: 2023 (update)	2023-01-09 09:49:22 +00:00
LICENSE	Merge pull request #18073 from vpisarev:apache2_license	2020-08-17 11:49:11 +00:00
README.md	Merge branch 4.x	2024-01-23 17:06:52 +03:00
SECURITY.md	Updated PGP key for security reports	2023-04-19 19:16:55 +03:00

README.md

OpenCV: Open Source Computer Vision Library

Resources

Homepage: https://opencv.org
- Courses: https://opencv.org/courses
Docs: https://docs.opencv.org/5.x/
Q&A forum: https://forum.opencv.org
- previous forum (read only): http://answers.opencv.org
Issue tracking: https://github.com/opencv/opencv/issues
Additional OpenCV functionality: https://github.com/opencv/opencv_contrib
Donate to OpenCV: https://opencv.org/support/

Contributing

Please read the contribution guidelines before starting work on a pull request.

Summary of the guidelines:

One pull request per issue;
Choose the right base branch;
Include tests and documentation;
Clean up "oops" commits before submitting;
Follow the coding style guide.

Additional Resources

Submit your OpenCV-based project for inclusion in Community Friday on opencv.org
Subscribe to the OpenCV YouTube Channel featuring OpenCV Live, an hour-long streaming show
Follow OpenCV on LinkedIn for daily posts showing the state of the art in computer vision & AI
Apply to be an OpenCV Volunteer to help organize events and online campaigns as well as amplify them
Follow OpenCV on Mastodon in the Fediverse
Follow OpenCV on Twitter
OpenCV.ai: Computer Vision and AI development services from the OpenCV team.