opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-13 07:59:27 +08:00

Author	SHA1	Message	Date
Yuantao Feng	d4fd5157fa	Merge pull request #24980 from fengyuentau:on-fly-quantization-removal dnn cleanup: On-fly-quantization removal #2498 On-fly-quantization was first introduced via https://github.com/opencv/opencv/pull/20228. We decided to remove it but keep the int8 layers implementation because on-fly-quantization is less practical given that there are now so many dedicated tools for model quantization. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-02-16 18:21:45 +03:00
Alexander Smorkalov	fa3f1822ae	Merge pull request #24993 from asmorkalov:as/FastNeuralStyle_eccv16_CUDA Relax test requirements for CUDA in DNNTestNetwork.FastNeuraStyle_eccv16	2024-02-12 16:37:57 +03:00
Alexander Smorkalov	5ce0acce40	Relax test requirements for CUDA in DNNTestNetwork.FastNeuraStyle_eccv16	2024-02-12 14:47:37 +03:00
Alexander Smorkalov	3a55f50133	Merge branch 4.x	2024-02-12 14:20:35 +03:00
Alexander Smorkalov	4b35b2f968	Merge pull request #24973 from asmorkalov:as/fix_weigths_proto_mess Fix proto and weights mess in dnn performance tests	2024-02-07 11:10:32 +03:00
Alexander Smorkalov	77af137285	Fix proto and weights mess in dnn performance tests.	2024-02-07 09:16:09 +03:00
fengyuentau	fcaa8ce3c2	fix incorrect steps and elemsize when dtype changes	2024-02-06 16:27:25 +08:00
Haosonn	87f749277d	Merge pull request #24768 from Haosonn:pre-pr-2 Vulkan backend for NaryEltwiseLayer in DNN module #24768 We improve the Vulkan backend for ``NaryEltwiseLayer`` in DNN module by: - add a basic framework for Vulkan backend in ``NaryEltwiseLayer`` - add a compute shader for binary forwarding (an imitation of what has been done in native OpenCV backend including broadcasting and eltwise-operation) - typo fixed: - Wrong info output in ``context.cpp`` Currently, our implementation (or all layers supporting Vulkan backend) runs quite slowly on discrete GPUs, mainly due to the IO cost in function ``copyToHost``, and we are going to fix that by - find out the best ``VkMemoryProperty`` for various discrete GPUs - prevent ``copyToHost`` in middle layers during forwarding (i.e., keep data in GPU memory) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake Co-authored-by: IskXCr <IskXCr@outlook.com>	2024-01-29 18:41:49 +03:00
Alexander Alekhin	efc9837df1	Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage DNN: avoid CV_16S usage for FP16 #24892 Merge after: #24918 TODO: - [x] measure performance changes - [x] optimize convertTo for OpenCL: #24918 12700K iGPU:
|Name of Test|0|1|1 vs 0 (x-factor)|
|---|:-:|:-:|:-:|
|AlexNet::DNNTestNetwork::OCV/OCL_FP16|7.441|7.480|0.99|
|CRNN::DNNTestNetwork::OCV/OCL_FP16|10.776|10.736|1.00|
|DenseNet_121::DNNTestNetwork::OCV/OCL_FP16|52.762|52.833|1.00|
|EAST_text_detection::DNNTestNetwork::OCV/OCL_FP16|60.694|60.721|1.00|
|EfficientNet::DNNTestNetwork::OCV/OCL_FP16|33.373|33.173|1.01|
|FastNeuralStyle_eccv16::DNNTestNetwork::OCV/OCL_FP16|81.840|81.724|1.00|
|GoogLeNet::DNNTestNetwork::OCV/OCL_FP16|20.965|20.927|1.00|
|Inception_5h::DNNTestNetwork::OCV/OCL_FP16|22.204|22.173|1.00|
|Inception_v2_SSD_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|47.115|47.460|0.99|
|MPHand::DNNTestNetwork::OCV/OCL_FP16|6.760|6.670|1.01|
|MPPalm::DNNTestNetwork::OCV/OCL_FP16|10.188|10.171|1.00|
|MPPose::DNNTestNetwork::OCV/OCL_FP16|12.510|12.561|1.00|
|MobileNet_SSD_Caffe::DNNTestNetwork::OCV/OCL_FP16|17.290|17.072|1.01|
|MobileNet_SSD_v1_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|19.473|19.306|1.01|
|MobileNet_SSD_v2_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|22.874|23.404|0.98|
|OpenFace::DNNTestNetwork::OCV/OCL_FP16|9.568|9.517|1.01|
|OpenPose_pose_mpi_faster_4_stages::DNNTestNetwork::OCV/OCL_FP16|539.899|539.845|1.00|
|PPHumanSeg::DNNTestNetwork::OCV/OCL_FP16|18.015|18.769|0.96|
|PPOCRv3::DNNTestNetwork::OCV/OCL_FP16|63.122|63.540|0.99|
|ResNet_50::DNNTestNetwork::OCV/OCL_FP16|34.947|34.925|1.00|
|SFace::DNNTestNetwork::OCV/OCL_FP16|10.249|10.206|1.00|
|SSD::DNNTestNetwork::OCV/OCL_FP16|213.068|213.108|1.00|
|SqueezeNet_v1_1::DNNTestNetwork::OCV/OCL_FP16|4.867|4.878|1.00|
|VIT_B_32::DNNTestNetwork::OCV/OCL_FP16|200.563|190.788|1.05|
|VitTrack::DNNTestNetwork::OCV/OCL_FP16|7.528|7.173|1.05|
|YOLOX::DNNTestNetwork::OCV/OCL_FP16|132.858|132.701|1.00|
|YOLOv3::DNNTestNetwork::OCV/OCL_FP16|209.559|208.809|1.00|
|YOLOv4::DNNTestNetwork::OCV/OCL_FP16|221.357|220.924|1.00|
|YOLOv4_tiny::DNNTestNetwork::OCV/OCL_FP16|24.446|24.382|1.00|
|YOLOv5::DNNTestNetwork::OCV/OCL_FP16|43.922|44.080|1.00|
|YOLOv8::DNNTestNetwork::OCV/OCL_FP16|64.159|63.842|1.00|
|YuNet::DNNTestNetwork::OCV/OCL_FP16|10.177|10.231|0.99|
|opencv_face_detector::DNNTestNetwork::OCV/OCL_FP16|15.121|15.445|0.98|
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2024-01-26 16:34:17 +03:00
Yuantao Feng	37156a4719	Merge pull request #24925 from fengyuentau:loongarch_handle_warnings Handle warnings in loongson-related code #24925 See https://github.com/fengyuentau/opencv/actions/runs/7665377694/job/20891162958#step:14:16 Warnings need to be handled before we add the loongson server to our CI. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-26 13:38:00 +03:00
Alexander Smorkalov	decf6538a2	Merge branch 4.x	2024-01-23 17:06:52 +03:00
Alexander Smorkalov	d6424233f0	Merge pull request #24906 from Abdurrahheem:ash/fix_einsum_inner Einsum Layer Inner Product Issue Solution	2024-01-23 09:26:22 +03:00
Abduragim	0e6b7f1656	fix 1D handling issue in inner product	2024-01-22 20:10:34 +04:00
Alexander Smorkalov	775210e701	Relax test requirements for OpenCL in test DNNTestNetwork.FastNeuralStyle_eccv16.	2024-01-22 17:11:41 +03:00
Alexander Smorkalov	c739117a7c	Merge branch 4.x	2024-01-19 17:32:22 +03:00
Sean McBride	e64857c561	Merge pull request #23736 from seanm:c++11-simplifications Removed all pre-C++11 code, workarounds, and branches #23736 This removes a bunch of pre-C++11 workarounds that are no longer necessary as C++11 is now required. It is a nice clean up and simplification. * No longer unconditionally #include <array> in cvdef.h, include explicitly where needed * Removed deprecated CV_NODISCARD, already unused in the codebase * Removed some pre-C++11 workarounds, and simplified some backwards compat defines * Removed CV_CXX_STD_ARRAY * Removed CV_CXX_MOVE_SEMANTICS and CV_CXX_MOVE * Removed all tests of CV_CXX11, now assume it's always true. This allowed removing a lot of dead code. * Updated some documentation consequently. * Removed all tests of CV_CXX11, now assume it's always true * Fixed links. --------- Co-authored-by: Maksim Shabunin <maksim.shabunin@gmail.com> Co-authored-by: Alexander Smorkalov <alexander.smorkalov@xperience.ai>	2024-01-19 16:53:08 +03:00
fengyuentau	d269de0a03	initial commit	2024-01-18 11:17:50 +08:00
Alexander Smorkalov	d1e4bd8543	Merge pull request #24809 from Abdurrahheem:ash/yolo-nas-test Added test for YOLO NAS	2024-01-17 20:36:15 +03:00
Alexander Smorkalov	ac4c0bffac	Merge pull request #24813 from fengyuentau:speedup_scatter dnn: improve scatter and scatterND speed with multi-threading	2024-01-17 17:16:50 +03:00
Abduragim	d30bf1bc3c	added test for yolo nas	2024-01-17 13:01:43 +03:00
Alexander Smorkalov	84bb1cda4e	Merge pull request #24865 from asmorkalov:as/dnn_concat_assert Normalize axis parameter in DNN Concat to handle negative values	2024-01-16 14:39:28 +03:00
Alexander Smorkalov	26cf82a56c	Normalize axis parameter in DNN Concat to handle negative values.	2024-01-16 12:22:22 +03:00
Alexander Smorkalov	99c86bb40c	Merge pull request #24556 from plctlab:rvp Optimization based on RISC-V P Packed SIMD Extension v0.5.2	2024-01-16 11:36:31 +03:00
Alexander Smorkalov	68dc02e302	Merge pull request #24858 from Dhanwanth1803:avx-fix Use AVX2 overload instead of AVX in AVX2 scope	2024-01-16 09:14:31 +03:00
Dhanwanth1803	a289eba357	Fixes #24677	2024-01-13 09:56:56 +05:30
Stefan Dragnev	2791bb7062	Merge pull request #24773 from tailsu:sd/pathlike python: accept path-like objects wherever file names are expected #24773 Merry Christmas, all 🎄 Implements #15731 Support is enabled for all arguments named `filename` or `filepath` (case-insensitive), or annotated with `CV_WRAP_FILE_PATH`. Support is based on `PyOS_FSPath`, which is available in Python 3.6+. When running on older Python versions the arguments must have a `str` value as before. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-12 16:23:05 +03:00
jimmylaw21	a7fa1e6f4b	Merge pull request #24610 from jimmylaw21:dnn-onnx-add-group-norm-layer dnn onnx: add group norm layer #24610 dnn onnx: add group norm layer Todo: - [x] speed up by multi-threading - [x] add perf - [x] add backend: OpenVINO - [x] add backend: CUDA - [x] add backend: OpenCL (no fp16) - [ ] add backend: CANN ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>	2024-01-12 15:13:26 +03:00
Alexander Smorkalov	97c418ab86	Merge pull request #24840 from fengyuentau:ocl_innerproduct dnn (opencl): integrate bias handling in the inner product opencl kernel	2024-01-12 15:10:16 +03:00
Abduragim Shtanchaev	c923c59833	Merge pull request #24812 from Abdurrahheem:ash/einsum_bachedGemm Replace iterative batched Matrix Multiply. #24812 This PR replaces iterative batch matrix multiplication with `FastGemmBatch` in the Einsum layer. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-12 14:23:43 +03:00
Yuantao Feng	e7ccff9805	Merge pull request #24834 from fengyuentau:cuda_naryeltwise_broadcast dnn (cuda): support broadcasting if a.rank() != b.rank() #24834 Inspired by https://github.com/opencv/opencv/pull/24786. This PR keeps the fusion of `NaryEltwise` and `Concat` while addressing the missing-data problem by supporting broadcasting if a.rank() != b.rank(). Resolves https://github.com/opencv/opencv/issues/23977 Resolves https://github.com/opencv/opencv/issues/24606 Resolves https://github.com/opencv/opencv/issues/24635 Resolves https://github.com/opencv/opencv/issues/24721 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-11 10:04:46 +03:00
fengyuentau	83acb656f1	integrate bias handling in ocl kernel	2024-01-11 11:15:17 +08:00
Yuantao Feng	7fb336322d	Merge pull request #24808 from fengyuentau:fix_layernorm dnn: no layer norm fusion if axes.back() is not the axis of last dimension #24808 Merge with https://github.com/opencv/opencv_extra/pull/1137 Resolves https://github.com/opencv/opencv/issues/24797 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-10 13:01:00 +03:00
Yuantao Feng	c955564cb3	Merge pull request #24765 from fengyuentau:mod_operator dnn onnx: add mod #24765 Resolves https://github.com/opencv/opencv/issues/23174 TODO: - [x] enable some conformance tests - [x] add backends - [x] CANN - [x] OpenVINO - [x] CUDA ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-09 19:00:17 +03:00
Abduragim	6c28d7140a	1d support for einsum	2024-01-08 21:34:47 +03:00
fengyuentau	13127365e2	better comment	2024-01-08 11:55:06 +08:00
Yuantao Feng	b7d70613e4	fix failed assertion in debug build	2024-01-05 18:33:01 +00:00
fengyuentau	2ed97b9ef3	multi-threaded scatterND and refactor perf	2024-01-05 18:15:59 +08:00
fengyuentau	2997b4c5fe	pretty format	2024-01-05 18:15:27 +08:00
fengyuentau	63cde0b90d	multi-threaded scatter and refactor perf	2024-01-05 17:24:09 +08:00
Abduragim Shtanchaev	3b26e183cb	changed weights of yolov7	2023-12-28 23:03:47 +03:00
cudawarped	7d681cf80d	build: first class cuda support	2023-12-26 09:39:18 +03:00
Alexander Smorkalov	62f1a7410d	Merge pull request #24766 from asmorkalov:update_version_4.9.0-pre pre: OpenCV 4.9.0 (version++)	2023-12-25 16:04:53 +03:00
Alexander Smorkalov	b407c58b96	pre: OpenCV 4.9.0 (version++).	2023-12-25 15:20:10 +03:00
Yuantao Feng	f978c99523	Merge pull request #24753 from fengyuentau:einsum_importer dnn onnx: support constant inputs in einsum importer #24753 Merge with https://github.com/opencv/opencv_extra/pull/1132. Resolves https://github.com/opencv/opencv/issues/24697 Credits to @LaurentBerger. --- This is a workaround. I suggest getting input shapes and calculating the output shapes in `getMemoryShapes` so as to keep the best compatibility. Getting shapes during the importer stage is not always robust and we should avoid that as much as possible. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-12-25 14:42:05 +03:00
Alexander Alekhin	f49b26182b	dnn(test): skip very long debug tests, reduce test time	2023-12-25 08:44:06 +00:00
Alexander Alekhin	96b894e0e1	Merge pull request #24761 from opencv-pushbot:gitee/alalek/test_skip_update_win32	2023-12-25 08:27:30 +00:00
Alexander Alekhin	f8502d45f9	dnn(test): skip tests on 32-bit Windows	2023-12-25 07:23:45 +00:00
Alexander Smorkalov	953dddd26b	Merge pull request #24747 from asmorkalov:as/tune_vitb_cuda Increase Vit_b test threshold a bit for CUDA FP16.	2023-12-22 17:04:46 +03:00
Dmitry Kurtaev	938bc4d503	[CUDA] Hotfix Scale with 1 parameter	2023-12-22 15:49:27 +03:00
Dhanwanth1803	027aee8ad4	Merge pull request #24384 from Dhanwanth1803:feat-crop Fixes #22747. Support [crop] configuration for DarkNet #24384 Request for comments. This is my first PR. Merge with extra: https://github.com/opencv/opencv_extra/pull/1112 resolves https://github.com/opencv/opencv/issues/22747 - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2023-12-22 14:55:01 +03:00
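
Note on 26cf82a56c ("Normalize axis parameter in DNN Concat to handle negative values"): the log does not show the patch itself, so the following is only a minimal Python sketch of what axis normalization commonly means, assuming the usual convention that a negative axis counts back from the last dimension. `normalize_axis` is a hypothetical helper written for illustration, not the code from the commit.

```python
def normalize_axis(axis: int, ndims: int) -> int:
    """Map an axis in [-ndims, ndims) to its non-negative equivalent.

    Hypothetical helper for illustration; not taken from the OpenCV patch.
    """
    if not -ndims <= axis < ndims:
        raise ValueError(f"axis {axis} is out of range for {ndims} dimensions")
    # A negative axis is an offset from the end: -1 is the last dimension.
    return axis + ndims if axis < 0 else axis


if __name__ == "__main__":
    # For a 4-D blob, axis -1 and axis 3 name the same dimension.
    print(normalize_axis(-1, 4))
    print(normalize_axis(1, 4))
```

Normalizing once, before any shape checks, lets the rest of a layer assume a non-negative index, which is presumably why a failed assertion on negative input would disappear after such a change.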


2276 Commits