opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-17 02:48:01 +08:00

Author	SHA1	Message	Date
Yuantao Feng	55d7e3f8cc	Merge pull request #1165 from fengyuentau:gold_yolo [BugFix] dnn (ONNX): Foce dropping constant inputs in parseClip if they are shared #25319 Resolves https://github.com/opencv/opencv/issues/25278 Merge with https://github.com/opencv/opencv_extra/pull/1165 In Gold-YOLO ,`Div` has a constant input `B=6` which is then parsed into a `Const` layer in the ONNX importer, but `Clip` also has the shared constant input `max=6` which is already a `Const` layer and then connected to `Elementwise` layer. This should not happen because in the `forward()` of `Elementwise` layer, the legacy code goes through and apply activation to each input. More details on https://github.com/opencv/opencv/issues/25278#issuecomment-2032199630. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-04-03 15:56:59 +03:00
Dmitry Kurtaev	13c95efa74	Merge pull request #25312 from dkurt:dnn_hotfix_tflite Ownership check in TFLite importer #25312 ### Pull Request Readiness Checklist resolves https://github.com/opencv/opencv/issues/25310 See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-04-03 09:41:40 +03:00
HAN Liutong	eba158fb0c	Merge pull request #25230 from hanliutong/rvv-conv Optimize int8 layers in DNN modules by using RISC-V Vector intrinsic. #25230 This patch optimize 3 functions in the int8 layer by using RVV Native Intrinsic. This patch was tested on QEMU using VLEN=128 and VLEN=256 on `./bin/opencv_test_dnn --gtest_filter="Int8"`; On the real device (k230, VLEN=128), `EfficientDet_int8` in `opencv_perf_dnn` showed a performance improvement of 1.46x. \| Name of Test \| Original \| optimized \| Speed-up \| \| ------------------------------------------ \| -------- \| ---------- \| -------- \| \| EfficientDet_int8::DNNTestNetwork::OCV/CPU \| 2843.467 \| 1947.013 \| 1.46 \| ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [ ] I agree to contribute to the project under Apache 2 License. - [ ] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-03-31 16:47:06 +03:00
Yuantao Feng	b758897c29	Merge pull request #25271 from fengyuentau:matmul_bias Merge with https://github.com/opencv/opencv_extra/pull/1158 Todo: - [x] Fix Attention pattern recognition. - [x] Handle other backends. Benchmark: "VIT_B_32 OCV/CPU", M1, results in milliseconds. \| Model \| 4.x \| This PR \| \| - \| - \| - \| \| VIT_B_32 OCV/CPU \| 87.66 \| 83.83 \| ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-03-29 17:35:23 +03:00
Alexander Smorkalov	9fc4b61074	Merge pull request #25291 from dkurt:einsum_openvino Einsum OpenVINO backend	2024-03-29 15:54:26 +03:00
Dmitry Kurtaev	cfa42e4338	Einsum OpenVINO backend	2024-03-29 14:29:45 +03:00
Dmitry Kurtaev	01dc010436	Merge pull request #25273 from dkurt:tflite_new_layers TFLite new layers #25273 ### Pull Request Readiness Checklist resolves https://github.com/opencv/opencv/issues/25272, https://github.com/opencv/opencv/issues/24965 Merge with extra: https://github.com/opencv/opencv_extra/pull/1160 See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-03-29 11:21:13 +03:00
Yuantao Feng	accf200408	Merge pull request #25238 from fengyuentau:optimized_const dnn: avoid const layer forwarding in layer norm layer and attention layer #25238 While profiling ViTs with dnn, I found `ConstLayer` can take a proportion of the inference time, which is weird. This comes from the data copy during the inference of `ConstLayer`. There is a chance that we can improve the efficiency of data copying but the easiest and most convenient way is to avoid `ConstLayer`. This PR change the way how we handle constants in layer normalization layer and attention layer, which is storing in the layer blobs instead of making constant layers for them. Checklists: - [x] Backend compatibility in layer normalization layer. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-03-26 15:09:51 +03:00
Alexander Smorkalov	fc34554475	Merge pull request #25184 from dkurt:avoid_extra_memset Avoid extra memset	2024-03-25 13:07:49 +03:00
Yuantao Feng	025e7602b9	Merge pull request #25166 from fengyuentau:fix_cann_gemm dnn (CANN): Fix incorrect shape of 1d bias in Gemm #25166 Gemm layer was refactored some time ago. Users found that the mobilenet example in https://github.com/opencv/opencv/wiki/Huawei-CANN-Backend does not work because of incorrect shape set for 1d bias in Gemm. This PR resolves this issue. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-03-25 09:47:28 +03:00
Dmitry Kurtaev	0b6c9a2123	Merge pull request #25181 from dkurt:release_conv_weights Release convolution weightsMat after usage #25181 ### Pull Request Readiness Checklist related (but not resolved): https://github.com/opencv/opencv/issues/24134 Minor memory footprint improvement. Also, adds a test for VmHWM. RAM top memory usage (-230MB) \| YOLOv3 (237MB file) \| 4.x \| PR \| \|---------------------\|---------\|---------\| \| no winograd \| 808 MB \| 581 MB \| \| winograd \| 1985 MB \| 1750 MB \| See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-03-25 09:03:28 +03:00
Oleg Pipikin	6da2ddcf0e	Fix for OpenVINO 2024.0 Remove support OpenVINO lower than 2022.1 release Remove legacy InferenceEngine wrappers	2024-03-18 15:05:50 +04:00
Dmitry Kurtaev	6a370ba9e7	Avoid extra memset in convolution initialization	2024-03-08 10:46:07 +03:00
Dmitry Kurtaev	98aed21dd4	Avoid copy of ONNX graph during import	2024-03-05 18:22:46 +03:00
Alexander Smorkalov	daa8f7dfc6	Partially back-port #25075 to 4.x	2024-03-05 12:15:39 +03:00
Laurent Berger	5fe3933346	Merge pull request #25120 from LaurentBerger:I25103 Fixed ReduceMean layer behaviour #25120 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake `a93c31e3c9/onnxruntime/core/providers/cpu/reduction/reduction_ops.cc (L433-L443)`	2024-03-04 09:36:53 +03:00
CSBVision	e8582f2cf8	Update net_impl.cpp See issue #25112	2024-03-01 14:56:00 +01:00
Yuantao Feng	5aa5c39210	Merge pull request #25076 from fengyuentau:improve_attention dnn: try improving performance of Attention layer #25076 Checklist: - [x] Use `Mat` over `Mat::zeros` for temporary buffer in forward - [x] Use layer internal buffer over temporary Mat buffer - [x] Try a single fastGemmBatch on the Q/K/V calculation Performance: Performance test case is `Layer_Attention.VisionTransformer/0`, which has input of shape {1, 197, 768}, weight of shape {768, 2304} and bias {2304}. Data is in millisecond. \| \| macOS 14.2.1, Apple M1 \| Ubuntu 22.04.2, Intel i7 12700K \| \| - \| - \| - \| \| Current \| 10.96 \| 1.58 \| \| w/ Mat \| 6.27 \| 1.41 \| \| w/ Internals \| 5.87 \| 1.38 \| \| w/ fastGemmBatch \| 6.12 \| 2.14 \| ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-02-28 16:47:08 +03:00
Laurent Berger	3c712cf77d	Merge pull request #25100 from LaurentBerger:I25077 Fix issue #25077 #25100 Fixes https://github.com/opencv/opencv/issues/25077 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-02-27 14:15:11 +03:00
Dhanwanth1803	12aa0fe898	Merge pull request #24985 from Dhanwanth1803:hardswish Fixes #24974 support HardSwishInt8 #24985 As given very clearly in the issue #24974 I made the required 2 changes to implement HardSwish Layer in INT8. Requesting comments. resolves https://github.com/opencv/opencv/issues/24974 - [X] I agree to contribute to the project under Apache 2 License. - [X] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [X] The PR is proposed to the proper branch - [X] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake Co-authored-by: Dhanwanth1803 <dhanwanthvarala@gmail,com>	2024-02-16 18:19:29 +03:00
Alexander Smorkalov	4b35b2f968	Merge pull request #24973 from asmorkalov:as/fix_weigths_proto_mess Fix proto and weights mess in dnn performance tests	2024-02-07 11:10:32 +03:00
Alexander Smorkalov	77af137285	Fix proto and weights mess in dnn performance tests.	2024-02-07 09:16:09 +03:00
fengyuentau	fcaa8ce3c2	fix incorrect steps and elemsize when dtype changes	2024-02-06 16:27:25 +08:00
Haosonn	87f749277d	Merge pull request #24768 from Haosonn:pre-pr-2 Vulkan backend for NaryEltwiseLayer in DNN module #24768 We improve Vulkan backend for ``NaryEltwiseLayer`` in DNN module by: - add a basic framework for Vulkan backend in ``NaryEltwiseLayer`` - add a compute shader for binary forwarding (an imitation of what has been done in native OpenCV backend including broadcasting and eltwise-operation) - typo fixed: - Wrong info output in ``context.cpp`` Currently, our implementation (or all layers supporting Vulkan backend) runs pretty slow on discrete GPUs basically due to IO cost in function ``copyToHost``, and we are going to fix that by - find out the best ``VkMemoryProperty`` for various discrete GPUs - prevent ``copyToHost`` in middle layers during forwarding, (i.e keep data in GPU memory) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake Co-authored-by: IskXCr <IskXCr@outlook.com>	2024-01-29 18:41:49 +03:00
Alexander Alekhin	efc9837df1	Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage DNN: avoid CV_16S usage for FP16 #24892 Merge after: #24918 TODO: - [x] measure performance changes - [x] optimize convertTo for OpenCL: #24918 12700K iGPU: \|Name of Test\|0\|1\|1 vs 0 (x-factor)\| \|---\|:-:\|:-:\|:-:\| \|AlexNet::DNNTestNetwork::OCV/OCL_FP16\|7.441\|7.480\|0.99\| \|CRNN::DNNTestNetwork::OCV/OCL_FP16\|10.776\|10.736\|1.00\| \|DenseNet_121::DNNTestNetwork::OCV/OCL_FP16\|52.762\|52.833\|1.00\| \|EAST_text_detection::DNNTestNetwork::OCV/OCL_FP16\|60.694\|60.721\|1.00\| \|EfficientNet::DNNTestNetwork::OCV/OCL_FP16\|33.373\|33.173\|1.01\| \|FastNeuralStyle_eccv16::DNNTestNetwork::OCV/OCL_FP16\|81.840\|81.724\|1.00\| \|GoogLeNet::DNNTestNetwork::OCV/OCL_FP16\|20.965\|20.927\|1.00\| \|Inception_5h::DNNTestNetwork::OCV/OCL_FP16\|22.204\|22.173\|1.00\| \|Inception_v2_SSD_TensorFlow::DNNTestNetwork::OCV/OCL_FP16\|47.115\|47.460\|0.99\| \|MPHand::DNNTestNetwork::OCV/OCL_FP16\|6.760\|6.670\|1.01\| \|MPPalm::DNNTestNetwork::OCV/OCL_FP16\|10.188\|10.171\|1.00\| \|MPPose::DNNTestNetwork::OCV/OCL_FP16\|12.510\|12.561\|1.00\| \|MobileNet_SSD_Caffe::DNNTestNetwork::OCV/OCL_FP16\|17.290\|17.072\|1.01\| \|MobileNet_SSD_v1_TensorFlow::DNNTestNetwork::OCV/OCL_FP16\|19.473\|19.306\|1.01\| \|MobileNet_SSD_v2_TensorFlow::DNNTestNetwork::OCV/OCL_FP16\|22.874\|23.404\|0.98\| \|OpenFace::DNNTestNetwork::OCV/OCL_FP16\|9.568\|9.517\|1.01\| \|OpenPose_pose_mpi_faster_4_stages::DNNTestNetwork::OCV/OCL_FP16\|539.899\|539.845\|1.00\| \|PPHumanSeg::DNNTestNetwork::OCV/OCL_FP16\|18.015\|18.769\|0.96\| \|PPOCRv3::DNNTestNetwork::OCV/OCL_FP16\|63.122\|63.540\|0.99\| \|ResNet_50::DNNTestNetwork::OCV/OCL_FP16\|34.947\|34.925\|1.00\| \|SFace::DNNTestNetwork::OCV/OCL_FP16\|10.249\|10.206\|1.00\| \|SSD::DNNTestNetwork::OCV/OCL_FP16\|213.068\|213.108\|1.00\| \|SqueezeNet_v1_1::DNNTestNetwork::OCV/OCL_FP16\|4.867\|4.878\|1.00\| \|VIT_B_32::DNNTestNetwork::OCV/OCL_FP16\|200.563\|190.788\|1.05\| \|VitTrack::DNNTestNetwork::OCV/OCL_FP16\|7.528\|7.173\|1.05\| \|YOLOX::DNNTestNetwork::OCV/OCL_FP16\|132.858\|132.701\|1.00\| \|YOLOv3::DNNTestNetwork::OCV/OCL_FP16\|209.559\|208.809\|1.00\| \|YOLOv4::DNNTestNetwork::OCV/OCL_FP16\|221.357\|220.924\|1.00\| \|YOLOv4_tiny::DNNTestNetwork::OCV/OCL_FP16\|24.446\|24.382\|1.00\| \|YOLOv5::DNNTestNetwork::OCV/OCL_FP16\|43.922\|44.080\|1.00\| \|YOLOv8::DNNTestNetwork::OCV/OCL_FP16\|64.159\|63.842\|1.00\| \|YuNet::DNNTestNetwork::OCV/OCL_FP16\|10.177\|10.231\|0.99\| \|opencv_face_detector::DNNTestNetwork::OCV/OCL_FP16\|15.121\|15.445\|0.98\| Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2024-01-26 16:34:17 +03:00
Yuantao Feng	37156a4719	Merge pull request #24925 from fengyuentau:loongarch_handle_warnings Handle warnings in loongson-related code #24925 See https://github.com/fengyuentau/opencv/actions/runs/7665377694/job/20891162958#step:14:16 Warnings needs to be handled before we add the loongson server to our CI. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-26 13:38:00 +03:00
Sean McBride	e64857c561	Merge pull request #23736 from seanm:c++11-simplifications Removed all pre-C++11 code, workarounds, and branches #23736 This removes a bunch of pre-C++11 workrarounds that are no longer necessary as C++11 is now required. It is a nice clean up and simplification. * No longer unconditionally #include <array> in cvdef.h, include explicitly where needed * Removed deprecated CV_NODISCARD, already unused in the codebase * Removed some pre-C++11 workarounds, and simplified some backwards compat defines * Removed CV_CXX_STD_ARRAY * Removed CV_CXX_MOVE_SEMANTICS and CV_CXX_MOVE * Removed all tests of CV_CXX11, now assume it's always true. This allowed removing a lot of dead code. * Updated some documentation consequently. * Removed all tests of CV_CXX11, now assume it's always true * Fixed links. --------- Co-authored-by: Maksim Shabunin <maksim.shabunin@gmail.com> Co-authored-by: Alexander Smorkalov <alexander.smorkalov@xperience.ai>	2024-01-19 16:53:08 +03:00
fengyuentau	d269de0a03	initial commit	2024-01-18 11:17:50 +08:00
Alexander Smorkalov	d1e4bd8543	Merge pull request #24809 from Abdurrahheem:ash/yolo-nas-test Added test for YOLO NAS	2024-01-17 20:36:15 +03:00
Alexander Smorkalov	ac4c0bffac	Merge pull request #24813 from fengyuentau:speedup_scatter dnn: improve scatter and scatterND speed with multi-threading	2024-01-17 17:16:50 +03:00
Abduragim	d30bf1bc3c	added test for yolo nas	2024-01-17 13:01:43 +03:00
Alexander Smorkalov	84bb1cda4e	Merge pull request #24865 from asmorkalov:as/dnn_concat_assert Normalize axis parameter in DNN Concat to handle negative values	2024-01-16 14:39:28 +03:00
Alexander Smorkalov	26cf82a56c	Normalize axis parameter in DNN Concat to handle negative values.	2024-01-16 12:22:22 +03:00
Alexander Smorkalov	99c86bb40c	Merge pull request #24556 from plctlab:rvp Optimization based on RISC-V P Packed SIMD Extension v0.5.2	2024-01-16 11:36:31 +03:00
Alexander Smorkalov	68dc02e302	Merge pull request #24858 from Dhanwanth1803:avx-fix Use AVX2 overload instread on AVX in AVX2 scope	2024-01-16 09:14:31 +03:00
Dhanwanth1803	a289eba357	Fixes #24677	2024-01-13 09:56:56 +05:30
Stefan Dragnev	2791bb7062	Merge pull request #24773 from tailsu:sd/pathlike python: accept path-like objects wherever file names are expected #24773 Merry Christmas, all 🎄 Implements #15731 Support is enabled for all arguments named `filename` or `filepath` (case-insensitive), or annotated with `CV_WRAP_FILE_PATH`. Support is based on `PyOS_FSPath`, which is available in Python 3.6+. When running on older Python versions the arguments must have a `str` value as before. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-12 16:23:05 +03:00
jimmylaw21	a7fa1e6f4b	Merge pull request #24610 from jimmylaw21:dnn-onnx-add-group-norm-layer dnn onnx: add group norm layer #24610 dnn onnx: add group norm layer Todo: - [x] speed up by multi-threading - [x] add perf - [x] add backend: OpenVINO - [x] add backend: CUDA - [x] add backend: OpenCL (no fp16) - [ ] add backend: CANN ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>	2024-01-12 15:13:26 +03:00
Alexander Smorkalov	97c418ab86	Merge pull request #24840 from fengyuentau:ocl_innerproduct dnn (opencl): integrate bias handling in the inner product opencl kernel	2024-01-12 15:10:16 +03:00
Abduragim Shtanchaev	c923c59833	Merge pull request #24812 from Abdurrahheem:ash/einsum_bachedGemm Replace interactive batched Matrix Multiply. #24812 This PR replaces iterative batch matrix multiplication which `FastGemmBatch` in Einsum layer. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-12 14:23:43 +03:00
Yuantao Feng	e7ccff9805	Merge pull request #24834 from fengyuentau:cuda_naryeltwise_broadcast dnn (cuda): support broadcasting if a.rank() != b.rank() #24834 Inspired by https://github.com/opencv/opencv/pull/24786. This PR keeps the fusion of `NaryEltwise` and `Concat` while addressed the data missing problem via supporting broadcasting if a.rank() != b.rank(). Resolves https://github.com/opencv/opencv/issues/23977 Resolves https://github.com/opencv/opencv/issues/24606 Resolves https://github.com/opencv/opencv/issues/24635 Resolves https://github.com/opencv/opencv/issues/24721 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-11 10:04:46 +03:00
fengyuentau	83acb656f1	integrate bias handling in ocl kernel	2024-01-11 11:15:17 +08:00
Yuantao Feng	7fb336322d	Merge pull request #24808 from fengyuentau:fix_layernorm dnn: no layer norm fusion if axes.back() is not the axis of last dimension #24808 Merge with https://github.com/opencv/opencv_extra/pull/1137 Resolves https://github.com/opencv/opencv/issues/24797 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2024-01-10 13:01:00 +03:00
Yuantao Feng	c955564cb3	Merge pull request #24765 from fengyuentau:mod_operator dnn onnx: add mod #24765 Resolves https://github.com/opencv/opencv/issues/23174 TODO: - [x] enable some conformance tests - [x] add backends - [x] CANN - [x] OpenVINO - [x] CUDA ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-01-09 19:00:17 +03:00
fengyuentau	13127365e2	better comment	2024-01-08 11:55:06 +08:00
Yuantao Feng	b7d70613e4	fix failed assertion in debug build	2024-01-05 18:33:01 +00:00
fengyuentau	2ed97b9ef3	multi-threaded scatterND and refactor perf	2024-01-05 18:15:59 +08:00
fengyuentau	2997b4c5fe	pretty format	2024-01-05 18:15:27 +08:00
fengyuentau	63cde0b90d	multi-threaded scatter and refactor perf	2024-01-05 17:24:09 +08:00
Abduragim Shtanchaev	3b26e183cb	changed weights of yolov7	2023-12-28 23:03:47 +03:00

1 2 3 4 5 ...

2261 Commits