opencv/modules/dnn/src/cuda
jimmylaw21 a7fa1e6f4b
Merge pull request #24610 from jimmylaw21:dnn-onnx-add-group-norm-layer
dnn onnx: add group norm layer #24610

Todo:

- [x] speed up by multi-threading
- [x] add perf
- [x] add backend: OpenVINO
- [x] add backend: CUDA
- [x] add backend: OpenCL (no fp16)
- [ ] add backend: CANN

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on code under GPL or another license that is incompatible with OpenCV.
- [x] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [x] There are accuracy tests, performance tests, and test data in the opencv_extra repository, if applicable.
      The patch to opencv_extra has the same branch name.
- [x] The feature is well documented and sample code can be built with the project CMake

Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>
2024-01-12 15:13:26 +03:00
activation_eltwise.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
activations.cu Merge pull request #24080 from dkurt:dnn_cuda_layers 2023-08-03 09:13:42 +03:00
array.hpp Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low 2019-10-21 14:28:00 +03:00
atomics.hpp Merge pull request #17534 from YashasSamaga:cuda4dnn-remove-unused-funcs 2020-06-17 09:07:52 +00:00
bbox_utils.hpp add DetectionOutputOp 2020-07-29 12:28:00 +05:30
bias_activation_eltwise.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
bias_activation.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
bias_eltwise_activation.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
block_stride_range.hpp add DetectionOutputOp 2020-07-29 12:28:00 +05:30
concat.cu dnn: fix various dnn related typos 2022-03-23 18:12:12 -04:00
crop_and_resize.cu Merge pull request #17200 from YashasSamaga:cuda4dnn-general-opt1 2020-05-09 17:20:30 +00:00
detection_output.cu fix compile-time errors, disable unsupported tests 2020-08-09 14:43:20 +05:30
eltwise_activation.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
eltwise_ops.cu Merge pull request #24834 from fengyuentau:cuda_naryeltwise_broadcast 2024-01-11 10:04:46 +03:00
execution.hpp Merge pull request #16658 from YashasSamaga:cuda4dnn-refactor-activations 2020-02-29 11:46:14 +03:00
fill_copy.cu add DetectionOutputOp 2020-07-29 12:28:00 +05:30
fp_conversion.cu perform fp conversions on GPU 2019-12-30 00:05:39 +05:30
functors.hpp Merge pull request #24765 from fengyuentau:mod_operator 2024-01-09 19:00:17 +03:00
grid_nms.cu Merge pull request #20933 from shubham-shahh:master 2021-11-28 12:54:29 +00:00
grid_stride_range.hpp Merge pull request #20933 from shubham-shahh:master 2021-11-28 12:54:29 +00:00
index_helpers.hpp add DetectionOutputOp 2020-07-29 12:28:00 +05:30
kernel_dispatcher.hpp dnn: fix various dnn related typos 2022-03-23 18:12:12 -04:00
limits.hpp Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus 2020-01-15 18:28:37 +03:00
math.hpp Merge pull request #24765 from fengyuentau:mod_operator 2024-01-09 19:00:17 +03:00
max_unpooling.cu Merge pull request #19058 from sl-sergei:cuda_1d 2021-01-21 22:16:56 +00:00
memory.hpp Merge pull request #17200 from YashasSamaga:cuda4dnn-general-opt1 2020-05-09 17:20:30 +00:00
mvn.cu Merge pull request #24610 from jimmylaw21:dnn-onnx-add-group-norm-layer 2024-01-12 15:13:26 +03:00
normalize.cu Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus 2020-01-15 18:28:37 +03:00
padding.cu Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus 2020-01-15 18:28:37 +03:00
permute.cu dnn: fix various dnn related typos 2022-03-23 18:12:12 -04:00
prior_box.cu Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus 2020-01-15 18:28:37 +03:00
region.cu backport YOLOv4x-mish new_coords CUDA implementation 2021-10-08 14:14:49 +03:00
resize.cu add half pixel centers and align corners param 2020-12-27 15:05:39 +05:30
roi_pooling.cu Merge pull request #17200 from YashasSamaga:cuda4dnn-general-opt1 2020-05-09 17:20:30 +00:00
scale_shift.cu Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2 2020-07-09 16:02:21 +03:00
shortcut.cu FIX: disable dnn cuda input_shortcut on _half for CC<5.3 2020-01-17 14:21:25 +01:00
slice.cu dnn: fix various dnn related typos 2022-03-23 18:12:12 -04:00
types.hpp Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low 2019-10-21 14:28:00 +03:00
vector_traits.hpp Merge pull request #17200 from YashasSamaga:cuda4dnn-general-opt1 2020-05-09 17:20:30 +00:00