opencv/modules/dnn/src
Aliaksei Urbanski 35ca2f78d6
Merge pull request #25880 from Jamim:fix/cuda-no-fp16
Fix CUDA for old GPUs without FP16 support #25880

Fixes #21461

~This is a build-time solution that reflects https://github.com/opencv/opencv/blob/4.10.0/modules/dnn/src/cuda4dnn/init.hpp#L68-L82.~
~We shouldn't add an invalid target while building with `CUDA_ARCH_BIN` < 53.~
_(please see [this discussion](https://github.com/opencv/opencv/pull/25880#discussion_r1668074505))_

This is a run-time solution that basically reverts [these lines](d0fe6ad109 (diff-757c5ab6ddf2f99cdd09f851e3cf17abff203aff4107d908c7ad3d0466f39604L245-R245)).

I've debugged these changes, [coupled with other fixes](https://github.com/gentoo/gentoo/pull/37479), on [Gentoo Linux](https://www.gentoo.org/) and [related tests passed](https://github.com/user-attachments/files/16135391/opencv-4.10.0.20240708-224733.log.gz) on my laptop with `GeForce GTX 960M`.
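
For readers unfamiliar with the issue, a run-time check of this kind can be sketched as follows. This is not the code from the PR. It is a minimal, self-contained illustration that assumes the FP16 threshold is compute capability 5.3, matching the `CUDA_ARCH_BIN` < 53 condition mentioned above. `Target`, `supportsFp16`, and `resolveTarget` are hypothetical names standing in for the real DNN target constants and helpers.

```cpp
// Hypothetical stand-ins for the CUDA and CUDA FP16 DNN targets.
enum class Target { Cuda, CudaFp16 };

// Assumption: FP16 arithmetic needs compute capability 5.3 or newer,
// encoded as major * 10 + minor >= 53.
constexpr bool supportsFp16(int major, int minor) {
    return major * 10 + minor >= 53;
}

// Decide at run time which target to use: if FP16 was requested but the
// device cannot do it, fall back to the FP32 CUDA target instead of failing.
constexpr Target resolveTarget(Target requested, int major, int minor) {
    return (requested == Target::CudaFp16 && !supportsFp16(major, minor))
               ? Target::Cuda
               : requested;
}
```

With this sketch, a device reporting compute capability 5.0 (as the GTX 960M is generally listed) that requests `Target::CudaFp16` ends up on `Target::Cuda`, while a 5.3 or newer device keeps the FP16 target.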

Alternative solution:
  - #21462

_Best regards!_

### Pull Request Readiness Checklist

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [x] There is a reference to the original bug report and related work
- [ ] `n/a` There is accuracy test, performance test and test data in opencv_extra repository, if applicable
- [ ] `n/a` The feature is well documented and sample code can be built with the project CMake
2024-07-10 12:39:30 +03:00
caffe Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage 2024-01-26 16:34:17 +03:00
cuda Merge pull request #25630 from fengyuentau:nary-multi-thread 2024-07-03 10:09:05 +03:00
cuda4dnn Merge pull request #25880 from Jamim:fix/cuda-no-fp16 2024-07-10 12:39:30 +03:00
darknet Merge pull request #24384 from Dhanwanth1803:feat-crop 2023-12-22 14:55:01 +03:00
int8layers Merge pull request #25230 from hanliutong/rvv-conv 2024-03-31 16:47:06 +03:00
layers Merge pull request #25147 from fengyuentau:dnn/elementwise_layers/speedup 2024-07-08 14:24:36 +03:00
ocl4dnn Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage 2024-01-26 16:34:17 +03:00
onnx Merge pull request #25630 from fengyuentau:nary-multi-thread 2024-07-03 10:09:05 +03:00
opencl Merge pull request #25147 from fengyuentau:dnn/elementwise_layers/speedup 2024-07-08 14:24:36 +03:00
tensorflow Merge pull request #25686 from Kumataro:fix25674 2024-06-02 14:14:04 +03:00
tflite Merge pull request #25613 from CNOCycle:tflite/ops 2024-05-31 19:31:21 +03:00
torch Merge pull request #25686 from Kumataro:fix25674 2024-06-02 14:14:04 +03:00
vkcom Partially back-port #25075 to 4.x 2024-03-05 12:15:39 +03:00
webnn Merge pull request #20406 from MarkGHX:gsoc_2021_webnn 2021-11-23 21:15:31 +00:00
backend.cpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
backend.hpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
debug_utils.cpp fix model diagnostic tool 2022-01-18 01:22:22 +03:00
dnn_common.hpp speed up vulkan dnn, and support ios and apple m1 chip. (#23349) 2023-05-18 20:02:27 +03:00
dnn_params.cpp cmake: revise OPENCV_DNN_BACKEND_DEFAULT integration 2023-09-10 13:11:36 +00:00
dnn_read.cpp Migrate Android Face Detection sample to DNN. 2023-11-29 11:02:44 +03:00
dnn_utils.cpp Partially back-port #25075 to 4.x 2024-03-05 12:15:39 +03:00
dnn.cpp dnn: fix index access 2022-03-19 06:54:07 +00:00
factory.hpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
graph_simplifier.cpp Merge pull request #24577 from dkurt:dnn_graph_match_stack 2023-11-24 10:40:32 +03:00
graph_simplifier.hpp Merge pull request #24483 from dkurt:dnn_fusion_commutative_ops 2023-11-08 16:26:33 +03:00
halide_scheduler.cpp Merge pull request #22656 from dkurt:halide_fixes 2022-10-21 17:49:49 +03:00
halide_scheduler.hpp
ie_ngraph.cpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
ie_ngraph.hpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
init.cpp Merge pull request #25779 from fengyuentau:dnn/fix_onnx_depthtospace 2024-06-21 19:28:22 +03:00
layer_factory.cpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
layer_internals.hpp Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage 2024-01-26 16:34:17 +03:00
layer.cpp Merge pull request #24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage 2024-01-26 16:34:17 +03:00
legacy_backend.cpp Merge pull request #23109 from seanm:misc-warnings 2023-10-06 13:33:21 +03:00
legacy_backend.hpp dnn: split dnn.cpp code 2022-03-08 19:22:46 +00:00
math_utils.hpp Implement ctc prefix beam search decode for TextRecognitionModel. 2021-08-12 20:33:31 +08:00
model.cpp change fcn8s-heavy-pascal tests from caffe to onnx 2024-05-03 00:15:09 +08:00
net_cann.cpp Merge pull request #23936 from SaltFish-T:4.x 2023-07-27 14:21:30 +03:00
net_impl_backend.cpp Merge pull request #25880 from Jamim:fix/cuda-no-fp16 2024-07-10 12:39:30 +03:00
net_impl_fuse.cpp Merge pull request #24834 from fengyuentau:cuda_naryeltwise_broadcast 2024-01-11 10:04:46 +03:00
net_impl.cpp Merge pull request #25582 from fengyuentau:dnn/dump_pbtxt 2024-05-17 11:07:05 +03:00
net_impl.hpp Merge pull request #25582 from fengyuentau:dnn/dump_pbtxt 2024-05-17 11:07:05 +03:00
net_openvino.cpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
net_quantization.cpp add enableWinograd API for Net. 2022-10-09 09:33:07 +08:00
net.cpp Merge pull request #25582 from fengyuentau:dnn/dump_pbtxt 2024-05-17 11:07:05 +03:00
nms.cpp batched nms impl 2022-11-29 15:32:34 +08:00
nms.inl.hpp boost NMS performance 2021-03-10 15:59:26 +00:00
op_cann.cpp Merge pull request #23319 from fengyuentau:fix_zoo_issue_136 2023-03-13 21:46:33 +03:00
op_cann.hpp Merge pull request #23936 from SaltFish-T:4.x 2023-07-27 14:21:30 +03:00
op_cuda.cpp Let part of the operators in nary_eltwise support cuda 2022-11-02 14:08:21 +08:00
op_cuda.hpp transfer output blobs in background 2020-07-04 12:55:12 +05:30
op_halide.cpp Merge pull request #24167 from autoantwort:missing-include 2023-08-17 09:34:19 +00:00
op_halide.hpp
op_inf_engine.cpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
op_inf_engine.hpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
op_timvx.cpp Merge pull request #21036 from fengyuentau:timvx_backend_support 2022-03-31 21:42:11 +00:00
op_timvx.hpp Merge pull request #21036 from fengyuentau:timvx_backend_support 2022-03-31 21:42:11 +00:00
op_vkcom.cpp speed up vulkan dnn, and support ios and apple m1 chip. (#23349) 2023-05-18 20:02:27 +03:00
op_vkcom.hpp speed up vulkan dnn, and support ios and apple m1 chip. (#23349) 2023-05-18 20:02:27 +03:00
op_webnn.cpp dnn: split dnn.cpp code 2022-03-08 19:22:46 +00:00
op_webnn.hpp Fix for OpenVINO 2024.0 2024-03-18 15:05:50 +04:00
plugin_api.hpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
plugin_wrapper.impl.hpp dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
precomp.hpp speed up vulkan dnn, and support ios and apple m1 chip. (#23349) 2023-05-18 20:02:27 +03:00
registry.cpp Merge pull request #25880 from Jamim:fix/cuda-no-fp16 2024-07-10 12:39:30 +03:00