opencv

mirror of https://github.com/opencv/opencv.git synced 2025-07-23 21:16:58 +08:00

History

Yuantao Feng 8a96e34e33 dnn: add gemm_layer in place of fully_connected_layer for onnx models (#23897 ) * first commit * turned C from input to constant; force C constant in impl; better handling 0d/1d cases * integrate with gemm from ficus nn * fix const inputs * adjust threshold for int8 tryQuantize * adjust threshold for int8 quantized 2 * support batched gemm and matmul; tune threshold for rcnn_ilsvrc13; update googlenet * add gemm perf against innerproduct * add perf tests for innerproduct with bias * fix perf * add memset * renamings for next step * add dedicated perf gemm * add innerproduct in perf_gemm * remove gemm and innerproduct perf tests from perf_layer * add perf cases for vit sizes; prepack constants * remove batched gemm; fix wrong trans; optimize KC * remove prepacking for const A; several fixes for const B prepacking * add todos and gemm expression * add optimized branch for avx/avx2 * trigger build * update macros and signature * update signature * fix macro * fix bugs for neon aarch64 & x64 * add backends: cuda, cann, inf_ngraph and vkcom * fix cuda backend * test commit for cuda * test cuda backend * remove debug message from cuda backend * use cpu dispatcher * fix neon macro undef in dispatcher * fix dispatcher * fix inner kernel for neon aarch64 * fix compiling issue on armv7; try fixing accuracy issue on other platforms * broadcast C with beta multiplied; improve func namings * fix bug for avx and avx2 * put all platform-specific kernels in dispatcher * fix typos * attempt to fix compile issues on x64 * run old gemm when neon, avx, avx2 are all not available; add kernel for armv7 neon * fix typo * quick fix: add macros for pack4 * quick fix: use vmlaq_f32 for armv7 * quick fix for missing macro of fast gemm pack f32 4 * disable conformance tests when optimized branches are not supported * disable perf tests when optimized branches are not supported * decouple cv_try_neon and cv_neon_aarch64 * drop googlenet_2023; add fastGemmBatched * fix step in fastGemmBatched * cpu: fix initialization ofb; gpu: support batch * quick followup fix for cuda * add default kernels * quick followup fix to avoid macro redef * optmized kernels for lasx * resolve mis-alignment; remove comments * tune performance for x64 platform * tune performance for neon aarch64 * tune for armv7 * comment time consuming tests * quick follow-up fix		2023-09-20 00:53:34 +03:00
..
cityscapes_semsegm_test_enet.py	Misc. modules/ typos	2018-02-12 07:09:43 -05:00
imagenet_cls_test_alexnet.py	python: better Python 3 support	2018-05-11 17:32:04 +03:00
imagenet_cls_test_googlenet.py	Misc. modules/ typos	2018-02-12 07:09:43 -05:00
imagenet_cls_test_inception.py	fix 4.x links	2021-12-22 13:24:30 +00:00
npy_blob.cpp	dnn: fix precomp.hpp usage	2018-02-28 17:06:26 +03:00
npy_blob.hpp	dnn: fix precomp.hpp usage	2018-02-28 17:06:26 +03:00
pascal_semsegm_test_fcn.py	Remove references to deprecated NumPy type aliases.	2022-12-23 13:53:49 +03:00
test_backends.cpp	Merge pull request #24120 from dkurt:actualize_dnn_links	2023-08-16 15:46:11 +03:00
test_caffe_importer.cpp	Higher threshold for FasterRCNN_vgg16	2023-09-14 13:11:53 +03:00
test_common.cpp	cmake: fix build of dnn tests with shared common code	2019-03-31 08:52:25 +00:00
test_common.hpp	Merge pull request #22275 from zihaomu:fp16_support_conv	2023-05-17 09:38:33 +03:00
test_common.impl.hpp	Merge pull request #22275 from zihaomu:fp16_support_conv	2023-05-17 09:38:33 +03:00
test_darknet_importer.cpp	Merge pull request #24039 from dkurt:tflite_test_backends	2023-08-04 11:28:51 +03:00
test_googlenet.cpp	Merge pull request #22275 from zihaomu:fp16_support_conv	2023-05-17 09:38:33 +03:00
test_halide_layers.cpp	Merge pull request #24196 from dkurt:ov_backend_cleanups	2023-09-05 18:08:28 +03:00
test_ie_models.cpp	Merge pull request #24072 from dkurt:openvino_cpu_tests	2023-08-02 14:39:11 +03:00
test_int8_layers.cpp	dnn: add gemm_layer in place of fully_connected_layer for onnx models (#23897 )	2023-09-20 00:53:34 +03:00
test_layers.cpp	Merge pull request #24039 from dkurt:tflite_test_backends	2023-08-04 11:28:51 +03:00
test_main.cpp	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-06-26 20:19:04 +00:00
test_misc.cpp	Update dnn_utils.cpp	2023-09-06 10:01:07 +03:00
test_model.cpp	Merge pull request #24120 from dkurt:actualize_dnn_links	2023-08-16 15:46:11 +03:00
test_nms.cpp	batched nms impl	2022-11-29 15:32:34 +08:00
test_onnx_conformance_layer_filter__cuda_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance_layer_filter__halide_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance_layer_filter__openvino.inl.hpp	Merge pull request #24072 from dkurt:openvino_cpu_tests	2023-08-02 14:39:11 +03:00
test_onnx_conformance_layer_filter__vulkan_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance_layer_filter_opencv_all_denylist.inl.hpp	Merge pull request #21865 from rogday:nary_eltwise_layers	2022-07-19 06:14:05 +03:00
test_onnx_conformance_layer_filter_opencv_cpu_denylist.inl.hpp	Merge pull request #21865 from rogday:nary_eltwise_layers	2022-07-19 06:14:05 +03:00
test_onnx_conformance_layer_filter_opencv_denylist.inl.hpp	move global skip out of if loop, and add opencv_deny_list	2023-03-13 22:16:51 +08:00
test_onnx_conformance_layer_filter_opencv_ocl_fp16_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance_layer_filter_opencv_ocl_fp32_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance_layer_parser_denylist.inl.hpp	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
test_onnx_conformance.cpp	Merge pull request #22275 from zihaomu:fp16_support_conv	2023-05-17 09:38:33 +03:00
test_onnx_importer.cpp	dnn: add gemm_layer in place of fully_connected_layer for onnx models (#23897 )	2023-09-20 00:53:34 +03:00
test_precomp.hpp	dnn: reduce set of ignored warnings	2018-11-15 13:15:59 +03:00
test_tf_importer.cpp	Merge pull request #24039 from dkurt:tflite_test_backends	2023-08-04 11:28:51 +03:00
test_tflite_importer.cpp	Merge pull request #24196 from dkurt:ov_backend_cleanups	2023-09-05 18:08:28 +03:00
test_torch_importer.cpp	Increase eps for Test_Torch_nets.FastNeuralStyle_accuracy to prevent sporadic test failres with CUDA.	2023-07-21 13:51:03 +03:00