opencv/modules/dnn
HAN Liutong aaca4987c9
Merge pull request #20287 from hanliutong:dev-rvv-0.10
Optimization of DNN using native RISC-V vector intrinsics.

* Use RVV to optimize fastGEMM (FP32) in DNN.

* Use RVV to optimize fastGEMM1T in DNN.

* Use RVV to optimize fastConv in DNN.

* Use RVV to optimize fastDepthwiseConv in DNN.

* Vectorize tails using vl.

* Use "vl" instead of scalar code to handle small blocks in fastConv.

* Fix out-of-bounds memory access in "fastGEMM1T".

* Remove setvl.

* Remove useless initialization.

* Use loop unrolling to handle the tail part instead of a switch.
2021-08-11 01:16:03 +03:00
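
The "Vectorize tails using vl" item in the commit message above can be illustrated with a portable sketch. This is not the code from the pull request: it uses no RISC-V intrinsics, and `kVlMax`, `set_vl`, and `dot_strip_mined` are hypothetical names chosen for illustration. It only emulates the general idea that each iteration requests a vector length `vl` equal to the smaller of the remaining element count and the hardware maximum, so the final partial chunk runs through the same loop body and no separate scalar tail loop is needed.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical stand-in for the hardware's maximum vector length, in floats.
// On real RVV hardware this value is implementation-defined.
constexpr std::size_t kVlMax = 8;

// Emulates a "set vector length" request: the granted length is the smaller
// of the number of remaining elements and the maximum vector length.
inline std::size_t set_vl(std::size_t remaining) {
    return std::min(remaining, kVlMax);
}

// Dot product written in strip-mined form. The inner loop over lanes plays
// the role of one vector multiply-accumulate that operates on `vl` elements.
float dot_strip_mined(const float* a, const float* b, std::size_t n) {
    float acc[kVlMax] = {0.0f};           // per-lane partial sums
    for (std::size_t i = 0; i < n; ) {
        const std::size_t vl = set_vl(n - i);  // shrinks on the last chunk
        for (std::size_t lane = 0; lane < vl; ++lane) {
            acc[lane] += a[i + lane] * b[i + lane];
        }
        i += vl;
    }
    float sum = 0.0f;                     // horizontal reduction of the lanes
    for (std::size_t lane = 0; lane < kVlMax; ++lane) {
        sum += acc[lane];
    }
    return sum;
}
```

With 11 input elements and `kVlMax` set to 8, the loop runs twice: once with `vl` equal to 8 and once with `vl` equal to 3. Lanes beyond `vl` are left untouched in the second pass, which is the property that makes a scalar tail loop unnecessary.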
| Name | Last commit | Date |
| --- | --- | --- |
| cmake/hooks | Merge pull request #14660 from YashasSamaga:dnn-cuda-build | 2019-06-02 14:47:15 +03:00 |
| include/opencv2 | Merge pull request #20442 from JulieBar:gru_layer | 2021-08-07 10:07:37 +03:00 |
| misc | Merge remote-tracking branch 'upstream/3.4' into merge-3.4 | 2021-08-07 17:25:06 +00:00 |
| perf | Merge remote-tracking branch 'upstream/3.4' into merge-3.4 | 2020-12-17 21:05:34 +00:00 |
| src | Merge pull request #20287 from hanliutong:dev-rvv-0.10 | 2021-08-11 01:16:03 +03:00 |
| test | Merge pull request #20442 from JulieBar:gru_layer | 2021-08-07 10:07:37 +03:00 |
| CMakeLists.txt | Merge pull request #20287 from hanliutong:dev-rvv-0.10 | 2021-08-11 01:16:03 +03:00 |