opencv/modules/dnn
HAN Liutong eba158fb0c
Merge pull request #25230 from hanliutong/rvv-conv
Optimize int8 layers in DNN modules by using RISC-V Vector intrinsic. #25230

This patch optimizes three functions in the int8 layers using native RVV intrinsics.

The patch was tested on QEMU with VLEN=128 and VLEN=256 by running `./bin/opencv_test_dnn --gtest_filter="*Int8*"`.
On a real device (K230, VLEN=128), `EfficientDet_int8` in `opencv_perf_dnn` ran 1.46x faster.

| Name of Test                               | Original | Optimized | Speed-up |
| ------------------------------------------ | -------- | --------- | -------- |
| EfficientDet_int8::DNNTestNetwork::OCV/CPU | 2843.467 | 1947.013   | 1.46     |
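
The speed-up column can be cross-checked from the two timings in the table. The sketch below assumes the speed-up is the original time divided by the optimized time. The helper name `speed_up` is hypothetical and only for illustration, and the timings are used as reported, since the table does not state their unit.

```python
def speed_up(original: float, optimized: float) -> float:
    """Return how many times faster the optimized run is (assumed: original / optimized)."""
    if optimized <= 0:
        raise ValueError("optimized time must be positive")
    return original / optimized


# Timings copied from the table row for EfficientDet_int8 (unit not stated in the source).
original_time = 2843.467
optimized_time = 1947.013

ratio = speed_up(original_time, optimized_time)
print(f"{ratio:.2f}")  # prints 1.46
```

The ratio is unit-independent, so the check holds whatever unit the benchmark reports, as long as both columns share it.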


### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [ ] I agree to contribute to the project under Apache 2 License.
- [ ] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [ ] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
      Patch to opencv_extra has the same branch name.
- [ ] The feature is well documented and sample code can be built with the project CMake
2024-03-31 16:47:06 +03:00
cmake dnn: plugin support for OpenVINO 2022-10-07 16:57:31 +00:00
include/opencv2 Merge pull request #24773 from tailsu:sd/pathlike 2024-01-12 16:23:05 +03:00
misc Merge pull request #24539 from LaurentBerger:blobrecttoimage 2023-12-19 20:00:04 +03:00
perf Fix proto and weights mess in dnn performance tests. 2024-02-07 09:16:09 +03:00
src Merge pull request #25230 from hanliutong/rvv-conv 2024-03-31 16:47:06 +03:00
test Merge pull request #25271 from fengyuentau:matmul_bias 2024-03-29 17:35:23 +03:00
CMakeLists.txt Merge pull request #25230 from hanliutong/rvv-conv 2024-03-31 16:47:06 +03:00