opencv/modules/dnn/src
Arjan van de Ven a75840d19c Merge pull request #10468 from fenrus75:avx512-2
* Add a 512-bit codepath to the AVX512 fastConv function

This patch adds a 512-bit-wide codepath to the fastConv() function for
AVX512 use.
The basic idea is to process the first N * 16 elements of the vector
with AVX512, and then process the rest of the vector using the existing
AVX2 codepath.
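
The split described above can be sketched in portable C++. This is only an illustration of the chunking, not the actual fastConv() code: the name dotSplit is hypothetical, plain inner loops stand in for the 16-lane (512-bit) and 8-lane (256-bit) float registers, and the scalar tail is an assumption added so the sketch works for any length.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of OpenCV): a dot product split into a
// 16-wide stage, an 8-wide stage, and a scalar tail. The inner loops over
// lanes stand in for single SIMD multiply-add instructions.
float dotSplit(const float* a, const float* b, std::size_t n)
{
    constexpr std::size_t kWide = 16;   // floats per 512-bit register
    constexpr std::size_t kNarrow = 8;  // floats per 256-bit register
    float acc16[kWide] = {0.0f};        // stand-in for a 512-bit accumulator
    float acc8[kNarrow] = {0.0f};       // stand-in for a 256-bit accumulator
    std::size_t i = 0;

    // Stage 1: the first N * 16 elements, 16 lanes at a time.
    for (; i + kWide <= n; i += kWide)
        for (std::size_t lane = 0; lane < kWide; ++lane)
            acc16[lane] += a[i + lane] * b[i + lane];

    // Stage 2: whatever remains, 8 lanes at a time.
    for (; i + kNarrow <= n; i += kNarrow)
        for (std::size_t lane = 0; lane < kNarrow; ++lane)
            acc8[lane] += a[i + lane] * b[i + lane];

    // Tail (assumption): fewer than 8 leftover elements, handled one by one.
    float sum = 0.0f;
    for (; i < n; ++i)
        sum += a[i] * b[i];

    // Horizontal reduction of both accumulators into the final result.
    for (std::size_t lane = 0; lane < kWide; ++lane)
        sum += acc16[lane];
    for (std::size_t lane = 0; lane < kNarrow; ++lane)
        sum += acc8[lane];
    return sum;
}

// Convenience overload for std::vector inputs of equal length.
float dotSplit(const std::vector<float>& a, const std::vector<float>& b)
{
    assert(a.size() == b.size());
    return dotSplit(a.data(), b.data(), a.size());
}
```

With 27 elements, for example, stage 1 consumes 16, stage 2 consumes 8, and the tail handles the last 3. A real SIMD version would replace each lane loop with one intrinsic call.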

* dnn: use unaligned AVX512 load (OpenCV aligns data on a 32-byte boundary)

* dnn: change "vecsize" condition for AVX512

* dnn: fix indentation
2018-01-31 16:34:12 +03:00
caffe Replace Caffe's psroi_pooling_param tag from 10001 to 10002 2018-01-08 13:29:20 +03:00
darknet Fixed missing #include "../precomp.hpp" 2018-01-05 15:10:39 +00:00
layers Merge pull request #10468 from fenrus75:avx512-2 2018-01-31 16:34:12 +03:00
ocl4dnn convolution and tanh layer fusion 2018-01-25 17:45:33 +08:00
opencl mvn, batch_norm and relu layer fusion 2018-01-25 18:57:05 +08:00
tensorflow PriorBox layer with explicit normalized sizes 2018-01-24 14:01:42 +03:00
torch Remove cv::dnn::Importer 2017-12-18 18:08:28 +03:00
dnn.cpp mvn, batch_norm and relu layer fusion 2018-01-25 18:57:05 +08:00
halide_scheduler.cpp dnn: fix build 2017-06-27 09:07:01 +03:00
halide_scheduler.hpp dnn: move module from opencv_contrib 2017-06-26 13:41:51 +03:00
init.cpp R-FCN models support 2017-12-20 10:43:22 +03:00
nms.cpp dnn: add an accuracy test for NMS 2017-10-25 13:40:56 +03:00
nms.inl.hpp dnn: add an accuracy test for NMS 2017-10-25 13:40:56 +03:00
op_halide.cpp Fixed Halide's copy_to_device invocation 2017-11-01 14:01:54 +03:00
op_halide.hpp MobileNet-SSD and VGG-SSD topologies in Halide 2017-09-08 09:55:53 +03:00
precomp.hpp Merge pull request #9114 from pengli:dnn_rebase 2017-10-02 15:38:00 +03:00