opencv/modules/dnn
Arjan van de Ven a75840d19c Merge pull request #10468 from fenrus75:avx512-2
* Add a 512 bit codepath to the AVX512 fastConv function

this patch adds a 512 wide codepath to the fastConv() function for
AVX512 use.
The basic idea is to process the first N * 16 elements of the vector
with avx512, and then run the rest of the vector using the traditional
AVX2 codepath.

* dnn: use unaligned AVX512 load (OpenCV aligns data on 32-byte boundary)

* dnn: change "vecsize" condition for AVX512

* dnn: fix indentation
2018-01-31 16:34:12 +03:00
..
include/opencv2 Merge pull request #10585 from dkurt:dnn_weightless_scale 2018-01-15 06:07:50 +00:00
misc dnn: Updated protobuf files (3.5.1) 2018-01-15 15:51:38 +00:00
perf dnn(test): avoid calling of cv::setNumThreads() in tests directly 2017-12-27 15:16:41 +00:00
src Merge pull request #10468 from fenrus75:avx512-2 2018-01-31 16:34:12 +03:00
test Fix padding for average pooling from TensorFlow 2018-01-31 15:54:30 +03:00
CMakeLists.txt dnn: protobuf build warnings 2018-01-15 21:15:23 +00:00