mirror of https://github.com/opencv/opencv.git
synced 2024-11-29 13:47:32 +08:00
8b3d6603d5
another round of dnn optimization:

* increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly
* improved SIMD optimization of pooling layer, optimized average pooling
* cleaned up convolution layer implementation
* made activation layer "attachable" to all other layers, including fully connected and addition layer
* fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it describes the topology
* greatly optimized permutation layer, which improved SSD performance
* parallelized element-wise binary/ternary/... ops (sum, prod, max)
* also, added missing copyrights to many of the layer implementation files
* temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders

include/opencv2
misc
perf
src
test
tutorials
CMakeLists.txt