Vadim Pisarevsky
5e93c82023
Merge pull request #9491 from dkurt:tf_lstm
2017-09-28 21:04:06 +00:00
Vadim Pisarevsky
68cc2e292d
Merge pull request #9734 from dkurt:fix_deconv_layer_kernel_layout
2017-09-28 11:42:57 +00:00
Vadim Pisarevsky
45365e4df1
Merge pull request #9691 from dkurt:padding_layer_refactoring
2017-09-28 11:34:28 +00:00
Dmitry Kurtaev
6e593cd1f0
Swap dimensions of deconvolution kernel
2017-09-27 22:38:34 +03:00
Alexander Alekhin
3dee92ec50
fix usage of CV_FMA3 macro
2017-09-26 17:23:54 +03:00
Dmitry Kurtaev
84cec17913
LSTM layer for TensorFlow importer
2017-09-26 12:59:36 +03:00
Dmitry Kurtaev
222149b9c6
Refactored Padding layer
2017-09-22 12:39:00 +03:00
Dmitry Kurtaev
17a85b16fc
Remove reorder_dims attribute of Reshape layer
2017-09-21 16:42:03 +03:00
Dmitry Kurtaev
d891e9b1d8
Layers for MobileNet from TensorFlow
2017-09-15 20:17:30 +03:00
Vadim Pisarevsky
6bf8fe815d
Merge pull request #9384 from dkurt:torch_split
2017-09-15 13:05:05 +00:00
Vadim Pisarevsky
41b23fde9f
Merge pull request #9524 from dkurt:dnn_torch_openface
2017-09-15 12:38:12 +00:00
Dmitry Kurtaev
0ce7c33bc8
Torch's Concat and ConcatTable doesn't use Split layer
2017-09-14 09:26:57 +03:00
Dmitry Kurtaev
7dc6b1d7d4
Layers for OpenFace face recognition network
2017-09-14 09:11:31 +03:00
Dmitry Kurtaev
58b890b9f7
Dilated convolution import from TensorFlow
2017-09-13 18:44:14 +03:00
Maksim Shabunin
248e2c7d47
Fixed some issues found by static analysis
2017-09-08 12:22:12 +03:00
dkurt
339793143c
Unit tests for TensorFlow importer
2017-08-03 11:29:48 +03:00
Aleksandr Rybnikov
8d6b8b45b6
Added ELU and test for it
2017-08-02 11:13:59 +03:00
Alexander Alekhin
2959e7aba9
Merge pull request #9188 from arrybn:mobilenet_ssd_sample
2017-08-01 11:12:54 +00:00
Aleksandr Rybnikov
ce1cc352d9
MobileNet SSD sample
2017-08-01 12:30:27 +03:00
Tomoaki Teshima
0f91faddae
fix linker error when trying CPU_BASELINE=AVX
2017-07-21 21:13:47 +09:00
Aleksandr Rybnikov
7d1140340e
Rewrote googlenet tests
2017-07-18 18:49:14 +03:00
Vadim Pisarevsky
0488d9bdb2
optimize out scaleLayer & concatLayer whenever possible
...
fixed problem in concat layer by disabling memory re-use in layers with multiple inputs
trying to fix the tests when Halide is used to run deep nets
another attempt to fix Halide tests
see if the Halide tests will pass with concat layer fusion turned off
trying to fix failures in halide tests; another try
one more experiment to make halide_concat & halide_enet tests pass
continue attempts to fix halide tests
moving on
uncomment parallel concat layer
seemingly fixed failures in Halide tests and re-enabled concat layer fusion; thanks to dkurt for the patch
2017-07-14 18:30:53 +03:00
Alexander Alekhin
4784c7be5f
dnn: cleanup dispatched code, fix SIMD128 types
2017-07-13 19:00:34 +03:00
Alexander Alekhin
c3e6de293f
dnn: code cleanup, refactor detection output layer
2017-07-13 19:00:34 +03:00
Alexander Alekhin
520da7aaaf
Merge pull request #9111 from vpisarev:dnn_optim_avx1
2017-07-13 12:27:05 +00:00
dkurt
3203635765
Eltwise layer fixes
2017-07-10 12:58:11 +03:00
Vadim Pisarevsky
ed9564106c
reuse AVX2-optimized kernels for AVX1 CPUs (like IvyBridge)
2017-07-06 21:36:59 +03:00
Maksim Shabunin
e0393f8557
Fixed some issues found by static analysis (4th round)
2017-06-30 12:26:53 +03:00
Vadim Pisarevsky
ac49a17a82
Merge pull request #9022 from dkurt:keep_conv_weights_for_halide
2017-06-29 11:09:17 +00:00
Maksim Shabunin
ace0701a46
Merge pull request #9019 from alalek:dnn_trace
2017-06-29 07:33:46 +00:00
Alexander Alekhin
324851882a
Merge pull request #9025 from mshabunin:fix-static-3
2017-06-28 20:50:21 +00:00
Maksim Shabunin
a769d69a9d
Fixed several issues found by static analysis
2017-06-28 18:06:18 +03:00
dkurt
b46f5b1b38
Align convolutional layer weights separately from origin ones
2017-06-28 17:05:56 +03:00
Alexander Alekhin
ed10383359
dnn: added trace macros
2017-06-28 14:57:26 +03:00
Vadim Pisarevsky
c5faa9aefa
Merge pull request #9013 from arrybn:ssd_last_layers_optim
2017-06-28 10:38:55 +00:00
Vadim Pisarevsky
bbb14d3746
Merge pull request #9003 from dkurt:halide_bug_fixes
2017-06-28 08:48:27 +00:00
Aleksandr Rybnikov
ec321e651f
Removed usage of std::map in DetectionOutput layer
2017-06-28 11:31:38 +03:00
Vadim Pisarevsky
8b3d6603d5
another round of dnn optimization ( #9011 )
...
* another round of dnn optimization:
* increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly
* improved SIMD optimization of pooling layer, optimized average pooling
* cleaned up convolution layer implementation
* made activation layer "attacheable" to all other layers, including fully connected and addition layer.
* fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology.
* greatly optimized permutation layer, which improved SSD performance
* parallelized element-wise binary/ternary/... ops (sum, prod, max)
* also, added missing copyrights to many of the layer implementation files
* temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders
2017-06-28 11:15:22 +03:00
Alexander Alekhin
f8a75c4361
dispatch: added CV_TRY_${OPT} macro, fix dnn build
...
- 1: OPT is available directly or via dispatcher
- 0: optimization is not compiled at all
2017-06-27 17:05:15 +03:00
dkurt
121789f78e
Fixed some bugs from Halide tests
2017-06-27 14:52:46 +03:00
Alexander Alekhin
93091ba203
dnn: AVX2 fix invalid unaligned read
2017-06-26 19:48:42 +03:00
Alexander Alekhin
93729784bb
dnn: move module from opencv_contrib
...
e6f63c7a38/modules/dnn
2017-06-26 13:41:51 +03:00