Vitaly Tuzov
9a4b5a4545
OpenVX calls updated to use single common OpenVX context per thread
2017-02-21 16:08:23 +03:00
Tomoaki Teshima
820fdbff23
brush up AVX optimization in popcount
...
- make sure SIMD optimization works even when AVX is not available
2017-01-26 18:37:56 +09:00
Tomoaki Teshima
8b22099da2
use universal intrinsic and SSE4 popcount instruction in normHamming
...
- add v_popcount in universal intrinsic
- add test for v_popcount
- add wrapper of popcount for both MSVC and GCC
2017-01-12 09:09:22 +09:00
atinfinity
b3c77e664d
added to check the support of AVX2
2017-01-09 19:51:33 +09:00
Rostislav Vasilikhin
8b9422a052
OpenVX wrappers rewritten with CV_OVX_RUN, VX_DbgThrow
2016-12-14 17:49:41 +03:00
Vadim Pisarevsky
924ac10159
Merge pull request #7649 from terfendail:ovx_meanstddev
2016-12-01 17:46:30 +00:00
Vitaly Tuzov
a4b9eb23d1
Added OpenVX based processing to meanStdDev
2016-11-29 16:59:25 +03:00
Vitaly Tuzov
e9cc0f6a4c
Added OpenVX based processing to minMaxLoc
2016-11-29 15:33:48 +03:00
Pavel Vlasov
349d5ba012
--perf_instrument parameter now has int type and 0, 1, 2 modes (1 - simple trees, 2 - expanded trees for functions with same name but different calling address);
...
Maximum depth limit var was added to the instrumentation structure;
Trace names output console output fix: improper tree formatting could happen;
Output in case of error was added;
Custom regions improvements;
Improved timing and weight calculation for parallel regions; New TC (threads counter) value to indicate how many different threads accessed particular node;
parallel_for, warnings fixes and ReturnAddress code from Alexander Alekhin;
2016-11-08 10:18:05 +03:00
Vadim Pisarevsky
fad0598634
normHamming done with AVX 2 intrinsics (ver 2) ( #7405 )
...
* normHamming done with AVX 2 intrinsics.
* fixed compiling AVX2-optimized norm hamming with MSVC
2016-10-05 21:23:32 +04:00
Pavel Vlasov
30a6cee2fe
Instrumentation for OpenCV API regions and IPP functions;
2016-08-19 18:10:03 +03:00
Pavel Vlasov
680ca88ce0
Outdated ICV restrictions were removed;
2016-08-19 15:08:39 +03:00
Pavel Vlasov
a462d76a95
Implementations instrumentation framework for OpenCV performance system;
2016-08-17 16:26:03 +03:00
Vadim Pisarevsky
0a7e53da0a
Merge pull request #7010 from K-Shinotsuka:issue10
2016-08-03 18:44:02 +00:00
k-shinotsuka
1a18a33688
improve SumSqr_SIMD<schar, int, int>()
2016-08-03 19:17:51 +09:00
k-shinotsuka
0b4e06ad3c
improve SumSqr_SIMD<uchar, int, int>()
2016-08-02 23:46:03 +09:00
Ilya Lavrenov
c4b57d0ded
disable ippiMinMaxIndx_32f_C1R usage since it crashes on Nans
2016-07-14 14:52:58 +03:00
Maksim Shabunin
5e9192287d
Added HAL cvtColor interface
2016-04-18 16:34:25 +03:00
Maksim Shabunin
84f37d352f
HAL moved back to core
2015-12-17 12:33:23 +03:00
Pavel Vlasov
62854dcc0d
Enables support of IPP 9.0.0;
...
HAVE_IPP_ICV_ONLY will be undefined if OpenCV was linked against ICV packet from IPP9 or greater. ICV9+ packets will be aligned with IPP in OpenCV APIs
This will ease code management between IPP and ICV
2015-09-29 17:27:13 +03:00
Pavel Vlasov
14b006e808
IPP_VERSION_X100 was changed to:
...
IPP_VERSION_MAJOR * 100 + IPP_VERSION_MINOR*10 + IPP_VERSION_UPDATE
to manage changes between updates more easily.
IPP_DISABLE_BLOCK was added to ease tracking of disabled IPP functions;
2015-09-25 17:50:15 +03:00
Pavel Vlasov
2177c7c5a8
Some IPP functions were encapsulated;
...
Minor changes to IPP implementations;
2015-09-25 17:30:26 +03:00
Gleb Gladilov
344d9fd83f
Fixed minMaxLoc and test functions
2015-09-21 13:44:29 +02:00
Alexander Alekhin
c0d61964d6
ocl: fix unaligned memory access
...
http://code.opencv.org/issues/4462
2015-07-06 13:58:17 +03:00
Pavel Vlasov
101607a7d0
Imgproc_Hist_MinMaxVal.accuracy fix;
...
Some code style corrections;
2015-06-24 13:50:17 +03:00
Dmitry Budnikov
a5a21019b2
ipp_countNonZero build fix;
...
Removed IPP port for tiny arithm.cpp functions
Additional warnings fix on various platforms.
Build without OPENCL and GCC warnings fixed
Fixed warnings, trailing spaces and removed unused secure_cpy.
IPP code refactored.
IPP code path implemented as separate static functions to simplify future work with IPP code and make it more readable.
2015-06-18 12:47:07 +03:00
Vadim Pisarevsky
7dd67f4997
use _mm_sad_epu8 instead of _mm_movemask_epi8 in countNonZero. Therefore, we do not need any popcount table and this solves http://code.opencv.org/issues/4333
2015-05-14 20:33:53 +03:00
Vadim Pisarevsky
d8f360316d
fixed http://code.opencv.org/issues/3828 , added test. thanks to berak for the patch
2015-04-29 13:12:53 +03:00
Vadim Pisarevsky
ee11a2d266
fully implemented SSE and NEON cases of intrin.hpp; extended the HAL with some basic math functions
2015-04-16 23:00:26 +03:00
Maksim Shabunin
35b2139929
Basic HAL module
2015-04-07 18:16:22 +03:00
Vadim Pisarevsky
64bd77cf4d
Merge pull request #3794 from vpisarev:ubuntu_x86_fixes
2015-03-09 21:34:17 +00:00
Vadim Pisarevsky
5d8a1bf813
explained reason why IPP is disabled on 32-bit Linux. disable IPP's minMaxLoc_32f on 32-bit OSes only
2015-03-06 15:02:22 +03:00
Vadim Pisarevsky
7a52c3d9f1
1) make sure Python 2.7 is detected on Ubuntu (including libs) when it's there.
...
2) disable IPP on 32-bit Linux when OpenCV is built as shared libs. Otherwise we get linker errors
3) disable IPP's minMaxIdx 32-bit floating-point flavor in a hope that it fixes some test failures
2015-03-05 18:58:54 +03:00
Ilya Lavrenov
cf4c79ebdd
disabled some kernels for Android && NVidia
2015-03-04 14:46:49 +03:00
Ilya Lavrenov
b963d84b41
fixed bug #4215
2015-02-25 21:57:17 +03:00
Ilya Lavrenov
e9a6c5db21
sqsum
2015-01-20 21:10:06 +03:00
Ilya Lavrenov
8c94568cc3
cv::sum
2015-01-12 10:59:31 +03:00
Ilya Lavrenov
6bce6ee34a
checks
2015-01-12 10:59:31 +03:00
Ilya Lavrenov
fc0869735d
used popcnt
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
5578088983
countNonZero
2015-01-12 10:59:28 +03:00
Alexander Alekhin
1c9f590f0d
IPPICV: disable NormDiff_L1_16s_C1R for IPP/ICV 8.2/8.2.1
2014-11-05 13:26:23 +03:00
Pavel Vlasov
45958eaabc
Implementation detector and selector for IPP and OpenCL;
...
IPP can be switched on and off on runtime;
Optional implementation collector was added (switched off by default in CMake). Gathers data of implementation used in functions and report this info through performance TS;
TS modifications for implementations control;
2014-10-15 14:24:41 +04:00
Ilya Lavrenov
be3efdf274
cv::sum refactoring
2014-09-30 14:36:21 +00:00
Ilya Lavrenov
1c491c42cd
fix for cornerHarris
2014-09-29 14:59:46 +00:00
Ilya Lavrenov
f50f0ba63e
cv::norm
2014-09-28 07:28:33 -07:00
Ilya Lavrenov
44ea50f1c4
cv::countNonZero
2014-09-28 07:06:53 -07:00
Ilya Lavrenov
27b933ba5a
Neon optimization of cv::sum
2014-09-22 09:22:03 +00:00
Vadim Pisarevsky
64a53de27d
Merge pull request #3185 from ElenaGvozdeva:ocl_norm
2014-09-04 08:53:47 +00:00
Elena Gvozdeva
9fe11db7e2
disabled IPP acceleration for 3-channel norms and for CV_8S only for APPLE
2014-09-04 10:38:45 +04:00
Alexander Karsakov
f57a4bf87b
Disabled minMaxIdx for 32FC1 since it occasionally fails on AMD devices (e.g. A10-6800K)
2014-09-03 14:36:51 +04:00