Tomoaki Teshima
7fef96be1e
add 64F intrinsic in HAL NEON
...
* use universal intrinsic for accumulate series using float/double
* accumulate, accumulateSquare, accumulateProduct and accumulateWeighted
* add v_cvt_f64_high in both SSE/NEON
* add test for conversion v_cvt_f64_high in test_intrin.cpp
* improve some existing universal intrinsic by using new instructions in Aarch64
* add workaround for Android build in intrin_neon.hpp
2016-08-30 17:21:02 +09:00
Alexander Alekhin
b3b434e221
Merge pull request #7141 from lupustr3:pvlasov/instrumentation_extension
2016-08-23 16:48:25 +00:00
Pavel Vlasov
30a6cee2fe
Instrumentation for OpenCV API regions and IPP functions;
2016-08-19 18:10:03 +03:00
Alexander Alekhin
bf4c5bef7f
Merge pull request #7135 from lupustr3:pvlasov/icv_restrictions_remove
2016-08-19 13:23:47 +00:00
Pavel Vlasov
680ca88ce0
Outdated ICV restrictions were removed;
2016-08-19 15:08:39 +03:00
Vadim Pisarevsky
0f716cdf33
Merge pull request #7108 from K-Shinotsuka:issue16
2016-08-19 08:10:51 +00:00
Alexander Alekhin
e884bbabcb
Merge pull request #7110 from tomoaki0705:featureUniversalIntrinsicAccumulate
2016-08-15 14:54:10 +00:00
k-shinotsuka
3faaabd6f4
improve YCrCb2RGB_i<uchar>()
2016-08-15 19:20:04 +09:00
Tomoaki Teshima
4f7679cf51
use universal intrinsic in accumulate series
...
* accumulate, accumulateSquare, accumulateProduct and accumulateWeighted
2016-08-15 13:00:39 +09:00
Maksim Shabunin
cf30c582c6
Merge pull request #7057 from K-Shinotsuka:issue13
2016-08-11 11:46:33 +00:00
Maksim Shabunin
de0909e03c
Merge pull request #7083 from tomoaki0705:fixAccumulateShortFloat
2016-08-11 09:00:38 +00:00
Alexander Alekhin
0dd00321e0
Merge pull request #7072 from alalek:ocl_fix_canny
2016-08-10 13:44:52 +03:00
Vadim Pisarevsky
df665e2386
Merge pull request #6983 from K-Shinotsuka:issue8
2016-08-10 10:31:12 +00:00
Tomoaki Teshima
ba73249dc0
let the test of AccSqr_SIMD<ushort, float> pass
...
* The difference becomes too large when multiply is done in int16
* To reproduce the test failure, IPP has to be switched off
2016-08-10 18:36:00 +09:00
Alexander Alekhin
b8e08d5d3c
ocl: fix Canny for Intel devices
...
There is an issue with processing of abs(short) function for
negative argument.
Affected OpenCL devices:
- iGPU: Intel(R) HD Graphics 520 (OpenCL 2.0 )
- CPU: Intel(R) Core(TM) i5-6300U CPU @ 2.40GHz (OpenCL 2.0 (Build 10094))
2016-08-09 12:48:06 +03:00
k-shinotsuka
e37c53d13e
improve RGB2YCrCb_i<uchar>()
2016-08-09 11:33:00 +09:00
Vadim Pisarevsky
a95d63dab6
Merge pull request #7042 from alalek:update_6967
2016-08-08 15:17:48 +00:00
mschoeneck
5f30a0a076
Merge pull request #6958 from mschoeneck:Common-Canny-parallelization
...
* Common Canny parallelization added. TBB and single thread code removed. Final pass vectorized with SSE2 intrinsics.
* wrong #ifdef replaced with #if
* Merged to actual Canny version
* Merged common parallelized Canny with actual Canny implementation
* Remove 'Mutex *mutex' and pass 'Mutex mutex' from outside to parallelCanny
* Replaced extern Mutex with intern mutable Mutex.
2016-08-05 17:20:56 +03:00
Alexander Alekhin
b12286329f
imgproc: sepFilter: update OpenCL condition
...
Problem is observed with test_loop_times=100
in corner cases like 93x1 or similar
2016-08-05 12:27:24 +03:00
Alexander Alekhin
2812762d6b
imgproc: disable IPP for GaussianBlur in case of ROI
2016-08-05 12:27:18 +03:00
matze
37cccf17c0
imgproc: Prefer OCL calculation before IPP in GaussianBlur.
2016-08-05 12:27:16 +03:00
k-shinotsuka
065d631f0e
improve MomentsInTile_SIMD<uchar, int, int>()
2016-08-04 20:08:53 +09:00
Vadim Pisarevsky
b8d743bf50
Merge pull request #6970 from K-Shinotsuka:issue6
2016-08-03 18:50:00 +00:00
k-shinotsuka
5775ace5e0
improve thresh_8u()
2016-08-02 23:35:14 +09:00
Vadim Pisarevsky
3b1803f6b1
Merge pull request #6989 from mself:gftt-deterministic-sort
2016-07-28 15:27:39 +00:00
Vadim Pisarevsky
f96b5652ce
Merge pull request #6984 from K-Shinotsuka:issue9
2016-07-28 12:05:13 +00:00
k-shinotsuka
ecfd4ad11e
improve MomentsInTile_SIMD<ushort, int, int>()
2016-07-27 20:27:30 +09:00
Matthew Self
72672c293f
Make goodFeaturesToTrack() return deterministic results
...
When using OCL, the results of goodFeaturesToTrack() vary slightly from
run to run. This appears to be because the order of the results from
the findCorners kernel depends on thread execution and the sorting
function that is used at the end to rank the features only enforces are
partial sort order.
This does not materially impact the quality of the results, but it
makes it hard to build regression tests and generally introduces noise
into the system that should be avoided.
An easy fix is to change the sort function to enforce a total sort on
the features, even in cases where the match quality is exactly the same
for two features.
2016-07-25 21:17:54 -07:00
Vadim Pisarevsky
167a4e6592
Merge pull request #6942 from alalek:fix_android_pack
2016-07-21 07:30:15 +00:00
Tomoaki Teshima
57d1591957
imgproc: fix typo
...
* indent line
2016-07-21 06:42:11 +09:00
Alexander Alekhin
2ec63e4dd1
fix android pack build
2016-07-20 16:49:57 +03:00
Vadim Pisarevsky
cc5c5f7da9
Merge pull request #6795 from tomoaki0705:accelerateBilateralFilterNeon
2016-07-20 13:19:06 +00:00
Tomoaki Teshima
b436f4b995
accelerate bilateralFilter using NEON
...
* clean up some lines
2016-07-20 08:28:49 +09:00
Alexander Alekhin
575de89ee2
imgproc: Canny: enable IPP & OpenCL optimization code path
2016-07-19 17:15:36 +03:00
Alexander Alekhin
460b1dc2fa
imgproc: Canny with custom gradient
2016-07-19 17:15:36 +03:00
Vadim Pisarevsky
34d68edd1a
Merge pull request #6924 from K-Shinotsuka:issue3
2016-07-18 18:16:02 +00:00
k-shinotsuka
a25dba001d
improve SymmRowSmallVec_8u32s().
2016-07-18 13:58:11 +09:00
k-shinotsuka
6f8695c178
improve RowVec_8u32s().
2016-07-17 18:35:34 +09:00
Maksim Shabunin
91d80d0309
Merge pull request #6798 from tomoaki0705:featureSSEvideoAcc
2016-07-15 12:31:18 +00:00
Tomoaki Teshima
3c2f7ecc97
speed up accumulate, accumulateSquare, accumulateProduct and accumulateWeighted using SIMD
...
* use SSE and/or AVX based on configuration
* revise the test to verify the implementation
2016-07-15 08:09:24 +09:00
Maksim Shabunin
79f8e516b8
Merge pull request #6891 from snosov1:fix-5311
2016-07-14 11:25:58 +00:00
Maksim Shabunin
4d053405cf
Merge pull request #6809 from K-Shinotsuka:master
2016-07-14 09:37:53 +00:00
Alexander Alekhin
9c92f65c0b
Merge pull request #6808 from sturkmen72:patch-14
2016-07-13 11:54:11 +00:00
Alexander Alekhin
6c4b634cfe
Merge pull request #6814 from tomoaki0705:featureThreshold64fSimd
2016-07-13 11:45:28 +00:00
Tomoaki Teshima
9fca953e62
check the CPU flag correctly
2016-07-13 07:34:49 +09:00
Sergei Nosov
dce310e03c
provide better error messages
2016-07-12 18:03:28 +03:00
Ilya Lavrenov
5bc10ef796
fixed empty image condition in resize
2016-07-12 13:23:19 +03:00
Alexander Alekhin
88da03722f
Merge pull request #6786 from terfendail:median_fix
2016-07-11 10:02:43 +00:00
Tomoaki Teshima
1095076d7f
imgproc: speed up threshold of 64F version using NEON and SSE
...
* use NEON under aarch64 only
* check 64F version correctly
2016-07-09 17:05:29 +09:00
Andrey Pavlenko
dcaac98569
Merge pull request #6775 from janstarzy:histo-ipp-fix
2016-07-08 15:15:17 +00:00