vbystricky
aab6f6c56f
Set sync=true in ocl_minMaxIdx kernel call. For tests
2014-08-14 12:14:56 +04:00
Vadim Pisarevsky
5d8098a2c3
avoid compiler warnings
2014-08-14 10:06:10 +04:00
Vadim Pisarevsky
5f590ebed0
Merge pull request #3094 from Adil-Ibragimov:minor_fixes_in_Mat-data
2014-08-13 18:28:51 +00:00
Vadim Pisarevsky
71ff228ad3
Merge pull request #3089 from Adil-Ibragimov:fix_cuda_constness
2014-08-13 18:09:42 +00:00
Vadim Pisarevsky
a37876ab95
fixed build of iOS framework another time; clang does not like the new NEON-optimized split/merge
2014-08-13 19:59:26 +04:00
vbystricky
1fe403f461
Enable OpenCL version of norm and convertScaleAbs for 32F data
Fix error in minmaxloc.cl
Change test for convertScaleAbs
Fix minMaxIdx for _src2 align
Change epsilon on the tests
2014-08-13 18:33:01 +04:00
Vadim Pisarevsky
d97f9fc477
now the tests pass with loop_times==30
2014-08-13 16:32:01 +04:00
Adil Ibragimov
8a4a1bb018
Several types of formal refactoring:
1. someMatrix.data -> someMatrix.ptr()
2. someMatrix.data + someMatrix.step * lineIndex -> someMatrix.ptr( lineIndex )
3. (SomeType*) someMatrix.data -> someMatrix.ptr<SomeType>()
4. someMatrix.data -> !someMatrix.empty() ( or !someMatrix.data -> someMatrix.empty() ) in logical expressions
2014-08-13 15:21:35 +04:00
Vadim Pisarevsky
3cd7941098
Merge pull request #3083 from akarsakov:ocl_transpose_check_locmem
2014-08-13 10:50:51 +00:00
Vadim Pisarevsky
01bd4961e3
Merge pull request #3081 from akarsakov:small_fixes
2014-08-13 10:50:33 +00:00
Alexander Karsakov
c3100eeb19
Fixed buffer initialization in reduce kernel. Enabled OCL version of reduce for SUM, MAX, MIN modes.
2014-08-13 12:03:06 +04:00
Adil Ibragimov
dfd0b297ce
datastart must be non-const in CUDA.
2014-08-13 11:11:33 +04:00
Alexander Karsakov
713ddb89bd
Set minimum matrix size for AmdBlas::gemm to 20 since it works incorrectly for small sizes
2014-08-12 18:02:29 +04:00
Alexander Karsakov
3deea1c3ce
Added check of required local memory size for non-inplace transpose kernel
2014-08-12 16:26:08 +04:00
Alexander Alekhin
da806cc3e7
Merge pull request #3073 from vbystricky:ocl_fixMinMaxLoc
2014-08-12 10:21:41 +00:00
vbystricky
6fb282aa39
Remove mul24, for CV8UC3 3840x2160 it generates implementation specific result
2014-08-12 11:25:23 +04:00
Alexander Alekhin
8fd61c9aa6
Merge pull request #3075 from akarsakov:ipp_imgproc_fix
2014-08-11 15:02:53 +00:00
Elena Gvozdeva
b73b28ab9c
fix for ocl_lut
2014-08-11 16:21:55 +04:00
vbystricky
dfe5fff2f5
Restore OpenCL branch for minmaxLoc on float data
2014-08-11 15:57:29 +04:00
Alexander Karsakov
835b5e2179
Disabled ippiConvert_32f16s_C1R since it breaks OCL accuracy tests of remap with test_loop_times=30.
2014-08-11 15:50:06 +04:00
Vadim Pisarevsky
9ef373f6a5
Merge pull request #3038 from yury-gorbachev:core_arithm_neon
2014-08-10 20:20:04 +00:00
Vadim Pisarevsky
4de4ff5682
Merge pull request #3067 from vpisarev:minor_fixes2
2014-08-10 10:52:57 +00:00
Vadim Pisarevsky
e2f24f43c9
added some basic functionality needed by the new face module (moved from the old "contrib")
2014-08-10 01:54:16 +04:00
Vadim Pisarevsky
05e7c29de5
fixed various warnings and obvious errors reported by clang compiler and the coverity tool.
2014-08-10 00:10:05 +04:00
Yury Gorbachev
9a233999a9
Fixed review comment from Vadim Pisarevsky
2014-08-09 13:37:53 +04:00
Vadim Pisarevsky
99eed2d885
Merge pull request #3057 from Adil-Ibragimov:adding-constness
2014-08-08 17:27:47 +00:00
Vadim Pisarevsky
74457cd3b5
Merge pull request #3055 from akarsakov:copyto_UMat_fix
2014-08-08 14:20:02 +00:00
Vadim Pisarevsky
88672f414b
Merge pull request #3056 from akarsakov:ocl_setIdentity_fix
2014-08-08 14:03:46 +00:00
Alexander Karsakov
6ad4521b78
Fixed typos
2014-08-08 13:11:35 +04:00
Adil Ibragimov
8dab2ffce7
fixing error, wrong template method param.
2014-08-07 17:43:32 +04:00
Adil Ibragimov
98d5731ad8
some formal changes (generally adding constness)
2014-08-07 15:49:14 +04:00
Alexander Karsakov
c5ad6de596
Fixed choice of kercn and rowsPerWI for non-Intel device.
2014-08-07 15:37:26 +04:00
Alexander Karsakov
00221e96fd
Fixed issue: Mat::copyTo(UMat) if device copy is obsolete. Added test.
2014-08-07 13:16:02 +04:00
Vadim Pisarevsky
ca183707f2
Merge pull request #3039 from alalek:ocv_world
2014-08-06 14:52:52 +00:00
Vadim Pisarevsky
d611684d9e
Merge pull request #3041 from adrians:optimize_split
2014-08-06 09:22:38 +00:00
Alexander Alekhin
55188fe991
world fix
2014-08-05 20:12:35 +04:00
vbystricky
774d277c1f
Fix error in OpenCL version of meanstddev for continuous src and non-continuous mask
2014-08-05 17:30:06 +04:00
Adrian Stratulat
b2acd1f75a
Vectorize split & merge for NEON
2014-08-05 06:51:36 +00:00
Vadim Pisarevsky
a05ce00a65
Merge pull request #2879 from alalek:ocl_runtime_flags
2014-08-04 16:43:01 +00:00
Vadim Pisarevsky
432b7cb235
Merge pull request #3032 from vpisarev:refactor_ml2
2014-08-04 14:51:38 +00:00
Yury Gorbachev
61423a2d47
ARM NEON accelerated implementation of cv::addWeighted, cv::inRange and cv::compare functions.
NOT verified on target platform, compilation passes with and without NEON.
2014-08-04 16:44:51 +04:00
vbystricky
b48e487d53
Enable ocl version of minmaxloc with mask
2014-07-31 19:12:10 +04:00
vbystricky
11a0e3ff78
Fix error in OCL minmaxloc
2014-07-31 19:04:38 +04:00
Elena Gvozdeva
9db8592aa8
fixed ocl_flip for cn==3
2014-07-31 16:57:47 +04:00
Vadim Pisarevsky
b0cd822924
trying to make the tests pass
2014-07-31 14:24:21 +04:00
Vadim Pisarevsky
ef8647fe30
put IPP ReduceSum_32f back
2014-07-30 19:12:11 +04:00
Vadim Pisarevsky
5267ed46c7
1. disabled OpenCL acceleration for cv::multiply() (CV_32F), cv::divide (CV_32F), cv::convertScaleAbs (CV_32F) and cv::reduce (SUM, CV_32F), cv::reduce (MIN & MAX), cv::flip (3-channel case).
2. changed the number of test loops from 1 to 30 (except for the cv::pow() test, which fails for an as-yet-unknown reason)
3. disabled IPP acceleration for 3-channel norms.
4. modified relativeNorm test function to handle very small values
2014-07-30 18:19:47 +04:00
Vadim Pisarevsky
ba3783d205
initial commit; ml has been refactored; it compiles and the tests run well; some other modules, apps and samples do not compile; to be fixed
2014-07-29 23:54:23 +04:00
Vadim Pisarevsky
41df63aacd
Merge pull request #3010 from vpisarev:ocl_disable_some_norms2
2014-07-28 17:24:42 +00:00
Vadim Pisarevsky
962b519708
Merge pull request #2996 from akarsakov:ocl_dft_new_concept
2014-07-28 15:59:59 +00:00
Vadim Pisarevsky
0c749fd7e5
Merge pull request #2998 from ernest-galbrun:cuda_concurrency
2014-07-28 15:59:43 +00:00
Vadim Pisarevsky
af83884dc7
disabled ocl_minmaxloc on 32-bit data completely
2014-07-28 19:35:06 +04:00
Vadim Pisarevsky
1925fab427
disabled some more stuff
2014-07-28 19:32:06 +04:00
Vadim Pisarevsky
97f6dd1820
trying to make OpenCL builder green
2014-07-28 17:16:02 +04:00
Vadim Pisarevsky
f42bb0bd65
disable OpenCL difference norms with mask, since some of the tests fail
2014-07-28 15:45:54 +04:00
Vadim Pisarevsky
ef0f010437
Merge pull request #3003 from ElenaGvozdeva:ocl_sum
2014-07-28 11:08:08 +00:00
Alexander Karsakov
fecfaf4092
Using std::map in PlanCache
2014-07-28 14:19:47 +04:00
Elena Gvozdeva
27d97aa231
fixed
2014-07-28 12:21:00 +04:00
Elena Gvozdeva
fe29af2e58
Fixed bug in reduce.cl
2014-07-25 14:51:30 +04:00
Alexander Karsakov
37d01e2d27
Added license header, using cv::Ptr, small fixes.
2014-07-25 13:27:00 +04:00
Adrian Stratulat
a98ce86ca3
Add NEON intrinsics to core/src/arithm.cpp
2014-07-24 19:45:32 +00:00
Alexander Karsakov
66ac46214d
Final refactoring, fixes
2014-07-24 13:23:02 +04:00
Ernest Galbrun
551ab83cf2
trailing whitespace
2014-07-23 14:34:22 +02:00
Ernest Galbrun
d7038423dd
Merge branch 'master' of https://github.com/Itseez/opencv
2014-07-23 14:21:45 +02:00
Ernest Galbrun
6a769c92b3
modified default stream initialization to allow concurrent calls; modified cuda surf.cuda.cpp to allow concurrent calls
2014-07-23 14:21:21 +02:00
Elena Gvozdeva
4fdd6de8e8
Fixed ippiDFT call
2014-07-23 13:39:31 +04:00
Alexander Karsakov
1d2cf0e20e
Added nonzero_rows support
2014-07-22 18:31:08 +04:00
Alexander Karsakov
52f76a3283
Added the rest of Elena's changes
2014-07-22 18:31:08 +04:00
Alexander Karsakov
7791264525
Added multi-block scheme
2014-07-22 18:31:03 +04:00
Alexander Karsakov
2b9e556055
Added Elena's changes with implemented DFT_INVERSE C2C mode.
2014-07-22 18:27:31 +04:00
Alexander Karsakov
b17bf031f6
Added DFT_SCALE for forward transforms
2014-07-22 18:27:31 +04:00
Alexander Karsakov
6c8b6bd0c7
Added packing to CCS format
2014-07-22 18:27:25 +04:00
Alexander Karsakov
ed07241f89
Completed all forward transforms.
2014-07-22 18:27:17 +04:00
Alexander Karsakov
e5a3ab3cb9
Added fftplan cache
2014-07-22 18:27:08 +04:00
Alexander Karsakov
0318d27720
Enabled precalculated wave
2014-07-22 18:26:58 +04:00
Alexander Karsakov
5dd9263848
Multi-radix with kernel generation
2014-07-22 18:25:59 +04:00
PhilLab
e82241cba5
Function for drawing arrows
Porting the [PR 2970](https://github.com/Itseez/opencv/pull/2970) to the new InputOutputArray type
2014-07-18 10:04:44 +02:00
Vadim Pisarevsky
b63a7e66db
Merge pull request #2953 from gongzg:master
2014-07-14 19:21:15 +00:00
Alexander Alekhin
e169083fed
Merge pull request #2918 from vbystricky:oclopt_reduce2
2014-07-12 05:00:57 +00:00
Alexander Alekhin
df74a266cf
Merge pull request #2963 from ilya-lavrenov:tapi_bufferpool
2014-07-11 14:55:13 +00:00
Alexander Alekhin
0876ec7a67
Merge pull request #2935 from vbystricky:oclopt_meanStdDev
2014-07-11 14:53:46 +00:00
Alexander Alekhin
cbf63b0709
Merge pull request #2851 from ilya-lavrenov:tapi_set_identity
2014-07-11 10:00:16 +00:00
Ilya Lavrenov
f2a2467de3
enabled BufferPool
2014-07-11 12:21:10 +04:00
Alexander Alekhin
a0816c6d15
Merge pull request #2927 from ilya-lavrenov:tapi_lut
2014-07-10 09:38:37 +00:00
Zhigang Gong
2e49ca4903
Avoid unmapping a UMat (OCL) that hasn't been mapped at all.
According to opencl 1.2 spec 5.4.2:
enqueues a command to unmap a previously mapped region of a memory object.
...
CL_INVALID_VALUE if mapped_ptr is not a valid pointer returned by
clEnqueueMapBuffer, or clEnqueueMapImage for memobj.
So if the u->data is not from a clEnqueueMapBuffer call, we should not
call clEnqueueUnmapMemObject() to unmap it. With this patch, the cases
./opencv_test_video --gtest_filter=OCL_Video/FarnebackOpticalFlow.Mat/*
work well with beignet 0.9.1; otherwise, they get a
CL_INVALID_VALUE at the clEnqueueUnmapMemObject() call.
Signed-off-by: Zhigang Gong <zhigang.gong@intel.com>
2014-07-09 16:21:20 +08:00
Alexander Alekhin
e79ceb4b45
Merge pull request #2936 from ilya-lavrenov:tapi_minmaxloc_opt
2014-07-08 07:30:49 +00:00
Vadim Pisarevsky
e1553d5da3
Merge pull request #2929 from ilya-lavrenov:sse_pow
2014-07-07 11:22:26 +00:00
Vadim Pisarevsky
b2ab1b758a
Merge pull request #2937 from ilya-lavrenov:sse_patch_nans
2014-07-07 11:20:30 +00:00
Ilya Lavrenov
e2b86974ae
SSE2 optimization of cv::patchNaNs
2014-07-04 10:27:44 +04:00
Ilya Lavrenov
e04c39b089
SSE2 optimization of cv::inRange
2014-07-04 10:11:18 +04:00
Alexander Alekhin
964b260937
Merge pull request #2932 from ilya-lavrenov:tapi_other_vendors
2014-07-03 13:02:41 +00:00
Ilya Lavrenov
55d84cf815
optimization of cv::minMaxLoc - used min/max
2014-07-03 16:56:22 +04:00
vbystricky
690c932d7f
Change groups count for Intel
2014-07-03 15:50:56 +04:00
Ilya Lavrenov
2fe07abf83
use vectors for devices of all vendors
2014-07-03 15:32:34 +04:00
Alexander Alekhin
c5d3c08256
ocl: add try-catch for OpenCL device getter
2014-07-03 12:18:19 +04:00
Ilya Lavrenov
e5b63d43f2
sse2 optimization of inv_sqrt_64f
2014-07-03 01:33:49 +04:00
Ilya Lavrenov
10c772fa7f
minor optimization of cv::LUT
2014-07-02 18:50:21 +04:00
Vadim Pisarevsky
0e534b5a25
Merge pull request #2923 from alalek:ocl_default_device_ignore_cpu
2014-07-02 09:56:59 +00:00
vbystricky
a3b9884674
Change global buffer to local
2014-07-02 11:36:25 +04:00
Vadim Pisarevsky
15197eb09b
Merge pull request #2911 from ilya-lavrenov:sse2_convert_scale_abs
2014-07-01 17:31:37 +00:00