opencv/modules/core
Chuanbo Weng 2d8c89c40b Remove unnecessary kercn limitation of 4.
When accessing global memory by DWORD4, memory bandwidth
can be fully utilized on Intel platform. This patch will
make more image format(e.g. 8UC4) be processed in DWORD4
by work-item. After applying this patch, 3 subcase of
./opencv_perf_core --gtest_filter=OCL_RepeatFixture_Repeat.Repeat/*
can be speedup on HD4000 graphics card with Beignet:
OCL_RepeatFixture_Repeat.Repeat/2, 64% improvement.
OCL_RepeatFixture_Repeat.Repeat/6, 50% improvement.
OCL_RepeatFixture_Repeat.Repeat/8, 56% improvement.

Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
2014-12-04 11:15:13 +08:00
..
doc Documenting imgproc module 2014-11-18 14:00:56 +03:00
include/opencv2 Doxygen documentation: BiB references and fixes 2014-12-01 15:50:55 +03:00
perf changed perf test for ocl_gemm 2014-08-26 15:05:36 +04:00
src Remove unnecessary kercn limitation of 4. 2014-12-04 11:15:13 +08:00
test fixed my_umat(roi).clone() test by disabling poorly aligned ROI's. Real fix of the problem will require a separate branch in UMat::copyTo(). 2014-11-07 14:49:48 +03:00
CMakeLists.txt world fix 2014-08-05 20:12:35 +04:00