Vladislav Vinogradov
204a19b431
moved common gpu utility functionality to gpu_private.hpp
2013-04-08 17:25:15 +04:00
Vladislav Vinogradov
28b1e81883
renamed device -> cuda
2013-04-08 17:25:15 +04:00
Andrey Kamaev
f77a375354
Merge branch '2.4'
2013-03-29 19:38:35 +04:00
Vladislav Vinogradov
f0b19d4659
updated license header in whole gpu module
2013-03-22 14:03:15 +04:00
Andrey Kamaev
1ca8f33b4e
Merge branch '2.4'
2013-03-21 23:11:54 +04:00
Vladislav Vinogradov
99c49648fa
fixed gpu::LUT on multi device system
2013-03-21 13:39:25 +04:00
Andrey Kamaev
2a6fb2867e
Remove all using directives for STL namespace and members
Made all STL usages explicit so that all usages of a
particular class or function can be found automatically.
2013-02-25 15:04:17 +04:00
Vladislav Vinogradov
86bd325131
added gpu::normalize function
2013-02-13 15:55:58 +04:00
Vladislav Vinogradov
281d036fcf
optimizations:
- new reduce implementation (with kepler optimizations)
- saturate_cast via asm command
- video SIMD instructions in element operations
- float arithmetics instead of double
- new deviceSupports function
2013-01-23 21:05:07 +04:00
OpenCV Buildbot
04384a71e4
Normalize line endings and whitespace
2012-10-17 15:32:23 +04:00
Anatoly Baksheev
55b1d7c097
added CUDA_DISABLER preprocessor commands
2012-10-01 23:57:38 +04:00
Anatoly Baksheev
70204a8e68
Removed PtrElemStep, marked DevMem1D as deprecated; PtrStepSz should be used now
2012-08-24 02:02:42 +04:00
Marina Kolpakova
b065c7a296
fixes for the newly added gcc warning keys
2012-06-21 03:48:36 +00:00
Vladislav Vinogradov
2582464e51
fixed several problems with CUDA 5.0
* gpu::LUT, uses device memory instead of host memory
* gpu::multiply, round mod for CV_8U depth
2012-05-28 12:09:40 +00:00
Vladislav Vinogradov
26691e00d4
fixed gpu core tests (added additional check for device's feature support)
added assertion on double types for old devices
2012-03-26 14:33:43 +00:00
Vladislav Vinogradov
509c910101
refactored and fixed some gpu tests
fixed some bugs in gpu module
2012-03-20 12:03:34 +00:00
Vladislav Vinogradov
2d30480982
created wrappers for new NPP functions
removed void integral(const GpuMat& src, GpuMat& sum, GpuMat& sqsum, Stream& stream) - it fails with NPP_NOT_IMPLEMENTED error
updated docs, accuracy and performance tests
2012-02-22 10:00:53 +00:00
Vladislav Vinogradov
0f53f2993e
removed BEGIN_OPENCV_DEVICE_NAMESPACE macros
2011-11-14 09:02:06 +00:00
Vladislav Vinogradov
fcfa72081e
moved GpuMat and DevMem2D to core module, some code refactoring
2011-11-09 13:13:52 +00:00
Vladislav Vinogradov
e7502e7641
implemented gpu::gemm via CUBLAS
2011-10-19 13:29:54 +00:00
Vladislav Vinogradov
40ee754e28
added WITH_CUFFT and WITH_CUBLAS flags to cmake scripts
fixed gpu module error reporting
added asynchronous version of some functions
2011-10-19 09:53:22 +00:00
Vladislav Vinogradov
42ced17c2c
fixed some warnings under win64
2011-08-08 11:28:14 +00:00
Vladislav Vinogradov
926a6bba00
modified according to CUDA 4.0 API updates
2011-05-31 08:31:10 +00:00
Anatoly Baksheev
0747f2d863
1) NPP_staging as sources. Binaries removed.
2) NVidia tests for GPU
3) FD sample that uses NVidia's interface.
2011-02-04 15:15:25 +00:00
Vladislav Vinogradov
811ba31897
added synchronization after NPP calls
2011-01-24 10:32:57 +00:00
Anatoly Baksheev
1a94186195
First version of CascadeClassifier_GPU.
Only for VS2008 now.
Sample for it.
new NPP_staging for VS2008 only
2011-01-13 13:04:00 +00:00
Vladislav Vinogradov
0cd587ee34
added gpu transpose and integral based on NPP Staging.
added mask support to SURF_GPU.
2010-12-21 14:02:09 +00:00
Alexey Spizhevoy
df8529377b
refactoring: moved gpu reduction-based functions into separated file
2010-12-20 09:51:25 +00:00
Alexey Spizhevoy
0465b89e7e
gpu module refactoring: moved per-element operations into separated file
2010-12-20 09:07:19 +00:00
Alexey Spizhevoy
e62bf3a2ae
updated gpu bitwise operations
2010-12-20 08:06:13 +00:00
Alexey Spizhevoy
640af6623c
added support of multichannel images into gpu::sqrSum
2010-12-15 16:32:56 +00:00
Alexey Spizhevoy
d8a7ff1e00
refactored gpu module, added vec math operators for uint, added support of 2 channel images into gpu::sum (removed support of double)
2010-12-15 15:12:32 +00:00
Alexey Spizhevoy
c9f9f38777
added gpu::sqrSum function
2010-12-13 14:34:02 +00:00
Alexey Spizhevoy
3997514b7c
added tests for gpu::sum, it supports all data types, but single channel images only
2010-12-13 12:00:58 +00:00
Vladislav Vinogradov
49ec8ba742
fixed bug in gpu filter engine (incorrect buffer type) and in vector's saturate_cast.
changed buffer type in linear filters to float.
added support of 1 channel image to linear filters.
added support of BORDER_REFLECT101, BORDER_REPLICATE and BORDER_CONSTANT border type to gpu linear filters.
minor fix in tests.
update comments in gpu.hpp.
2010-12-13 08:43:04 +00:00
Vladislav Vinogradov
905e5f1739
added support of 4-channels images to StereoConstantSpaceBP.
refactored transpose_gpu, made it non template function.
2010-12-08 07:23:59 +00:00
Vladislav Vinogradov
17d9014373
added per-element min/max to gpu module.
fixed compile error in transform.
2010-12-06 08:10:11 +00:00
Vladislav Vinogradov
57f7678db1
added gpu transpose for CV_8UC4, CV_8SC4, CV_16SC2, CV_16UC2, CV_32SC1 and CV_32FC1 types
2010-12-01 07:00:50 +00:00
Alexey Spizhevoy
32a9b63d2f
added better threads configuration estimator for the minMax, minMaxLoc, countNonZero functions in gpu module
2010-11-30 12:27:21 +00:00
Alexey Spizhevoy
b260382840
added masks support into gpu::minMaxLoc
2010-11-29 13:56:43 +00:00
Alexey Spizhevoy
678f392569
added is_signed into numeric_limits_gpu, fixed incorrect min max finding for floating values
2010-11-29 13:21:43 +00:00
Alexey Spizhevoy
437ac1a2f1
added mask support into gpu::minMax
2010-11-29 10:29:21 +00:00
Alexey Spizhevoy
1066bd2fa2
removed double support (gpu: minMax, minMaxLoc, countNonZero) for CC which doesn't have native double support
2010-11-29 07:31:09 +00:00
Alexey Spizhevoy
72f020a8f3
added gpu::count_non_zero version for CC1.0, refactored gpu module a little
2010-11-29 07:18:11 +00:00
Alexey Spizhevoy
7e2cc1be1b
added first version of gpu::countNonZero for all data types, it doesn't support compute capability 1.0 yet, also fixed some little bugs
2010-11-26 17:12:48 +00:00
Alexey Spizhevoy
bdaad9e1fe
added support of buffers into gpu::minMaxLoc, reduced memory requirements, refactored
2010-11-26 07:50:11 +00:00
Alexey Spizhevoy
9ad7a1c927
fixed compile time errors in gpu module (if one compiles without CUDA)
2010-11-25 11:52:15 +00:00
Alexey Spizhevoy
fa5422a2ad
added buf support into gpu::minMax
2010-11-25 10:19:06 +00:00
Alexey Spizhevoy
48183f10e7
optimized memory requirements for gpu::minMax's buffers, added support of compute capability 1.0
2010-11-25 09:57:02 +00:00
Alexey Spizhevoy
27690e3b6e
added minMaxLoc function into gpu module
2010-11-24 11:40:14 +00:00