Vladislav Vinogradov
99c49648fa
fixed gpu::LUT on multi device system
2013-03-21 13:39:25 +04:00
Vladislav Vinogradov
86bd325131
added gpu::normalize function
2013-02-13 15:55:58 +04:00
Vladislav Vinogradov
281d036fcf
optimizations:
...
- new reduce implementation (with kepler optimizations)
- saturate_cast via asm command
- video SIMD instructions in element operations
- float arithmetics instead of double
- new deviceSupports function
2013-01-23 21:05:07 +04:00
OpenCV Buildbot
04384a71e4
Normalize line endings and whitespace
2012-10-17 15:32:23 +04:00
Anatoly Baksheev
55b1d7c097
added CUDA_DISABLER preprocessor commands
2012-10-01 23:57:38 +04:00
Anatoly Baksheev
70204a8e68
Removed PtrElemStep, Marked DevMem1D as deprecated, now should use PtrStepSz now
2012-08-24 02:02:42 +04:00
Marina Kolpakova
b065c7a296
fixes for the newly added gcc warning keys
2012-06-21 03:48:36 +00:00
Vladislav Vinogradov
2582464e51
fixed several problems with CUDA 5.0
...
* gpu::LUT, uses device memory instead of host memory
* gpu::multiply, round mod for CV_8U depth
2012-05-28 12:09:40 +00:00
Vladislav Vinogradov
26691e00d4
fixed gpu core tests (added additional check for device's feature support)
...
added assertion on double types for old devices
2012-03-26 14:33:43 +00:00
Vladislav Vinogradov
509c910101
refactored and fixed some gpu tests
...
fixed some bugs in gpu module
2012-03-20 12:03:34 +00:00
Vladislav Vinogradov
2d30480982
created wrappers for new NPP functions
...
removed void integral(const GpuMat& src, GpuMat& sum, GpuMat& sqsum, Stream& stream) - it fails with NPP_NOT_IMPLEMENTED error
updated docs, accuracy and performance tests
2012-02-22 10:00:53 +00:00
Vladislav Vinogradov
0f53f2993e
removed BEGIN_OPENCV_DEVICE_NAMESPACE macros
2011-11-14 09:02:06 +00:00
Vladislav Vinogradov
fcfa72081e
moved GpuMat and DevMem2D to core module, some code refactoring
2011-11-09 13:13:52 +00:00
Vladislav Vinogradov
e7502e7641
implemented gpu::gemm via CUBLAS
2011-10-19 13:29:54 +00:00
Vladislav Vinogradov
40ee754e28
added WITH_CUFFT and WITH_CUBLAS flags to cmake scripts
...
fixed gpu module error reporting
added asynchronous version of some functions
2011-10-19 09:53:22 +00:00
Vladislav Vinogradov
42ced17c2c
fixed some warnings under win64
2011-08-08 11:28:14 +00:00
Vladislav Vinogradov
926a6bba00
modified according to CUDA 4.0 API updates
2011-05-31 08:31:10 +00:00
Anatoly Baksheev
0747f2d863
1) NPP_staging as sources. Binaries removed.
...
2) NVidia tests for GPU
3) FD sample that uses NVidia's interface.
2011-02-04 15:15:25 +00:00
Vladislav Vinogradov
811ba31897
added synchronization after NPP calls
2011-01-24 10:32:57 +00:00
Anatoly Baksheev
1a94186195
First version of CascadeClassifier_GPU.
...
Only for VS2008 now.
Sample for it.
new NPP_staging for VS2008 only
2011-01-13 13:04:00 +00:00
Vladislav Vinogradov
0cd587ee34
added gpu transpose and integral based on NPP Staging.
...
added mask support to SURF_GPU.
2010-12-21 14:02:09 +00:00
Alexey Spizhevoy
df8529377b
refactoring: moved gpu reduction-based functions into separated file
2010-12-20 09:51:25 +00:00
Alexey Spizhevoy
0465b89e7e
gpu module refactoring: moved per-element operations into separated file
2010-12-20 09:07:19 +00:00
Alexey Spizhevoy
e62bf3a2ae
updated gpu bitwise operations
2010-12-20 08:06:13 +00:00
Alexey Spizhevoy
640af6623c
added support of multichannel images into gpu::sqrSum
2010-12-15 16:32:56 +00:00
Alexey Spizhevoy
d8a7ff1e00
refactored gpu module, added vec math operators for uint, added support of 2 channel images into gpu::sum (removed support of double)
2010-12-15 15:12:32 +00:00
Alexey Spizhevoy
c9f9f38777
added gpu::sqrSum function
2010-12-13 14:34:02 +00:00
Alexey Spizhevoy
3997514b7c
added tests for gpu::sum, it supports all data types, but single channel images only
2010-12-13 12:00:58 +00:00
Vladislav Vinogradov
49ec8ba742
fixed bug in gpu filter engine (incorrect buffer type) and in vector's saturate_cast.
...
changed buffer type in linear filters to float.
added support of 1 channel image to linear filters.
added support of BORDER_REFLECT101, BORDER_REPLICATE and BORDER_CONSTANT border type to gpu linear filters.
minor fix in tests.
update comments in gpu.hpp.
2010-12-13 08:43:04 +00:00
Vladislav Vinogradov
905e5f1739
added support of 4-channels images to StereoConstantSpaceBP.
...
refactored transpose_gpu, made it non template function.
2010-12-08 07:23:59 +00:00
Vladislav Vinogradov
17d9014373
added per-element min/max to gpu module.
...
fixed compile error in transform.
2010-12-06 08:10:11 +00:00
Vladislav Vinogradov
57f7678db1
added gpu transpose for CV_8UC4, CV_8SC4, CV_16SC2, CV_16UC2, CV_32SC1 and CV_32FC1 types
2010-12-01 07:00:50 +00:00
Alexey Spizhevoy
32a9b63d2f
added better threads configuration estimator for the minMax, minMaxLoc, countNonZero functions in gpu module
2010-11-30 12:27:21 +00:00
Alexey Spizhevoy
b260382840
added masks support into gpu::minMaxLoc
2010-11-29 13:56:43 +00:00
Alexey Spizhevoy
678f392569
added is_signed into numeric_limits_gpu, fixed incorrect min max finding for floating values
2010-11-29 13:21:43 +00:00
Alexey Spizhevoy
437ac1a2f1
added mask support into gpu::minMax
2010-11-29 10:29:21 +00:00
Alexey Spizhevoy
1066bd2fa2
removed double support (gpu: minMax, minMaxLoc, countNonZero) for CC which doesn't have native double support
2010-11-29 07:31:09 +00:00
Alexey Spizhevoy
72f020a8f3
added gpu::count_non_zero version for CC1.0, refactored gpu module a little
2010-11-29 07:18:11 +00:00
Alexey Spizhevoy
7e2cc1be1b
added first version of gpu::countNonZero for all data types, it doesn't support compute capability 1.0 yet, also fixed some little bugs
2010-11-26 17:12:48 +00:00
Alexey Spizhevoy
bdaad9e1fe
added support of buffers into gpu::minMaxLoc, reduced memory requirements, refactored
2010-11-26 07:50:11 +00:00
Alexey Spizhevoy
9ad7a1c927
fixed compile time errors in gpu module (if one compiles without CUDA)
2010-11-25 11:52:15 +00:00
Alexey Spizhevoy
fa5422a2ad
added buf support into gpu::minMax
2010-11-25 10:19:06 +00:00
Alexey Spizhevoy
48183f10e7
optimized memory requirements for gpu::minMax's buffers, added support of compute capability 1.0
2010-11-25 09:57:02 +00:00
Alexey Spizhevoy
27690e3b6e
added minMaxLoc function into gpu module
2010-11-24 11:40:14 +00:00
Alexey Spizhevoy
13d18d65a8
added support of multichannel matrices in gpu::minMax
2010-11-24 09:03:37 +00:00
Alexey Spizhevoy
282e01cb4a
added support of all data types into gpu::minMax
2010-11-24 08:55:52 +00:00
Vladislav Vinogradov
dd2fe821c2
deleted NPP_VERSION and NPP_HAVE_COMPLEX_TYPE macros, because last NPP supports complex types under linux
2010-11-23 08:21:26 +00:00
Alexey Spizhevoy
732bd621bb
added masks support for bitwise operations on GPU
2010-11-22 09:39:34 +00:00
Alexey Spizhevoy
faf4d0bc74
added bitwise operations into gpu module
2010-11-19 10:19:35 +00:00
Vladislav Vinogradov
863d61e9eb
fix gpu module compilation under linux
2010-11-08 09:55:10 +00:00