Alexey Spizhevoy
e303b0dd62
added CUDA_ARCH_FEATUERS define (for correct handling of BIN(PTX) cases)
2011-02-01 12:28:39 +00:00
Alexey Spizhevoy
16e74ab306
added buffered version of norm, updated performance sample and docs
2011-02-01 10:46:19 +00:00
Alexey Spizhevoy
3795142604
implemented gpu::norm via absSum, sqrSum, and minMax (removed norm_diff call), added support of other data types
2011-02-01 10:23:10 +00:00
Alexey Spizhevoy
ae529f4bc6
added absSum function
2011-01-31 14:37:03 +00:00
Alexey Spizhevoy
fa446e7e35
removed linear_filters_beta.cu as its functionality was moved into filters.cu
2011-01-31 13:31:59 +00:00
Vladislav Vinogradov
8274ed22e4
fixed gpu tests (BruteForceMatcher_GPU, divide, phase, cartToPolar, async)
...
minor code refactoring
2011-01-31 13:20:52 +00:00
Alexey Spizhevoy
7a29d96cf4
added buffered version of gpu::integral function and updated performance test (it still works too slow)
2011-01-31 10:42:33 +00:00
Alexey Spizhevoy
0af5356dbc
updated code in case of the compilation without CUDA
2011-01-28 12:30:08 +00:00
Alexey Spizhevoy
575fd1fe4c
reafactoring: replaced query device props functions with the DeviceInfo class
2011-01-28 11:59:26 +00:00
Alexey Spizhevoy
65b9f3bc10
fixed TargetArchs implementation in case when HAVE_CUDA=false, added initial structure for multi_gpu sample
2011-01-27 12:17:56 +00:00
Alexey Spizhevoy
891e2ff310
replaced has* methods in the GPU module with the TargetArchs monostate
2011-01-27 10:06:38 +00:00
Alexey Spizhevoy
13a6d0b92a
fixed parsing GPU archs in BIN(PTX) format
2011-01-27 08:26:10 +00:00
Vladislav Vinogradov
eda8416358
fixed BruteForceMatcher_GPU (fails if input data is empty)
...
updated BruteForceMatcher_GPU test
2011-01-26 15:58:47 +00:00
Alexey Spizhevoy
310c483da8
added first version of gpu performance tests
2011-01-24 10:33:01 +00:00
Vladislav Vinogradov
811ba31897
added synchronization after NPP calls
2011-01-24 10:32:57 +00:00
Vladislav Vinogradov
8abdb3721f
added gpu threshold.
2011-01-24 10:11:02 +00:00
Alexey Spizhevoy
055c226392
fixed FindNPP.cmake Apple-aimed part
2011-01-21 12:57:40 +00:00
Alexey Spizhevoy
e51b9021bc
forgot to rename namespace
2011-01-21 11:31:45 +00:00
Alexey Spizhevoy
db41449be8
fixed errors under MacOS
2011-01-21 10:53:07 +00:00
Anatoly Baksheev
e3f3de84db
warning fixed
2011-01-21 09:00:19 +00:00
Alexey Spizhevoy
01dafce1a1
fixed some bugs in GPU matrix reductions, removed <functional> into precomp.hpp
2011-01-21 07:43:11 +00:00
Alexey Spizhevoy
0da71a01ff
fixed some GPU tests failing when compiled for 1.1(no doubles) and run on 1.3(with doubles)
2011-01-20 15:08:48 +00:00
Alexey Spizhevoy
9e48f64149
fixed parsing of CC in gpu module
2011-01-20 14:34:27 +00:00
Alexey Spizhevoy
574b3f94a1
updated gpu initialization functions, added compile-time error on CC 1.0
2011-01-20 14:13:07 +00:00
Alexey Spizhevoy
8779306800
updated main CMakeLists.txt gpu module section, now user can manage binary and intermediate code versions of the gpu module image
...
added more functions to check version of gpu code in runtime
2011-01-20 09:22:05 +00:00
Alexey Spizhevoy
4b8425dbb8
fixed minor bugs
2011-01-19 13:28:22 +00:00
Alexey Spizhevoy
186b1fc6ef
refactored gpu module
2011-01-19 12:47:34 +00:00
Alexey Spizhevoy
90ae1e3aed
refactored gpu module
2011-01-19 10:54:58 +00:00
Alexey Spizhevoy
1a0d41fb53
added checkPtxVersion into gpu module
2011-01-18 14:52:35 +00:00
Alexey Spizhevoy
cbb132ccb1
added ensureSizeIsEnough into gpu module, updated reduction methods
2011-01-18 12:36:01 +00:00
Alexey Spizhevoy
f3a2656808
added hasPtxFor and isCompatibleWith functions into gpu module, added docs for them
2011-01-18 12:01:28 +00:00
Anatoly Baksheev
cb63046dcf
gpu face detetcion:
...
1) fixed bug with error codes (enum NppStStatus) shift.
2) added some asserts
2011-01-17 11:32:38 +00:00
Alexey Spizhevoy
db852e0b54
updated gpu module docs
2011-01-13 13:55:13 +00:00
Vladislav Vinogradov
349e0ece93
added docs for GPU Filter Engine
2011-01-13 13:48:58 +00:00
Anatoly Baksheev
1a94186195
First version of CascadeClassifier_GPU.
...
Only for VS2008 now.
Sample for it.
new NPP_staging for VS2008 only
2011-01-13 13:04:00 +00:00
Alexey Spizhevoy
4e23f37ff8
added docs for gpu::HOGDescriptor
2011-01-12 13:07:33 +00:00
Alexey Spizhevoy
4dfbf99dd5
cosmetic changes in gpu module, decreased matchTemplate test running time
2011-01-12 09:30:08 +00:00
Alexey Spizhevoy
d091ae5746
removed unnecessary code from gpu::matchTemplate
2011-01-12 06:49:03 +00:00
Alexey Spizhevoy
57f917d6f2
replaced adding constant in normalization with taking max
2011-01-11 10:15:46 +00:00
Alexey Spizhevoy
dc763e0250
updated normalization routine in the matchTemplate to avoid division by zero on black images (ticket #798 ), added test
2011-01-11 09:36:21 +00:00
Alexey Spizhevoy
2f13e4ce58
refactored hog, added camera support into hog_sample
2010-12-29 15:45:01 +00:00
Alexey Spizhevoy
ba32833c3f
added missed include (for MSVC 2010)
2010-12-28 14:57:23 +00:00
Alexey Spizhevoy
be38864dd0
added buffered version of gpu::convolve
2010-12-27 10:18:42 +00:00
Alexey Spizhevoy
1ecb6cf775
simplified gpu::columnSum test, it doesn't fail on Quadro anymore (when seed is 000001af5a11badd) after BFM test, but something definitely wrong with NPP_Staging's transpose
2010-12-27 09:10:22 +00:00
Alexey Spizhevoy
8f0d36b8b6
refactored gpu::dft
2010-12-27 07:35:41 +00:00
Alexey Spizhevoy
86802ec968
minor warning fixed
2010-12-24 13:44:26 +00:00
Alexey Spizhevoy
eaf35a8421
minor refactoring in gpu module
2010-12-24 12:55:43 +00:00
Alexey Spizhevoy
21b081deff
now single row GPU matrix is continuous one, added aux. functions, updated dft and matchTemplates
2010-12-24 09:26:19 +00:00
Alexey Spizhevoy
54fcdf4cae
minor changes in gpu module (comments and warnings)
2010-12-24 07:00:08 +00:00
Alexey Spizhevoy
6702d55711
added support of scaling into gpu::dft, refactored gpu::convolve
2010-12-24 06:48:23 +00:00
Alexey Spizhevoy
783716838d
updated test for gpu::dft, updated dft for handling continous source
2010-12-23 13:00:33 +00:00
Alexey Spizhevoy
52ca0c4bca
fixed bug in gpu::dft
2010-12-23 10:07:37 +00:00
Alexey Spizhevoy
09735fd208
added gpu::dft implemented via CUFFT
2010-12-23 09:24:33 +00:00
Alexey Spizhevoy
68aba9f2fb
added mulSpectrums functions into GPU module
2010-12-22 13:46:06 +00:00
Alexey Spizhevoy
fef06c25b5
moved crossCorr (as NPP_Staging wrapper) into public GPU module part from the internal matchTemplate files
2010-12-22 08:56:16 +00:00
Alexey Spizhevoy
f9bcef9003
moved sqrIntegral (NPP_Staging wrapper) into public GPU module part from matchTemplate.cpp
2010-12-22 08:17:36 +00:00
Alexey Spizhevoy
a6d9cce500
added CUFFT errors handling into GPU module
2010-12-22 08:03:53 +00:00
Vladislav Vinogradov
8190837dd4
fixed some warnings under linux
2010-12-22 07:30:21 +00:00
Vladislav Vinogradov
0cd587ee34
added gpu transpose and integral based on NPP Staging.
...
added mask support to SURF_GPU.
2010-12-21 14:02:09 +00:00
Vladislav Vinogradov
5bfb44f887
fixed compiler error under linux
2010-12-21 08:13:15 +00:00
Vladislav Vinogradov
0e45a637c4
added SURF_GPU.
...
added support of CV_32FC1 type to gpu copyMakeBorder.
2010-12-20 12:49:40 +00:00
Alexey Spizhevoy
df8529377b
refactoring: moved gpu reduction-based functions into separated file
2010-12-20 09:51:25 +00:00
Alexey Spizhevoy
1922e50f19
refactoring: made gpu bitwise operations inline
2010-12-20 09:16:17 +00:00
Alexey Spizhevoy
0465b89e7e
gpu module refactoring: moved per-element operations into separated file
2010-12-20 09:07:19 +00:00
Alexey Spizhevoy
6891a60149
added host code for gpu::matchTemplate (as NPP_staging was integrated)
2010-12-20 08:09:09 +00:00
Alexey Spizhevoy
e62bf3a2ae
updated gpu bitwise operations
2010-12-20 08:06:13 +00:00
Anatoly Baksheev
0f30fe080f
renamed constantspacebp -> stereocsbp line all other algs
...
created element_operations and matrix_reductions files for farther refactoring
2010-12-19 17:20:54 +00:00
Anatoly Baksheev
9dd4a22a5e
temporary added NPP_staging, functionality from the library will be moved to NPP with next release.
2010-12-17 15:41:26 +00:00
Alexey Spizhevoy
7767038ef0
updated other gpu's bitwise operations
2010-12-17 12:48:04 +00:00
Alexey Spizhevoy
5132ce211b
reimplemented gpu::bitwise_not operation, refactored gpu module
2010-12-17 12:22:51 +00:00
Alexey Spizhevoy
1a93412eca
refactored bitwise operations in gpu module
2010-12-17 10:26:57 +00:00
Alexey Spizhevoy
74197c5b14
fixed link time error in gpu module
2010-12-16 08:46:47 +00:00
Alexey Spizhevoy
343c33d73e
added support of CCOEFF_NORMED for multichannel images (8U) into gpu::matchTemplate
2010-12-16 08:10:31 +00:00
Alexey Spizhevoy
640af6623c
added support of multichannel images into gpu::sqrSum
2010-12-15 16:32:56 +00:00
Alexey Spizhevoy
3db5b687f6
added support of multichannel images into gpu::matchTemplate for CCOEFF method
2010-12-15 16:04:10 +00:00
Alexey Spizhevoy
f56d9c340f
added support of remaining image number of channels into gpu::sum
2010-12-15 15:28:35 +00:00
Alexey Spizhevoy
d8a7ff1e00
refactored gpu module, added vec math operators for uint, added support of 2 channel images into gpu::sum (removed support of double)
2010-12-15 15:12:32 +00:00
Alexey Spizhevoy
e5eec31be1
fixed minor bugs in gpu module
2010-12-15 12:10:30 +00:00
Alexey Spizhevoy
68c3018047
added support of multichannel images into gpu::matchTemplate (all methods except CCOEFF based), refactored
2010-12-15 11:22:37 +00:00
Alexey Spizhevoy
e7cf541f5f
fixed bug in matchTemplate when template size is (1,1), refactored
2010-12-14 09:53:17 +00:00
Alexey Spizhevoy
1887b7d2e4
refactored matchTemplate.cu
2010-12-14 08:45:11 +00:00
Alexey Spizhevoy
ce47a37e6e
added CUDA kernel for CV_TM_CCOEFF_NORMED matchTemplate method
2010-12-14 08:00:53 +00:00
Alexey Spizhevoy
b35aa77418
added CUDA kernel for CV_TM_CCOEFF matchTemplate method
2010-12-14 07:42:55 +00:00
Alexey Spizhevoy
39700c5d54
added some gpu::matchTemplate kernels (other parts after NPP Staging integration)
2010-12-13 16:48:34 +00:00
Alexey Spizhevoy
a81b41fb08
fixed some warning under Ubuntu in gpu module
2010-12-13 15:56:29 +00:00
Alexey Spizhevoy
c9f9f38777
added gpu::sqrSum function
2010-12-13 14:34:02 +00:00
Vladislav Vinogradov
b18a3a5f83
fixed errors in StereoBeliefPropogation under linux
2010-12-13 13:52:40 +00:00
Anatoly Baksheev
070d87fb7f
fixed bug with prefilter_xsobel preset.
2010-12-13 13:42:35 +00:00
Alexey Spizhevoy
e193fa1165
fixed minor bug in gpu::sum
2010-12-13 12:50:52 +00:00
Alexey Spizhevoy
3997514b7c
added tests for gpu::sum, it supports all data types, but single channel images only
2010-12-13 12:00:58 +00:00
Vladislav Vinogradov
442cd75c32
fixed compile error under linux
2010-12-13 09:50:32 +00:00
Vladislav Vinogradov
49ec8ba742
fixed bug in gpu filter engine (incorrect buffer type) and in vector's saturate_cast.
...
changed buffer type in linear filters to float.
added support of 1 channel image to linear filters.
added support of BORDER_REFLECT101, BORDER_REPLICATE and BORDER_CONSTANT border type to gpu linear filters.
minor fix in tests.
update comments in gpu.hpp.
2010-12-13 08:43:04 +00:00
Anatoly Baksheev
6c2cce6e28
some files rename
2010-12-11 15:21:11 +00:00
Anatoly Baksheev
ec4d2b6bb9
Fixed serious kernel crash IN StereoBM_GPU for some parameter. Will more rework for the code farther.
2010-12-11 15:07:14 +00:00
Alexey Spizhevoy
79b1eec3d3
minor refactoring of gpu module
2010-12-10 14:02:41 +00:00
Alexey Spizhevoy
9f80317ffa
fixed minor bug in gpu module, added first version of sum
2010-12-10 13:36:00 +00:00
Alexey Spizhevoy
d557c800a7
refactored gpu module
2010-12-10 10:23:32 +00:00
Alexey Spizhevoy
97484089c5
added support of CV_8U & CV_TM_CCORR into gpu::matchTemplate
2010-12-10 08:06:54 +00:00
Alexey Spizhevoy
9801d07a46
added test for gpu:columnSum
2010-12-08 16:51:12 +00:00
Alexey Spizhevoy
fa322bf46f
added gpu::columnSum, fixed compile error (if there is no cuda), refactored
2010-12-08 15:06:10 +00:00
Alexey Spizhevoy
b1c5b9293e
added support of CV_32F & CV_TM_SQDIFF into gpu::matchTemplate
2010-12-08 13:12:12 +00:00
Alexey Spizhevoy
40304721a7
added support of CV_TM_CCORR (via FFT) into gpu::matchTemplate (versions both with block and without blocks)
2010-12-08 13:03:53 +00:00
Vladislav Vinogradov
905e5f1739
added support of 4-channels images to StereoConstantSpaceBP.
...
refactored transpose_gpu, made it non template function.
2010-12-08 07:23:59 +00:00
Vladislav Vinogradov
c18aa438ec
added support of 4 channels images to StereoBeliefPropagation, minor code refactoring.
2010-12-07 10:20:03 +00:00
Alexey Spizhevoy
349a9fac86
fixed compile-time warning under g++
2010-12-07 07:16:13 +00:00
Alexey Spizhevoy
19b434be50
minor refactoring of gpu module
2010-12-07 06:45:54 +00:00
Anatoly Baksheev
652fb1212e
module reorganization: added folder with pure device functions, cuda_shared.hpp renamed to internal_shared.hpp
2010-12-06 16:37:32 +00:00
Vladislav Vinogradov
fadd19b976
added assertion on Compute Capability >= 1.1 to BruteForceMatcher_GPU_base::radiusMatch
2010-12-06 14:51:47 +00:00
Alexey Spizhevoy
50167f6c26
added first version of gpu::matchTemplate, currently it works only with 8UC1 images and SQDIFF method
2010-12-06 14:19:41 +00:00
Vladislav Vinogradov
8891acb67a
added BruteForceMatcher_GPU
2010-12-06 12:06:51 +00:00
Alexey Spizhevoy
77027f6075
refactored border interpolation in gpu module
2010-12-06 09:44:51 +00:00
Vladislav Vinogradov
17d9014373
added per-element min/max to gpu module.
...
fixed compile error in transform.
2010-12-06 08:10:11 +00:00
Anatoly Baksheev
d96c5ebb7d
misprint
2010-12-06 08:03:33 +00:00
Alexey Spizhevoy
6ef4d9b1dd
added sipport of BORDER_REPLICATE mode into gpu::corner* functions
2010-12-06 07:47:26 +00:00
Alexey Spizhevoy
103bbaf09c
updated BrdReflect101 in gpu module
2010-12-03 13:28:49 +00:00
Alexey Spizhevoy
e75ca4b662
replaced global memory reads with texture memory reads in GPU's corner detectors
2010-12-03 13:11:14 +00:00
Alexey Spizhevoy
df48ad76b0
added support of gamama_correction parameter into gpu::HOGDescriptor, updated tests
2010-12-03 11:11:44 +00:00
Alexey Spizhevoy
2a2590bae2
replaced filter (from the nearest to linear) mode when resizing image in gpu::HOGDescriptor
2010-12-03 10:14:01 +00:00
Alexey Spizhevoy
68c41a6c6c
minor fix in gpu's header
2010-12-02 09:29:49 +00:00
Alexey Spizhevoy
ab04a03621
added support of BORDER_REFLECT into gpu::cornerHarris and etc, added support of border extr. int linear filters
2010-12-02 09:07:13 +00:00
Alexey Spizhevoy
d9489bbe0c
fixed bug in gpu::cornerHarris and gpu::cornerMinEigenVal
2010-12-01 10:00:47 +00:00
Vladislav Vinogradov
57f7678db1
added gpu transpose for CV_8UC4, CV_8SC4, CV_16SC2, CV_16UC2, CV_32SC1 and CV_32FC1 types
2010-12-01 07:00:50 +00:00
Alexey Spizhevoy
d0dfae5143
fixed bug in ms segmentation
2010-11-30 13:02:24 +00:00
Alexey Spizhevoy
32a9b63d2f
added better threads configuration estimator for the minMax, minMaxLoc, countNonZero functions in gpu module
2010-11-30 12:27:21 +00:00
Alexey Spizhevoy
493d7a6e45
added min eigen val based corner detector into gpu module
2010-11-30 08:44:04 +00:00
Alexey Spizhevoy
9adfc2cadc
added Harris corner detector into gpu module
2010-11-30 08:04:37 +00:00
Alexey Spizhevoy
b260382840
added masks support into gpu::minMaxLoc
2010-11-29 13:56:43 +00:00
Alexey Spizhevoy
678f392569
added is_signed into numeric_limits_gpu, fixed incorrect min max finding for floating values
2010-11-29 13:21:43 +00:00
Alexey Spizhevoy
6ad158dbe1
refactored gpu::minMaxLoc
2010-11-29 10:44:25 +00:00
Alexey Spizhevoy
437ac1a2f1
added mask support into gpu::minMax
2010-11-29 10:29:21 +00:00
Alexey Spizhevoy
8c48f3be9d
moved GPU's global counter reset from caller to the kernel's end
2010-11-29 08:09:54 +00:00
Alexey Spizhevoy
3da253a259
replaced one-threads tail reduce with one-block tail reduce in functions gpu: minMax, minMaxLoc
2010-11-29 08:04:39 +00:00
Alexey Spizhevoy
1066bd2fa2
removed double support (gpu: minMax, minMaxLoc, countNonZero) for CC which doesn't have native double support
2010-11-29 07:31:09 +00:00
Alexey Spizhevoy
72f020a8f3
added gpu::count_non_zero version for CC1.0, refactored gpu module a little
2010-11-29 07:18:11 +00:00
Anatoly Baksheev
4c6ffc7c77
temporary removed VERSION_VAR (because this requires cmake 2.8.3)
2010-11-28 11:12:35 +00:00
Anatoly Baksheev
fbdb4f4ab5
1) gpu test refactoring.
...
2) fixed gpu bm test fail
3) CMAKE_MINIMUM_REQUIRED(VERSION 2.8.3)
2010-11-26 17:24:12 +00:00
Alexey Spizhevoy
7e2cc1be1b
added first version of gpu::countNonZero for all data types, it doesn't support compute capability 1.0 yet, also fixed some little bugs
2010-11-26 17:12:48 +00:00
Anatoly Baksheev
e470246ab5
restored version check (VERSION_VAR)
2010-11-26 15:51:43 +00:00
Martin Baeum
a7d979fb7e
remove spurious VERSION_VAR ( fixes #711 )
2010-11-26 13:19:42 +00:00
Alexey Spizhevoy
bdaad9e1fe
added support of buffers into gpu::minMaxLoc, reduced memory requirements, refactored
2010-11-26 07:50:11 +00:00
Alexey Spizhevoy
9ad7a1c927
fixed compile time errors in gpu module (if one compiles without CUDA)
2010-11-25 11:52:15 +00:00
Anatoly Baksheev
36ebf4ef0d
cuda+npp version check at configure time.
2010-11-25 10:42:09 +00:00
Alexey Spizhevoy
fa5422a2ad
added buf support into gpu::minMax
2010-11-25 10:19:06 +00:00
Alexey Spizhevoy
48183f10e7
optimized memory requirements for gpu::minMax's buffers, added support of compute capability 1.0
2010-11-25 09:57:02 +00:00
Anatoly Baksheev
7c77787c84
npp version check
2010-11-25 08:42:12 +00:00
Anatoly Baksheev
d6686240d5
added version detection for NPP
2010-11-24 18:44:11 +00:00
Alexey Spizhevoy
27690e3b6e
added minMaxLoc function into gpu module
2010-11-24 11:40:14 +00:00
Anatoly Baksheev
0e43976259
1) more convenient naming for samples gpu
...
2) added mask support to device 'transform' function
3) sample hog gpu: waitKey(1) -> waitKey(3), in other case image is not displayed.
2010-11-24 09:43:17 +00:00
Alexey Spizhevoy
68a7fc9868
optimized gpu::minMax a little
2010-11-24 09:19:11 +00:00
Alexey Spizhevoy
13d18d65a8
added support of multichannel matrices in gpu::minMax
2010-11-24 09:03:37 +00:00