Commit Graph

415 Commits

Author SHA1 Message Date
Anatoly Baksheev
047c7e0fd6 *applied patch from NVidia (nppstTraspose bug)
*fixed some warnings
*finished gpu test port to gtest framework
2011-02-18 12:23:18 +00:00
Alexey Spizhevoy
993773b74a now GPU stereo_bp test prints error mesage and test system continues execution 2011-02-17 11:09:29 +00:00
Vladislav Vinogradov
54fa600b9e update docs
minor fixes and refactoring of GPU module
2011-02-16 08:31:45 +00:00
Anatoly Baksheev
6b34532901 minor + warnings 2011-02-15 15:09:54 +00:00
Alexey Spizhevoy
2f8af6335e renamed GpuFeature into FeatureSet and updated docs 2011-02-15 13:56:59 +00:00
Alexey Spizhevoy
04709a2793 refactoring of GPU module 2011-02-15 13:25:24 +00:00
Anatoly Baksheev
6b6a63ba38 fixed some warnings
GPU: TargetArchs -> added FEATURE_SET prefix.
2011-02-15 10:54:49 +00:00
Alexey Spizhevoy
725d83b0e5 added init() function into MultiGpuMgr, added samples 2011-02-15 10:03:26 +00:00
Alexey Spizhevoy
5d4913a2ee fixed multi GPU API, added driver_api prefix to multi gpu samples 2011-02-15 09:12:48 +00:00
Vladislav Vinogradov
0821c7ad17 fixed descriptor calculation in SURF_GPU 2011-02-15 08:57:35 +00:00
Vladislav Vinogradov
deac5d972e fixed errors in gpu on old video cards (SURF_GPU, BruteForceMatcher_GPU, min/max, setTo, convertTo)
added assertion after all kernels calls
2011-02-14 15:50:17 +00:00
Alexey Spizhevoy
a42a42858c fixed GPU samples and MultiGpuMgr 2011-02-14 15:09:45 +00:00
Anatoly Baksheev
d7e612cd4b GPU: In strcut _scanElemOp dummy type pass by value 2011-02-14 13:30:00 +00:00
Alexey Spizhevoy
186e46fcca added draft version of MultiGpuMgr (it isn't tested on multi GPU machine yet) 2011-02-14 12:53:59 +00:00
Alexey Spizhevoy
a6aff1856b removed check disabling GPU face detection under Linux, i've checked -- it works 2011-02-10 13:01:36 +00:00
Alexey Spizhevoy
63806c9ab9 renamed gpu::DeviceInfo::has into gpu::DeviceInfo::supports 2011-02-09 12:31:05 +00:00
Vladislav Vinogradov
924670d32c fixed block size calculation in SURF_GPU (fasthessian_gpu and nonmaxonly_gpu kernels) 2011-02-09 09:11:11 +00:00
Anatoly Baksheev
7539b7de65 fixed build under ubuntu, but FS is still disabled 2011-02-07 13:47:10 +00:00
Vladislav Vinogradov
5cd06d6a36 fixed SURF_GPU (fails on empty data)
added test for SURF_GPU and reprojectImageTo3D
2011-02-07 10:12:04 +00:00
Anatoly Baksheev
7b1c265563 fixed some compilation under ubuntu 2011-02-04 18:29:05 +00:00
Anatoly Baksheev
21f0d1e174 removed MS specific from FD code 2011-02-04 15:46:35 +00:00
Anatoly Baksheev
0747f2d863 1) NPP_staging as sources. Binaries removed.
2) NVidia tests for GPU
3) FD sample that uses NVidia's interface.
2011-02-04 15:15:25 +00:00
Alexey Spizhevoy
50429d8a3e fixed some warnings and errors under g++ 2011-02-03 14:51:58 +00:00
Alexey Spizhevoy
ea94b43541 added stereo_multi_gpu sample, cosmetic changes in multi_gpu sample 2011-02-03 12:02:39 +00:00
Alexey Spizhevoy
557dd39f03 fixed gpu::sum* on CC1.0, updated some tests 2011-02-02 07:23:55 +00:00
Alexey Spizhevoy
e303b0dd62 added CUDA_ARCH_FEATUERS define (for correct handling of BIN(PTX) cases) 2011-02-01 12:28:39 +00:00
Alexey Spizhevoy
16e74ab306 added buffered version of norm, updated performance sample and docs 2011-02-01 10:46:19 +00:00
Alexey Spizhevoy
3795142604 implemented gpu::norm via absSum, sqrSum, and minMax (removed norm_diff call), added support of other data types 2011-02-01 10:23:10 +00:00
Alexey Spizhevoy
ae529f4bc6 added absSum function 2011-01-31 14:37:03 +00:00
Alexey Spizhevoy
fa446e7e35 removed linear_filters_beta.cu as its functionality was moved into filters.cu 2011-01-31 13:31:59 +00:00
Vladislav Vinogradov
8274ed22e4 fixed gpu tests (BruteForceMatcher_GPU, divide, phase, cartToPolar, async)
minor code refactoring
2011-01-31 13:20:52 +00:00
Alexey Spizhevoy
7a29d96cf4 added buffered version of gpu::integral function and updated performance test (it still works too slow) 2011-01-31 10:42:33 +00:00
Alexey Spizhevoy
0af5356dbc updated code in case of the compilation without CUDA 2011-01-28 12:30:08 +00:00
Alexey Spizhevoy
575fd1fe4c reafactoring: replaced query device props functions with the DeviceInfo class 2011-01-28 11:59:26 +00:00
Alexey Spizhevoy
65b9f3bc10 fixed TargetArchs implementation in case when HAVE_CUDA=false, added initial structure for multi_gpu sample 2011-01-27 12:17:56 +00:00
Alexey Spizhevoy
891e2ff310 replaced has* methods in the GPU module with the TargetArchs monostate 2011-01-27 10:06:38 +00:00
Alexey Spizhevoy
13a6d0b92a fixed parsing GPU archs in BIN(PTX) format 2011-01-27 08:26:10 +00:00
Vladislav Vinogradov
eda8416358 fixed BruteForceMatcher_GPU (fails if input data is empty)
updated BruteForceMatcher_GPU test
2011-01-26 15:58:47 +00:00
Alexey Spizhevoy
310c483da8 added first version of gpu performance tests 2011-01-24 10:33:01 +00:00
Vladislav Vinogradov
811ba31897 added synchronization after NPP calls 2011-01-24 10:32:57 +00:00
Vladislav Vinogradov
8abdb3721f added gpu threshold. 2011-01-24 10:11:02 +00:00
Alexey Spizhevoy
e51b9021bc forgot to rename namespace 2011-01-21 11:31:45 +00:00
Alexey Spizhevoy
db41449be8 fixed errors under MacOS 2011-01-21 10:53:07 +00:00
Anatoly Baksheev
e3f3de84db warning fixed 2011-01-21 09:00:19 +00:00
Alexey Spizhevoy
01dafce1a1 fixed some bugs in GPU matrix reductions, removed <functional> into precomp.hpp 2011-01-21 07:43:11 +00:00
Alexey Spizhevoy
0da71a01ff fixed some GPU tests failing when compiled for 1.1(no doubles) and run on 1.3(with doubles) 2011-01-20 15:08:48 +00:00
Alexey Spizhevoy
9e48f64149 fixed parsing of CC in gpu module 2011-01-20 14:34:27 +00:00
Alexey Spizhevoy
574b3f94a1 updated gpu initialization functions, added compile-time error on CC 1.0 2011-01-20 14:13:07 +00:00
Alexey Spizhevoy
8779306800 updated main CMakeLists.txt gpu module section, now user can manage binary and intermediate code versions of the gpu module image
added more functions to check version of gpu code in runtime
2011-01-20 09:22:05 +00:00
Alexey Spizhevoy
4b8425dbb8 fixed minor bugs 2011-01-19 13:28:22 +00:00
Alexey Spizhevoy
186b1fc6ef refactored gpu module 2011-01-19 12:47:34 +00:00
Alexey Spizhevoy
90ae1e3aed refactored gpu module 2011-01-19 10:54:58 +00:00
Alexey Spizhevoy
1a0d41fb53 added checkPtxVersion into gpu module 2011-01-18 14:52:35 +00:00
Alexey Spizhevoy
cbb132ccb1 added ensureSizeIsEnough into gpu module, updated reduction methods 2011-01-18 12:36:01 +00:00
Alexey Spizhevoy
f3a2656808 added hasPtxFor and isCompatibleWith functions into gpu module, added docs for them 2011-01-18 12:01:28 +00:00
Anatoly Baksheev
cb63046dcf gpu face detetcion:
1) fixed bug with error codes (enum NppStStatus) shift.
2) added some asserts
2011-01-17 11:32:38 +00:00
Vladislav Vinogradov
349e0ece93 added docs for GPU Filter Engine 2011-01-13 13:48:58 +00:00
Anatoly Baksheev
1a94186195 First version of CascadeClassifier_GPU.
Only for VS2008 now.
Sample for it.
new NPP_staging for VS2008 only
2011-01-13 13:04:00 +00:00
Alexey Spizhevoy
4e23f37ff8 added docs for gpu::HOGDescriptor 2011-01-12 13:07:33 +00:00
Alexey Spizhevoy
4dfbf99dd5 cosmetic changes in gpu module, decreased matchTemplate test running time 2011-01-12 09:30:08 +00:00
Alexey Spizhevoy
d091ae5746 removed unnecessary code from gpu::matchTemplate 2011-01-12 06:49:03 +00:00
Alexey Spizhevoy
57f917d6f2 replaced adding constant in normalization with taking max 2011-01-11 10:15:46 +00:00
Alexey Spizhevoy
dc763e0250 updated normalization routine in the matchTemplate to avoid division by zero on black images (ticket #798), added test 2011-01-11 09:36:21 +00:00
Alexey Spizhevoy
ba32833c3f added missed include (for MSVC 2010) 2010-12-28 14:57:23 +00:00
Alexey Spizhevoy
be38864dd0 added buffered version of gpu::convolve 2010-12-27 10:18:42 +00:00
Alexey Spizhevoy
1ecb6cf775 simplified gpu::columnSum test, it doesn't fail on Quadro anymore (when seed is 000001af5a11badd) after BFM test, but something definitely wrong with NPP_Staging's transpose 2010-12-27 09:10:22 +00:00
Alexey Spizhevoy
8f0d36b8b6 refactored gpu::dft 2010-12-27 07:35:41 +00:00
Alexey Spizhevoy
86802ec968 minor warning fixed 2010-12-24 13:44:26 +00:00
Alexey Spizhevoy
eaf35a8421 minor refactoring in gpu module 2010-12-24 12:55:43 +00:00
Alexey Spizhevoy
21b081deff now single row GPU matrix is continuous one, added aux. functions, updated dft and matchTemplates 2010-12-24 09:26:19 +00:00
Alexey Spizhevoy
6702d55711 added support of scaling into gpu::dft, refactored gpu::convolve 2010-12-24 06:48:23 +00:00
Alexey Spizhevoy
783716838d updated test for gpu::dft, updated dft for handling continous source 2010-12-23 13:00:33 +00:00
Alexey Spizhevoy
52ca0c4bca fixed bug in gpu::dft 2010-12-23 10:07:37 +00:00
Alexey Spizhevoy
09735fd208 added gpu::dft implemented via CUFFT 2010-12-23 09:24:33 +00:00
Alexey Spizhevoy
68aba9f2fb added mulSpectrums functions into GPU module 2010-12-22 13:46:06 +00:00
Alexey Spizhevoy
fef06c25b5 moved crossCorr (as NPP_Staging wrapper) into public GPU module part from the internal matchTemplate files 2010-12-22 08:56:16 +00:00
Alexey Spizhevoy
f9bcef9003 moved sqrIntegral (NPP_Staging wrapper) into public GPU module part from matchTemplate.cpp 2010-12-22 08:17:36 +00:00
Alexey Spizhevoy
a6d9cce500 added CUFFT errors handling into GPU module 2010-12-22 08:03:53 +00:00
Vladislav Vinogradov
8190837dd4 fixed some warnings under linux 2010-12-22 07:30:21 +00:00
Vladislav Vinogradov
0cd587ee34 added gpu transpose and integral based on NPP Staging.
added mask support to SURF_GPU.
2010-12-21 14:02:09 +00:00
Vladislav Vinogradov
5bfb44f887 fixed compiler error under linux 2010-12-21 08:13:15 +00:00
Vladislav Vinogradov
0e45a637c4 added SURF_GPU.
added support of CV_32FC1 type to gpu copyMakeBorder.
2010-12-20 12:49:40 +00:00
Alexey Spizhevoy
df8529377b refactoring: moved gpu reduction-based functions into separated file 2010-12-20 09:51:25 +00:00
Alexey Spizhevoy
1922e50f19 refactoring: made gpu bitwise operations inline 2010-12-20 09:16:17 +00:00
Alexey Spizhevoy
0465b89e7e gpu module refactoring: moved per-element operations into separated file 2010-12-20 09:07:19 +00:00
Alexey Spizhevoy
6891a60149 added host code for gpu::matchTemplate (as NPP_staging was integrated) 2010-12-20 08:09:09 +00:00
Alexey Spizhevoy
e62bf3a2ae updated gpu bitwise operations 2010-12-20 08:06:13 +00:00
Anatoly Baksheev
0f30fe080f renamed constantspacebp -> stereocsbp line all other algs
created element_operations and matrix_reductions files for farther refactoring
2010-12-19 17:20:54 +00:00
Anatoly Baksheev
9dd4a22a5e temporary added NPP_staging, functionality from the library will be moved to NPP with next release. 2010-12-17 15:41:26 +00:00
Alexey Spizhevoy
7767038ef0 updated other gpu's bitwise operations 2010-12-17 12:48:04 +00:00
Alexey Spizhevoy
5132ce211b reimplemented gpu::bitwise_not operation, refactored gpu module 2010-12-17 12:22:51 +00:00
Alexey Spizhevoy
1a93412eca refactored bitwise operations in gpu module 2010-12-17 10:26:57 +00:00
Alexey Spizhevoy
74197c5b14 fixed link time error in gpu module 2010-12-16 08:46:47 +00:00
Alexey Spizhevoy
343c33d73e added support of CCOEFF_NORMED for multichannel images (8U) into gpu::matchTemplate 2010-12-16 08:10:31 +00:00
Alexey Spizhevoy
640af6623c added support of multichannel images into gpu::sqrSum 2010-12-15 16:32:56 +00:00
Alexey Spizhevoy
3db5b687f6 added support of multichannel images into gpu::matchTemplate for CCOEFF method 2010-12-15 16:04:10 +00:00
Alexey Spizhevoy
f56d9c340f added support of remaining image number of channels into gpu::sum 2010-12-15 15:28:35 +00:00
Alexey Spizhevoy
d8a7ff1e00 refactored gpu module, added vec math operators for uint, added support of 2 channel images into gpu::sum (removed support of double) 2010-12-15 15:12:32 +00:00
Alexey Spizhevoy
e5eec31be1 fixed minor bugs in gpu module 2010-12-15 12:10:30 +00:00
Alexey Spizhevoy
68c3018047 added support of multichannel images into gpu::matchTemplate (all methods except CCOEFF based), refactored 2010-12-15 11:22:37 +00:00
Alexey Spizhevoy
e7cf541f5f fixed bug in matchTemplate when template size is (1,1), refactored 2010-12-14 09:53:17 +00:00
Alexey Spizhevoy
1887b7d2e4 refactored matchTemplate.cu 2010-12-14 08:45:11 +00:00
Alexey Spizhevoy
ce47a37e6e added CUDA kernel for CV_TM_CCOEFF_NORMED matchTemplate method 2010-12-14 08:00:53 +00:00
Alexey Spizhevoy
b35aa77418 added CUDA kernel for CV_TM_CCOEFF matchTemplate method 2010-12-14 07:42:55 +00:00
Alexey Spizhevoy
39700c5d54 added some gpu::matchTemplate kernels (other parts after NPP Staging integration) 2010-12-13 16:48:34 +00:00
Alexey Spizhevoy
a81b41fb08 fixed some warning under Ubuntu in gpu module 2010-12-13 15:56:29 +00:00
Alexey Spizhevoy
c9f9f38777 added gpu::sqrSum function 2010-12-13 14:34:02 +00:00
Vladislav Vinogradov
b18a3a5f83 fixed errors in StereoBeliefPropogation under linux 2010-12-13 13:52:40 +00:00
Anatoly Baksheev
070d87fb7f fixed bug with prefilter_xsobel preset. 2010-12-13 13:42:35 +00:00
Alexey Spizhevoy
e193fa1165 fixed minor bug in gpu::sum 2010-12-13 12:50:52 +00:00
Alexey Spizhevoy
3997514b7c added tests for gpu::sum, it supports all data types, but single channel images only 2010-12-13 12:00:58 +00:00
Vladislav Vinogradov
442cd75c32 fixed compile error under linux 2010-12-13 09:50:32 +00:00
Vladislav Vinogradov
49ec8ba742 fixed bug in gpu filter engine (incorrect buffer type) and in vector's saturate_cast.
changed buffer type in linear filters to float.
added support of 1 channel image to linear filters.
added support of BORDER_REFLECT101, BORDER_REPLICATE and BORDER_CONSTANT border type to gpu linear filters.
minor fix in tests.
update comments in gpu.hpp.
2010-12-13 08:43:04 +00:00
Anatoly Baksheev
6c2cce6e28 some files rename 2010-12-11 15:21:11 +00:00
Anatoly Baksheev
ec4d2b6bb9 Fixed serious kernel crash IN StereoBM_GPU for some parameter. Will more rework for the code farther. 2010-12-11 15:07:14 +00:00
Alexey Spizhevoy
79b1eec3d3 minor refactoring of gpu module 2010-12-10 14:02:41 +00:00
Alexey Spizhevoy
9f80317ffa fixed minor bug in gpu module, added first version of sum 2010-12-10 13:36:00 +00:00
Alexey Spizhevoy
d557c800a7 refactored gpu module 2010-12-10 10:23:32 +00:00
Alexey Spizhevoy
97484089c5 added support of CV_8U & CV_TM_CCORR into gpu::matchTemplate 2010-12-10 08:06:54 +00:00
Alexey Spizhevoy
9801d07a46 added test for gpu:columnSum 2010-12-08 16:51:12 +00:00
Alexey Spizhevoy
fa322bf46f added gpu::columnSum, fixed compile error (if there is no cuda), refactored 2010-12-08 15:06:10 +00:00
Alexey Spizhevoy
b1c5b9293e added support of CV_32F & CV_TM_SQDIFF into gpu::matchTemplate 2010-12-08 13:12:12 +00:00
Alexey Spizhevoy
40304721a7 added support of CV_TM_CCORR (via FFT) into gpu::matchTemplate (versions both with block and without blocks) 2010-12-08 13:03:53 +00:00
Vladislav Vinogradov
905e5f1739 added support of 4-channels images to StereoConstantSpaceBP.
refactored transpose_gpu, made it non template function.
2010-12-08 07:23:59 +00:00
Vladislav Vinogradov
c18aa438ec added support of 4 channels images to StereoBeliefPropagation, minor code refactoring. 2010-12-07 10:20:03 +00:00
Alexey Spizhevoy
349a9fac86 fixed compile-time warning under g++ 2010-12-07 07:16:13 +00:00
Anatoly Baksheev
652fb1212e module reorganization: added folder with pure device functions, cuda_shared.hpp renamed to internal_shared.hpp 2010-12-06 16:37:32 +00:00
Vladislav Vinogradov
fadd19b976 added assertion on Compute Capability >= 1.1 to BruteForceMatcher_GPU_base::radiusMatch 2010-12-06 14:51:47 +00:00
Alexey Spizhevoy
50167f6c26 added first version of gpu::matchTemplate, currently it works only with 8UC1 images and SQDIFF method 2010-12-06 14:19:41 +00:00
Vladislav Vinogradov
8891acb67a added BruteForceMatcher_GPU 2010-12-06 12:06:51 +00:00
Alexey Spizhevoy
77027f6075 refactored border interpolation in gpu module 2010-12-06 09:44:51 +00:00
Vladislav Vinogradov
17d9014373 added per-element min/max to gpu module.
fixed compile error in transform.
2010-12-06 08:10:11 +00:00
Anatoly Baksheev
d96c5ebb7d misprint 2010-12-06 08:03:33 +00:00
Alexey Spizhevoy
6ef4d9b1dd added sipport of BORDER_REPLICATE mode into gpu::corner* functions 2010-12-06 07:47:26 +00:00
Alexey Spizhevoy
103bbaf09c updated BrdReflect101 in gpu module 2010-12-03 13:28:49 +00:00
Alexey Spizhevoy
e75ca4b662 replaced global memory reads with texture memory reads in GPU's corner detectors 2010-12-03 13:11:14 +00:00
Alexey Spizhevoy
df48ad76b0 added support of gamama_correction parameter into gpu::HOGDescriptor, updated tests 2010-12-03 11:11:44 +00:00
Alexey Spizhevoy
2a2590bae2 replaced filter (from the nearest to linear) mode when resizing image in gpu::HOGDescriptor 2010-12-03 10:14:01 +00:00
Alexey Spizhevoy
68c41a6c6c minor fix in gpu's header 2010-12-02 09:29:49 +00:00
Alexey Spizhevoy
ab04a03621 added support of BORDER_REFLECT into gpu::cornerHarris and etc, added support of border extr. int linear filters 2010-12-02 09:07:13 +00:00
Alexey Spizhevoy
d9489bbe0c fixed bug in gpu::cornerHarris and gpu::cornerMinEigenVal 2010-12-01 10:00:47 +00:00
Vladislav Vinogradov
57f7678db1 added gpu transpose for CV_8UC4, CV_8SC4, CV_16SC2, CV_16UC2, CV_32SC1 and CV_32FC1 types 2010-12-01 07:00:50 +00:00
Alexey Spizhevoy
d0dfae5143 fixed bug in ms segmentation 2010-11-30 13:02:24 +00:00
Alexey Spizhevoy
32a9b63d2f added better threads configuration estimator for the minMax, minMaxLoc, countNonZero functions in gpu module 2010-11-30 12:27:21 +00:00
Alexey Spizhevoy
493d7a6e45 added min eigen val based corner detector into gpu module 2010-11-30 08:44:04 +00:00
Alexey Spizhevoy
9adfc2cadc added Harris corner detector into gpu module 2010-11-30 08:04:37 +00:00
Alexey Spizhevoy
b260382840 added masks support into gpu::minMaxLoc 2010-11-29 13:56:43 +00:00
Alexey Spizhevoy
678f392569 added is_signed into numeric_limits_gpu, fixed incorrect min max finding for floating values 2010-11-29 13:21:43 +00:00
Alexey Spizhevoy
6ad158dbe1 refactored gpu::minMaxLoc 2010-11-29 10:44:25 +00:00
Alexey Spizhevoy
437ac1a2f1 added mask support into gpu::minMax 2010-11-29 10:29:21 +00:00