Commit Graph

805 Commits

Author SHA1 Message Date
Vladislav Vinogradov
bb23288c3d fixed bug in SURF_GPU 2012-01-30 07:59:14 +00:00
Vladislav Vinogradov
d38ca09738 added gpu::printCudaDeviceInfo to all samples 2012-01-30 07:20:09 +00:00
Vladislav Vinogradov
3ec2f7f5f0 fixed gpu::device::transform with non-aligned sources (Ticket #1567) 2012-01-27 13:10:30 +00:00
Vladislav Vinogradov
9304d32b82 disabled unnecessary code (it can't be compiled with VS 2008 Ticket #1561) 2012-01-26 12:52:32 +00:00
Vladislav Vinogradov
ddf56fa629 added additional error check 2012-01-25 07:32:35 +00:00
Vladislav Vinogradov
5e1808c797 fixed bug in gpu::Canny 2012-01-23 07:41:39 +00:00
Vladislav Vinogradov
1d1da9c5d6 added support of BORDER_REFLECT to gpu::cornerHarris and gpu::cornerMinEigenVal 2012-01-23 07:14:45 +00:00
Vladislav Vinogradov
6c1de21a76 fixed bug in gpu::transformSmart with mask 2012-01-22 13:25:45 +00:00
Vladislav Vinogradov
cdae0743ab fix OpenGL render functions
fix createOpticalFlowNeedleMap
2012-01-18 08:27:08 +00:00
Vladislav Vinogradov
a25b027ef7 updated gpu optical_flow sample 2012-01-11 13:05:03 +00:00
Vladislav Vinogradov
5860477605 fixed gpu::resize 2012-01-11 06:14:51 +00:00
Vladislav Vinogradov
af59a75ffc fixed bug with submatrix in some gpu functions
update gpu tests
2012-01-10 11:11:58 +00:00
Vladislav Vinogradov
eccfc90b77 bug fix 2011-12-28 07:56:19 +00:00
Vladislav Vinogradov
4fee5ef818 fixed build under linux 2011-12-27 10:30:54 +00:00
Vladislav Vinogradov
5e38cf8042 added FAST_GPU and ORB_GPU classes 2011-12-27 09:33:20 +00:00
Vladislav Vinogradov
b398ac7a96 added ImagePyramid class to gpu module 2011-12-27 06:24:31 +00:00
Vladislav Vinogradov
d13a6b74b2 Fixed bug with submatrix in device::transform 2011-12-21 05:59:14 +00:00
Anton Obukhov
ebc3043c86 [*] Fixed two bugs in reduction functor: out of shared memory bounds access and missing volatile on GF100 and further 2011-11-21 17:46:55 +00:00
Vladislav Vinogradov
c2783af781 added OpenGL support to highgui under WIN32 2011-11-21 11:58:52 +00:00
Vladislav Vinogradov
464b8a570c fixed compilation under linux 2011-11-14 14:57:36 +00:00
Vladislav Vinogradov
2695039a79 moved GpuMat's operations implementation to core module 2011-11-14 14:34:36 +00:00
Vladislav Vinogradov
0f53f2993e removed BEGIN_OPENCV_DEVICE_NAMESPACE macros 2011-11-14 09:02:06 +00:00
Vladislav Vinogradov
d926541311 fixed compilation error 2011-11-14 06:22:42 +00:00
Anton Obukhov
54337fd513 [+] CUDA path for NCVImagePyramid 2011-11-13 20:52:48 +00:00
Vladislav Vinogradov
21f5cc7e6e minor 2011-11-09 14:23:13 +00:00
Vladislav Vinogradov
b873fa818f fixed compilation under linux 2011-11-09 14:05:34 +00:00
Vladislav Vinogradov
8e061ac801 minor fix 2011-11-09 13:33:44 +00:00
Vladislav Vinogradov
fcfa72081e moved GpuMat and DevMem2D to core module, some code refactoring 2011-11-09 13:13:52 +00:00
Anton Obukhov
2cb9192604 [*] Fixed #1464
[~] NCVPyramid uses tr1 and thus can be compiled with CL, commented out on linux
[+] Moved reduction functors to NCVAlg
[*] Warnings in NCV
2011-11-06 09:43:17 +00:00
Anton Obukhov
3396a2ec1a [~] Refactored device reduction for better reuse (now with functor semantics) 2011-11-04 13:49:05 +00:00
Anton Obukhov
56531c6161 [*] Fixed #1461 2011-11-04 12:41:54 +00:00
Vladislav Vinogradov
09ba133ddf prepared GpuMat for moving to core module 2011-10-31 08:51:00 +00:00
Vladislav Vinogradov
0b4e7d6057 fixed remap under cc < 2.0 2011-10-31 08:50:14 +00:00
Anton Obukhov
5afb445283 [+] Pixel types via templates
[+] Color conversions stub via pixel types
[+] Pyramid calculation (required for mipmaps in CUDA 4.1)
[~] Changed C strings to C++ throughout NCV
[~] Fixed a couple of bugs in NCV
2011-10-27 10:13:28 +00:00
Alexey Spizhevoy
b74e9c101a Fixed bug in the stitching module (some images appeared at result pano center) 2011-10-24 11:52:36 +00:00
Vladislav Vinogradov
e7502e7641 implemented gpu::gemm via CUBLAS 2011-10-19 13:29:54 +00:00
Vladislav Vinogradov
90ff3dd990 added cublasSafeCall 2011-10-19 11:49:44 +00:00
Vladislav Vinogradov
40ee754e28 added WITH_CUFFT and WITH_CUBLAS flags to cmake scripts
fixed gpu module error reporting
added asynchronous version of some functions
2011-10-19 09:53:22 +00:00
Vladislav Vinogradov
b0536279eb added wrappers for BroxOpticalFlow and interpolateFrames 2011-10-17 13:12:39 +00:00
Vladislav Vinogradov
87f3451ec6 fixed warnings 2011-10-17 06:44:40 +00:00
Alexey Spizhevoy
8cebb0982b Updated optimal DFT size estimation in gpu::convolve() 2011-10-14 09:42:25 +00:00
Anatoly Baksheev
7076dfd7d6 gpu module: refactored devmem2d.hpp (atomic bomb) 2011-10-13 15:14:04 +00:00
Alexey Spizhevoy
916703c6e5 Updated optimal block size estimation in gpu::convolve() 2011-10-13 13:02:41 +00:00
Vladislav Vinogradov
631d4b483a minor fix 2011-10-12 11:36:10 +00:00
Vladislav Vinogradov
8f108b4e7e bug fix 2011-10-12 11:01:11 +00:00
Vladislav Vinogradov
d3c4e90760 new optimized implementation of BruteForceMatcher_GPU (~2-3x faster) 2011-10-12 10:23:03 +00:00
Vladislav Vinogradov
f38596b783 implemented brute force convolve for small kernel sizes 2011-10-10 11:58:47 +00:00
Anatoly Baksheev
ee768d4605 fixed bug in ptr in optflow.cu 2011-10-10 09:37:08 +00:00
Vladislav Vinogradov
ed749c4bfe added missing arithm operations to gpu module 2011-10-10 08:19:11 +00:00
Anatoly Baksheev
f444f9b43e optflow gpu compilation fix under vs2010 2011-10-04 17:37:05 +00:00
Alexey Spizhevoy
8a799aa89a Updated optimal block size estimation for the convolve() function 2011-10-03 14:05:52 +00:00
Anatoly Baksheev
923f771cfc warnings
optical flow gpu cosmetics
2011-10-03 09:26:00 +00:00
Alexey Spizhevoy
dce50b67fc Fixed support of translation in the GPU-based plane warper 2011-10-03 07:42:16 +00:00
Anatoly Baksheev
0fc8fc77c0 eliminated shread_ptr dependency from optical flow gpu. It caused compilation errors. 2011-09-30 11:27:30 +00:00
Anatoly Baksheev
b9ea3ef741 compilation under ubuntu 2011-09-30 09:02:03 +00:00
Anton Obukhov
42c7aece36 [+] Added Brox optical flow (implementation courtesy of Michael Smirnov) 2011-09-29 14:46:39 +00:00
Vladislav Vinogradov
4d3b1a4a02 fixed build for compute capability < 2.0 2011-09-28 16:24:51 +00:00
Vladislav Vinogradov
f536dfa439 fixed build under linux 2011-09-28 12:08:17 +00:00
Vladislav Vinogradov
9da6d78989 optimized gpu filters, added buffered version for different filters 2011-09-28 11:52:26 +00:00
Vladislav Vinogradov
ea4f65b570 implemented optimized version of bf_radius_match for train collection 2011-09-27 06:45:17 +00:00
Vladislav Vinogradov
b119833ad1 implemented optimized version of gpu::bf_radius_match 2011-09-26 11:18:30 +00:00
Vladislav Vinogradov
67a9b79433 added performance tests for gpu::reduce 2011-09-26 08:14:31 +00:00
Alexey Spizhevoy
48dec9c03a Fixed GPU version of multi-band blending in stitching module 2011-09-24 05:58:29 +00:00
Vladislav Vinogradov
8b23c79294 implemented gpu::reduce 2011-09-22 07:08:26 +00:00
Vladislav Vinogradov
0aaaad1ea8 implemented gpu::addWeighted 2011-09-21 08:58:54 +00:00
Vladislav Vinogradov
64119dd924 implemented gpu::copyMakeBorder for all border modes 2011-09-21 07:02:45 +00:00
Vladislav Vinogradov
5ed65979c1 fixed bug in gpu::HOGDescriptor::getBuffer (ticket #1379) 2011-09-20 06:38:40 +00:00
Alexey Spizhevoy
23636433d7 Updated bundle adjustment in stitching module: 1) it minimizes reprojection error now, 2) it minimizes error over focal, aspect, p.p.x, p.p.y parameters. Refactored and updated warpers. 2011-09-16 12:25:23 +00:00
Anatoly Baksheev
ffc5a7b809 minor 2011-09-15 05:51:41 +00:00
Vladislav Vinogradov
be8e31f14d minor gpu module refactoring: split big .cu files, disabled unnecessary template instantiation 2011-09-14 06:23:46 +00:00
Alexey Spizhevoy
a72da12c80 Added fix for 0 found features in stitching matcher 2011-09-13 14:15:18 +00:00
Vladislav Vinogradov
4b46e35287 minor 2011-09-12 15:15:02 +00:00
Vladislav Vinogradov
dac9f69ba4 fixed bug in SURF_GPU (memory access error) 2011-09-12 15:14:02 +00:00
Alexey Spizhevoy
485e71bcc2 Bug in multiply was fixed (by Vlad Vinogradov) 2011-09-12 14:58:56 +00:00
Vladislav Vinogradov
29b9720aab added check if src.size() == dst.size() to gpu::resize 2011-09-12 10:23:20 +00:00
Vladislav Vinogradov
b2079d6de1 implemented gpu::resize for all types 2011-09-12 10:13:14 +00:00
Vladislav Vinogradov
acac27d8fe optimized gpu::multiply 2011-09-12 08:45:56 +00:00
Alexey Spizhevoy
fd5b0c1f18 Reverted some unnecessary changes in GPU module 2011-09-10 12:30:28 +00:00
Alexey Spizhevoy
a7c6b84390 Added TODOs into GPU module 2011-09-10 12:28:03 +00:00
Alexey Spizhevoy
903f835d9f added support of 8UC4*32FC1 multiply into GPU module 2011-09-10 11:56:55 +00:00
Vladislav Vinogradov
df9f707ff0 fixed bug in gpu::remap under win32 2011-09-06 13:42:50 +00:00
Vladislav Vinogradov
84022779a1 added bicubic interpolation to gpu::remap 2011-09-06 07:01:23 +00:00
Vladislav Vinogradov
e0c0461787 optimized gpu::remap (use texture memory) 2011-09-05 07:59:59 +00:00
Vladislav Vinogradov
c26b005371 optimized gpu::remap (use texture memory if possible), added stream support to gpu::remap 2011-09-05 07:51:00 +00:00
Vladislav Vinogradov
b2d5839af7 added GpuFeature assertion to gpu::Canny 2011-09-05 06:20:34 +00:00
Vladislav Vinogradov
b371bd68df added assertion for small image sizes to SURF_GPU (ticket #1323) 2011-09-05 06:03:49 +00:00
Vladislav Vinogradov
ef430882a1 fixed build under linux 2011-09-05 05:40:57 +00:00
Vladislav Vinogradov
ea2f5b1391 added BORDER_REFLECT and BORDER_WRAP support to gpu module
switched to gpu::remap in opencv_stitching
2011-08-31 13:21:14 +00:00
Vladislav Vinogradov
47d68f6967 implemented gpu::remap for all types 2011-08-31 11:42:54 +00:00
Anatoly Baksheev
39373cd9f9 minor update of device layer 2011-08-30 09:08:41 +00:00
Anatoly Baksheev
c19f88853a minor update of device layer 2011-08-30 09:04:31 +00:00
Vladislav Vinogradov
0f5f57561e optimized memory usage in BruteForceMatcher_GPU_base::knnMatch when k==2 2011-08-30 08:49:11 +00:00
Vladislav Vinogradov
50d012b56b fixed gpu::pyrUp implementation 2011-08-29 11:32:18 +00:00
Vladislav Vinogradov
5f9e47a9cd refactored gpu::BruteForceMatcher (moved some utility functions to device layer) 2011-08-29 11:31:52 +00:00
Alexey Spizhevoy
50b72197ab fixed bug in gpu::matchTemplate (added normalization routine to make the GPU version consistent with the CPU one), added test cases from the ticket #1341 2011-08-29 08:06:45 +00:00
Anatoly Baksheev
e90aeb08f9 1) ticket #1332
2) device layer utility
2011-08-26 09:34:49 +00:00
Vladislav Vinogradov
173ac5a64b optimized gpu pyrDown and pyrUp 2011-08-24 11:16:42 +00:00
Vladislav Vinogradov
0ed6d86a49 optimized some gpu::cvtColor functions 2011-08-22 11:07:52 +00:00
Vladislav Vinogradov
971e35f283 fixed bug in gpu::cvtColor 2011-08-17 13:46:36 +00:00
Vladislav Vinogradov
5e9ae6b19f added TransformFunctorTraits, optimized some functions that use transform 2011-08-17 11:32:24 +00:00
Vladislav Vinogradov
6e3a1f7b49 implement optimized version of gpu::BruteForceMatcher::knnMatch when k == 2 2011-08-15 12:14:02 +00:00
Vladislav Vinogradov
bad7dadb75 added remap test, fix compilation error 2011-08-10 12:05:24 +00:00
Vladislav Vinogradov
8bb9e4302e added registerPageLocked/unregisterPageLocked functions
added convert functions to BruteForceMatcher_GPU
other minor fixes
2011-08-10 11:32:48 +00:00
Anatoly Baksheev
edcfa64d99 experimental kernels for cuda 2011-08-10 09:51:36 +00:00
Anatoly Baksheev
b535356ec7 experimental kernels for cuda 2011-08-10 09:35:57 +00:00
Vladislav Vinogradov
c5adaa717b minor stitching optimization (improve buffer reuse) 2011-08-09 13:08:54 +00:00
Vladislav Vinogradov
13e392763b fixed compilation error without cuda 2011-08-09 11:35:42 +00:00
Vladislav Vinogradov
36bfa6ea1c minor 2011-08-09 09:15:04 +00:00
Vladislav Vinogradov
b2b1d41da8 moved GpuMat class to separate header file 2011-08-09 07:51:48 +00:00
Vladislav Vinogradov
a9fdc1bdff added buffered version of cornerHarris, cornerMinEigenVal, histEven and histRange 2011-08-08 13:05:50 +00:00
Vladislav Vinogradov
42ced17c2c fixed some warnings under win64 2011-08-08 11:28:14 +00:00
Vladislav Vinogradov
767ac9aa10 added gpu::Canny function 2011-08-08 08:53:55 +00:00
Vladislav Vinogradov
dfaa8af6ee fixed #1279 2011-08-08 05:59:09 +00:00
Anatoly Baksheev
7e2fffb194 ticket #1280 2011-08-06 09:59:51 +00:00
Vladislav Vinogradov
3fac5d54ca minor fix 2011-08-03 12:30:40 +00:00
Vladislav Vinogradov
5f03548530 minor fix 2011-08-03 12:18:17 +00:00
Vladislav Vinogradov
3ab2728da1 gpu device layer code refactoring 2011-08-03 12:10:36 +00:00
Vladislav Vinogradov
6ff975af9e added gpu::equalizeHist function 2011-08-02 08:33:27 +00:00
Vladislav Vinogradov
33df5ea07b added gpu::calcHist function 2011-08-01 13:15:37 +00:00
Vladislav Vinogradov
e746b3e8ae added buffered version of pyrDown and pyrUp
added stream support to downsample, upsample, pyrUp and pyrDown
2011-08-01 08:15:31 +00:00
Anatoly Baksheev
cb9dd71f82 compilation 2011-07-22 13:26:31 +00:00
Anatoly Baksheev
de204fe413 2011-07-22 13:24:27 +00:00
Anatoly Baksheev
3a1beb1c01 added cv::gpu::pow, ticket #1227 2011-07-21 08:47:44 +00:00
Vadim Pisarevsky
ff5e97c8e4 added COVAR_ and SORT_ enums to core.hpp; fixed many, many VS2005, VS2010 and MinGW (GCC 4.5.2) warnings 2011-07-19 12:27:07 +00:00
Alexey Spizhevoy
abe34e3b33 removed debug func. call 2011-07-08 14:00:54 +00:00
Alexey Spizhevoy
e9837b0aa9 added support of old CC into GPU mean shift routines 2011-07-08 13:59:23 +00:00
Alexey Spizhevoy
ad454d83b9 added gpu::buildWarpPlaneMaps and gpu::buildWarpCylindricalMaps functions, integrated into stitching 2011-07-01 07:07:54 +00:00
Alexey Spizhevoy
5d703abdc2 fix for GPU module compile time (VS2010) error (from 2.3 branch) 2011-06-30 14:47:20 +00:00
Alexey Spizhevoy
674b763395 GPU: updated upsample, downsample functions, added pyrDown, pyrUp, added support of 16S filtering; put spherical warper on GPU (from opencv_stitching) 2011-06-30 14:39:48 +00:00
Vladislav Vinogradov
d1b4b5f0de refactored gpu module tests 2011-06-29 10:14:16 +00:00
Alexander Shishkov
f010539aaf Fixed MacOS GPU Compilation for i386 2011-06-16 15:02:58 +00:00
Anatoly Baksheev
74f1162a41 tabs 2011-06-16 10:33:32 +00:00
Vladislav Vinogradov
0d09352fca fixed gpu::cvtColor 2011-06-15 11:57:08 +00:00
Anton Obukhov
c6a7432e92 [*] Approach to the bug with integral image calculation on SM_2.0 (Fermi) 2011-06-14 17:34:00 +00:00
Vladislav Vinogradov
e05c488868 minor fix 2011-06-14 11:57:26 +00:00
Vladislav Vinogradov
1c1a61dd37 added __forceinline__ to device functions
fixed BFM warning ("cannot tell what pointer points to")
2011-06-14 11:27:32 +00:00
Anatoly Baksheev
8f4c7db3f6 ~40 warnings under VS2008
HAVE_CONFIG_H -> HAVE_CVCONFIG_H
2011-06-11 17:24:09 +00:00
Anatoly Baksheev
dc8572dc7b data_mov_utils.hpp 2011-06-11 10:40:55 +00:00
Vladislav Vinogradov
26dd71d981 fixed gpu::cvtColor according #5324 changes 2011-06-09 08:26:08 +00:00
Vladislav Vinogradov
f906c9b259 added resetDevice function, removed MultiGpuManager 2011-06-01 10:11:27 +00:00
Vladislav Vinogradov
926a6bba00 modified according to CUDA 4.0 API updates 2011-05-31 08:31:10 +00:00
Vladislav Vinogradov
331062360d fixed bug in SURF_GPU (compute descriptors, tid < 25) 2011-05-25 08:37:46 +00:00
Vladislav Vinogradov
6259520aa1 fixed bug in SURF_GPU (reduce_sum25) 2011-05-25 08:16:17 +00:00
Vladislav Vinogradov
15677d6d28 minor optimization of SURF_GPU (orientation calculation, changed block size to 32x4) 2011-05-24 08:02:39 +00:00
Vladislav Vinogradov
7fbcc1ec05 minor SURF_GPU optimization (descriptor calculation, change block size to 6x6) 2011-05-23 18:40:09 +00:00
Vladislav Vinogradov
ae6d5252ab minor fix 2011-05-23 08:46:27 +00:00
Vladislav Vinogradov
0b19f915be minor optimization of SURF_GPU (reduce memory transfers, use structure of arrays instead of array of structures) 2011-05-23 07:59:20 +00:00
Vadim Pisarevsky
23768b1826 replaced #include <some_opencv_hdr> with #include "some_opencv_hdr" (ticket #719) 2011-05-21 18:32:34 +00:00
Alexey Spizhevoy
7d350280c0 compilation under mac 2011-05-18 13:02:27 +00:00
Vladislav Vinogradov
34e2c78cec minor fix 2011-05-18 07:18:30 +00:00
Vladislav Vinogradov
f3b45af876 GpuMat::setTo optimization (call cudaMemcpy2D if possible) 2011-05-18 06:51:05 +00:00
Vladislav Vinogradov
f11efdced3 added support of Hamming distance to BruteForceMatcher_GPU 2011-05-16 08:38:27 +00:00
Vladislav Vinogradov
6a03be2632 added lightweight class DeviceBuffer to matrix_reductions.cpp 2011-05-11 07:30:46 +00:00
Vladislav Vinogradov
1c9f4e7ca2 fixed gpu::meanStdDev and gpu::norm under CUDA 4.0
fixed compilation under Win64
2011-05-10 12:39:12 +00:00
Vladislav Vinogradov
58e26313dd fixed bug in cv::LUT (multi-channel source array and single-channel table)
added missing cudaSafeCall
2011-05-10 06:11:03 +00:00
Anatoly Baksheev
1c18e5fef9 fixed ~300 warnings under windows (had to hack gtest a bit) 2011-05-06 21:45:48 +00:00
Vladislav Vinogradov
6200f388dd fixed bug in BruteForceMatcher_GPU::knnMatch (allDist buffer reuse) 2011-05-04 08:53:43 +00:00
Vladislav Vinogradov
c21a783646 added aligned memory allocation to CudaMem (if allocation type is ZERO_COPY)
fixed bugs in gpu::cvtColor
2011-05-03 09:09:05 +00:00
Anton Obukhov
b28c33f5dc [~] Added nppStSetActiveCUDAstream and nppStGetActiveCUDAstream to the export table 2011-04-25 15:35:33 +00:00
Alexey Spizhevoy
c9894f9655 fixed compile-time bug under Ubuntu 2011-04-25 08:38:54 +00:00
Anton Obukhov
0c325cace3 [~] Minor refactoring, clean-up
[+] Added 128-bit transpose
2011-04-24 21:39:08 +00:00
Alexey Spizhevoy
1a02877ab7 fixed gpu::downsample and gpu::blendLinear compite-time bugs under Ubuntu 2011-04-11 05:11:23 +00:00
Alexey Spizhevoy
97282d8ff8 added downsample function into gpu module, refactored it a little bit, added guard for CUDA related include in cascadeclassifier_nvidia_api.cpp 2011-04-08 08:04:56 +00:00
Alexey Spizhevoy
6cec5ff552 added blendLinear into gpu module 2011-04-08 05:21:47 +00:00
Anton Obukhov
07d19c2c6f [~] Refactored, cleaned up, and consolidated the code of GPU examples (cascadeclassifier and cascadeclassifier_nvidia_api) 2011-04-07 12:59:01 +00:00
Anton Obukhov
58476b64a6 [*] Fixed #974 ("GPU CascadeClassifier fails with some training files"): Moved IsNodeLeaf bit from NodeDescriptor to FeatureDescriptor for both left and right nodes, therefore from now on max number of rects in a feature is 31 2011-04-04 11:47:21 +00:00
Anatoly Baksheev
a69de4bf26 minor bug in hog (unbind texture) 2011-03-31 11:14:23 +00:00
Anatoly Baksheev
5f56b27616 added ROI support for HOG_GPU 2011-03-31 10:55:39 +00:00
Anatoly Baksheev
8b8ad17f4a fixed extra memory allocations. 2011-03-30 11:42:23 +00:00
Vladislav Vinogradov
57195e9627 fixed SURF_GPU bug (features count > max dimension of grid)
minor gpu docs fixes
2011-03-28 10:34:44 +00:00
Alexey Spizhevoy
ca1584dd1f fixed gpu::graphcut preconditions 2011-03-24 12:22:23 +00:00
Vladislav Vinogradov
9342c4b076 added upright parameter to SURF_GPU 2011-03-17 14:15:54 +00:00
Vladislav Vinogradov
fcff126086 fixed some warnings in surf.cu 2011-03-16 07:01:27 +00:00
Vladislav Vinogradov
ca803e12fd fixed mask calculation in SURF_GPU 2011-03-15 07:06:44 +00:00
Vladislav Vinogradov
d05c6b8b68 fixed surf.cu compilation on CC 2.0 2011-03-14 14:33:10 +00:00
Anatoly Baksheev
9214173c2c some includes moved to precomp.hpp for gpu module 2011-03-14 09:40:13 +00:00
Vladislav Vinogradov
58f6919795 made GPU version of SURF more consistent with CPU one 2011-03-10 13:53:58 +00:00
Alexey Spizhevoy
23416e3db5 make GPU version of solvePnPRansac more consistent with CPU one 2011-03-07 14:01:18 +00:00
Alexey Spizhevoy
1d62fddd31 updated solvePnpRansac performance test 2011-03-07 13:36:47 +00:00
Jose Luis Blanco
673061fb17 Fixed build errors in MSVC when building without CUDA. 2011-03-06 21:59:04 +00:00
Alexey Spizhevoy
4e6572acd9 removed test whether z is negative in gpu::solvePnpRansac (there is no need in this for tests passing) 2011-03-02 09:40:14 +00:00
Alexey Spizhevoy
74c398e6b7 refactored gpu::solvePnpRansac a bit 2011-03-02 08:04:08 +00:00
Alexey Spizhevoy
60e33921e3 added performance sample for solvePnpRansac + refactoring 2011-03-01 09:17:22 +00:00
Alexey Spizhevoy
5e85566477 added distortion coefs support into gpu::solvePnpRansac 2011-03-01 07:44:36 +00:00
Alexey Spizhevoy
eb8c0b8b4b parallelized hypotheses evaluation cycle in gpu::solvePnpRansac 2011-02-28 14:21:41 +00:00
Alexey Spizhevoy
cae59a7caf added gpu::solvePnpRansac 2011-02-28 12:44:19 +00:00
Vladislav Vinogradov
32a2fde8ac temporarily disabled compute descriptor kernel for new cards (some problems with threads synchronization), old version of kernels is used. 2011-02-22 09:27:42 +00:00
Alexey Spizhevoy
5b3d786e30 added asynchronous versions of transform- and projectPoints into the GPU module, added docs 2011-02-22 07:27:10 +00:00
Anatoly Baksheev
efe16c6f3e minor refactoring 2011-02-21 16:50:19 +00:00
Alexey Spizhevoy
289af671ee added projectPoints and transformPoints into GPU module 2011-02-21 10:58:05 +00:00
Anatoly Baksheev
047c7e0fd6 *applied patch from NVidia (nppstTraspose bug)
*fixed some warnings
*finished gpu test port to gtest framework
2011-02-18 12:23:18 +00:00
Alexey Spizhevoy
993773b74a now GPU stereo_bp test prints error mesage and test system continues execution 2011-02-17 11:09:29 +00:00
Vladislav Vinogradov
54fa600b9e update docs
minor fixes and refactoring of GPU module
2011-02-16 08:31:45 +00:00
Anatoly Baksheev
6b34532901 minor + warnings 2011-02-15 15:09:54 +00:00
Alexey Spizhevoy
2f8af6335e renamed GpuFeature into FeatureSet and updated docs 2011-02-15 13:56:59 +00:00
Alexey Spizhevoy
04709a2793 refactoring of GPU module 2011-02-15 13:25:24 +00:00
Anatoly Baksheev
6b6a63ba38 fixed some warnings
GPU: TargetArchs -> added FEATURE_SET prefix.
2011-02-15 10:54:49 +00:00
Alexey Spizhevoy
725d83b0e5 added init() function into MultiGpuMgr, added samples 2011-02-15 10:03:26 +00:00
Alexey Spizhevoy
5d4913a2ee fixed multi GPU API, added driver_api prefix to multi gpu samples 2011-02-15 09:12:48 +00:00
Vladislav Vinogradov
0821c7ad17 fixed descriptor calculation in SURF_GPU 2011-02-15 08:57:35 +00:00
Vladislav Vinogradov
deac5d972e fixed errors in gpu on old video cards (SURF_GPU, BruteForceMatcher_GPU, min/max, setTo, convertTo)
added assertion after all kernels calls
2011-02-14 15:50:17 +00:00
Alexey Spizhevoy
a42a42858c fixed GPU samples and MultiGpuMgr 2011-02-14 15:09:45 +00:00
Anatoly Baksheev
d7e612cd4b GPU: In strcut _scanElemOp dummy type pass by value 2011-02-14 13:30:00 +00:00
Alexey Spizhevoy
186e46fcca added draft version of MultiGpuMgr (it isn't tested on multi GPU machine yet) 2011-02-14 12:53:59 +00:00
Alexey Spizhevoy
a6aff1856b removed check disabling GPU face detection under Linux, i've checked -- it works 2011-02-10 13:01:36 +00:00
Alexey Spizhevoy
63806c9ab9 renamed gpu::DeviceInfo::has into gpu::DeviceInfo::supports 2011-02-09 12:31:05 +00:00
Vladislav Vinogradov
924670d32c fixed block size calculation in SURF_GPU (fasthessian_gpu and nonmaxonly_gpu kernels) 2011-02-09 09:11:11 +00:00
Anatoly Baksheev
7539b7de65 fixed build under ubuntu, but FS is still disabled 2011-02-07 13:47:10 +00:00
Vladislav Vinogradov
5cd06d6a36 fixed SURF_GPU (fails on empty data)
added test for SURF_GPU and reprojectImageTo3D
2011-02-07 10:12:04 +00:00
Anatoly Baksheev
7b1c265563 fixed some compilation under ubuntu 2011-02-04 18:29:05 +00:00
Anatoly Baksheev
21f0d1e174 removed MS specific from FD code 2011-02-04 15:46:35 +00:00
Anatoly Baksheev
0747f2d863 1) NPP_staging as sources. Binaries removed.
2) NVidia tests for GPU
3) FD sample that uses NVidia's interface.
2011-02-04 15:15:25 +00:00
Alexey Spizhevoy
50429d8a3e fixed some warnings and errors under g++ 2011-02-03 14:51:58 +00:00
Alexey Spizhevoy
ea94b43541 added stereo_multi_gpu sample, cosmetic changes in multi_gpu sample 2011-02-03 12:02:39 +00:00
Alexey Spizhevoy
557dd39f03 fixed gpu::sum* on CC1.0, updated some tests 2011-02-02 07:23:55 +00:00
Alexey Spizhevoy
e303b0dd62 added CUDA_ARCH_FEATUERS define (for correct handling of BIN(PTX) cases) 2011-02-01 12:28:39 +00:00
Alexey Spizhevoy
16e74ab306 added buffered version of norm, updated performance sample and docs 2011-02-01 10:46:19 +00:00
Alexey Spizhevoy
3795142604 implemented gpu::norm via absSum, sqrSum, and minMax (removed norm_diff call), added support of other data types 2011-02-01 10:23:10 +00:00
Alexey Spizhevoy
ae529f4bc6 added absSum function 2011-01-31 14:37:03 +00:00
Alexey Spizhevoy
fa446e7e35 removed linear_filters_beta.cu as its functionality was moved into filters.cu 2011-01-31 13:31:59 +00:00
Vladislav Vinogradov
8274ed22e4 fixed gpu tests (BruteForceMatcher_GPU, divide, phase, cartToPolar, async)
minor code refactoring
2011-01-31 13:20:52 +00:00
Alexey Spizhevoy
7a29d96cf4 added buffered version of gpu::integral function and updated performance test (it still works too slow) 2011-01-31 10:42:33 +00:00
Alexey Spizhevoy
0af5356dbc updated code in case of the compilation without CUDA 2011-01-28 12:30:08 +00:00
Alexey Spizhevoy
575fd1fe4c reafactoring: replaced query device props functions with the DeviceInfo class 2011-01-28 11:59:26 +00:00
Alexey Spizhevoy
65b9f3bc10 fixed TargetArchs implementation in case when HAVE_CUDA=false, added initial structure for multi_gpu sample 2011-01-27 12:17:56 +00:00
Alexey Spizhevoy
891e2ff310 replaced has* methods in the GPU module with the TargetArchs monostate 2011-01-27 10:06:38 +00:00
Alexey Spizhevoy
13a6d0b92a fixed parsing GPU archs in BIN(PTX) format 2011-01-27 08:26:10 +00:00
Vladislav Vinogradov
eda8416358 fixed BruteForceMatcher_GPU (fails if input data is empty)
updated BruteForceMatcher_GPU test
2011-01-26 15:58:47 +00:00
Alexey Spizhevoy
310c483da8 added first version of gpu performance tests 2011-01-24 10:33:01 +00:00
Vladislav Vinogradov
811ba31897 added synchronization after NPP calls 2011-01-24 10:32:57 +00:00
Vladislav Vinogradov
8abdb3721f added gpu threshold. 2011-01-24 10:11:02 +00:00
Alexey Spizhevoy
e51b9021bc forgot to rename namespace 2011-01-21 11:31:45 +00:00
Alexey Spizhevoy
db41449be8 fixed errors under MacOS 2011-01-21 10:53:07 +00:00
Anatoly Baksheev
e3f3de84db warning fixed 2011-01-21 09:00:19 +00:00
Alexey Spizhevoy
01dafce1a1 fixed some bugs in GPU matrix reductions, removed <functional> into precomp.hpp 2011-01-21 07:43:11 +00:00
Alexey Spizhevoy
0da71a01ff fixed some GPU tests failing when compiled for 1.1(no doubles) and run on 1.3(with doubles) 2011-01-20 15:08:48 +00:00
Alexey Spizhevoy
9e48f64149 fixed parsing of CC in gpu module 2011-01-20 14:34:27 +00:00
Alexey Spizhevoy
574b3f94a1 updated gpu initialization functions, added compile-time error on CC 1.0 2011-01-20 14:13:07 +00:00
Alexey Spizhevoy
8779306800 updated main CMakeLists.txt gpu module section, now user can manage binary and intermediate code versions of the gpu module image
added more functions to check version of gpu code in runtime
2011-01-20 09:22:05 +00:00
Alexey Spizhevoy
4b8425dbb8 fixed minor bugs 2011-01-19 13:28:22 +00:00
Alexey Spizhevoy
186b1fc6ef refactored gpu module 2011-01-19 12:47:34 +00:00
Alexey Spizhevoy
90ae1e3aed refactored gpu module 2011-01-19 10:54:58 +00:00
Alexey Spizhevoy
1a0d41fb53 added checkPtxVersion into gpu module 2011-01-18 14:52:35 +00:00
Alexey Spizhevoy
cbb132ccb1 added ensureSizeIsEnough into gpu module, updated reduction methods 2011-01-18 12:36:01 +00:00
Alexey Spizhevoy
f3a2656808 added hasPtxFor and isCompatibleWith functions into gpu module, added docs for them 2011-01-18 12:01:28 +00:00
Anatoly Baksheev
cb63046dcf gpu face detetcion:
1) fixed bug with error codes (enum NppStStatus) shift.
2) added some asserts
2011-01-17 11:32:38 +00:00
Vladislav Vinogradov
349e0ece93 added docs for GPU Filter Engine 2011-01-13 13:48:58 +00:00
Anatoly Baksheev
1a94186195 First version of CascadeClassifier_GPU.
Only for VS2008 now.
Sample for it.
new NPP_staging for VS2008 only
2011-01-13 13:04:00 +00:00
Alexey Spizhevoy
4e23f37ff8 added docs for gpu::HOGDescriptor 2011-01-12 13:07:33 +00:00
Alexey Spizhevoy
4dfbf99dd5 cosmetic changes in gpu module, decreased matchTemplate test running time 2011-01-12 09:30:08 +00:00
Alexey Spizhevoy
d091ae5746 removed unnecessary code from gpu::matchTemplate 2011-01-12 06:49:03 +00:00
Alexey Spizhevoy
57f917d6f2 replaced adding constant in normalization with taking max 2011-01-11 10:15:46 +00:00
Alexey Spizhevoy
dc763e0250 updated normalization routine in the matchTemplate to avoid division by zero on black images (ticket #798), added test 2011-01-11 09:36:21 +00:00
Alexey Spizhevoy
ba32833c3f added missed include (for MSVC 2010) 2010-12-28 14:57:23 +00:00
Alexey Spizhevoy
be38864dd0 added buffered version of gpu::convolve 2010-12-27 10:18:42 +00:00
Alexey Spizhevoy
1ecb6cf775 simplified gpu::columnSum test, it doesn't fail on Quadro anymore (when seed is 000001af5a11badd) after BFM test, but something definitely wrong with NPP_Staging's transpose 2010-12-27 09:10:22 +00:00
Alexey Spizhevoy
8f0d36b8b6 refactored gpu::dft 2010-12-27 07:35:41 +00:00
Alexey Spizhevoy
86802ec968 minor warning fixed 2010-12-24 13:44:26 +00:00
Alexey Spizhevoy
eaf35a8421 minor refactoring in gpu module 2010-12-24 12:55:43 +00:00
Alexey Spizhevoy
21b081deff now single row GPU matrix is continuous one, added aux. functions, updated dft and matchTemplates 2010-12-24 09:26:19 +00:00
Alexey Spizhevoy
6702d55711 added support of scaling into gpu::dft, refactored gpu::convolve 2010-12-24 06:48:23 +00:00
Alexey Spizhevoy
783716838d updated test for gpu::dft, updated dft for handling continous source 2010-12-23 13:00:33 +00:00
Alexey Spizhevoy
52ca0c4bca fixed bug in gpu::dft 2010-12-23 10:07:37 +00:00
Alexey Spizhevoy
09735fd208 added gpu::dft implemented via CUFFT 2010-12-23 09:24:33 +00:00
Alexey Spizhevoy
68aba9f2fb added mulSpectrums functions into GPU module 2010-12-22 13:46:06 +00:00
Alexey Spizhevoy
fef06c25b5 moved crossCorr (as NPP_Staging wrapper) into public GPU module part from the internal matchTemplate files 2010-12-22 08:56:16 +00:00
Alexey Spizhevoy
f9bcef9003 moved sqrIntegral (NPP_Staging wrapper) into public GPU module part from matchTemplate.cpp 2010-12-22 08:17:36 +00:00
Alexey Spizhevoy
a6d9cce500 added CUFFT errors handling into GPU module 2010-12-22 08:03:53 +00:00
Vladislav Vinogradov
8190837dd4 fixed some warnings under linux 2010-12-22 07:30:21 +00:00
Vladislav Vinogradov
0cd587ee34 added gpu transpose and integral based on NPP Staging.
added mask support to SURF_GPU.
2010-12-21 14:02:09 +00:00
Vladislav Vinogradov
5bfb44f887 fixed compiler error under linux 2010-12-21 08:13:15 +00:00
Vladislav Vinogradov
0e45a637c4 added SURF_GPU.
added support of CV_32FC1 type to gpu copyMakeBorder.
2010-12-20 12:49:40 +00:00
Alexey Spizhevoy
df8529377b refactoring: moved gpu reduction-based functions into separated file 2010-12-20 09:51:25 +00:00
Alexey Spizhevoy
1922e50f19 refactoring: made gpu bitwise operations inline 2010-12-20 09:16:17 +00:00
Alexey Spizhevoy
0465b89e7e gpu module refactoring: moved per-element operations into separated file 2010-12-20 09:07:19 +00:00
Alexey Spizhevoy
6891a60149 added host code for gpu::matchTemplate (as NPP_staging was integrated) 2010-12-20 08:09:09 +00:00
Alexey Spizhevoy
e62bf3a2ae updated gpu bitwise operations 2010-12-20 08:06:13 +00:00
Anatoly Baksheev
0f30fe080f renamed constantspacebp -> stereocsbp line all other algs
created element_operations and matrix_reductions files for farther refactoring
2010-12-19 17:20:54 +00:00
Anatoly Baksheev
9dd4a22a5e temporary added NPP_staging, functionality from the library will be moved to NPP with next release. 2010-12-17 15:41:26 +00:00
Alexey Spizhevoy
7767038ef0 updated other gpu's bitwise operations 2010-12-17 12:48:04 +00:00
Alexey Spizhevoy
5132ce211b reimplemented gpu::bitwise_not operation, refactored gpu module 2010-12-17 12:22:51 +00:00
Alexey Spizhevoy
1a93412eca refactored bitwise operations in gpu module 2010-12-17 10:26:57 +00:00
Alexey Spizhevoy
74197c5b14 fixed link time error in gpu module 2010-12-16 08:46:47 +00:00
Alexey Spizhevoy
343c33d73e added support of CCOEFF_NORMED for multichannel images (8U) into gpu::matchTemplate 2010-12-16 08:10:31 +00:00
Alexey Spizhevoy
640af6623c added support of multichannel images into gpu::sqrSum 2010-12-15 16:32:56 +00:00
Alexey Spizhevoy
3db5b687f6 added support of multichannel images into gpu::matchTemplate for CCOEFF method 2010-12-15 16:04:10 +00:00
Alexey Spizhevoy
f56d9c340f added support of remaining image number of channels into gpu::sum 2010-12-15 15:28:35 +00:00
Alexey Spizhevoy
d8a7ff1e00 refactored gpu module, added vec math operators for uint, added support of 2 channel images into gpu::sum (removed support of double) 2010-12-15 15:12:32 +00:00
Alexey Spizhevoy
e5eec31be1 fixed minor bugs in gpu module 2010-12-15 12:10:30 +00:00
Alexey Spizhevoy
68c3018047 added support of multichannel images into gpu::matchTemplate (all methods except CCOEFF based), refactored 2010-12-15 11:22:37 +00:00
Alexey Spizhevoy
e7cf541f5f fixed bug in matchTemplate when template size is (1,1), refactored 2010-12-14 09:53:17 +00:00
Alexey Spizhevoy
1887b7d2e4 refactored matchTemplate.cu 2010-12-14 08:45:11 +00:00
Alexey Spizhevoy
ce47a37e6e added CUDA kernel for CV_TM_CCOEFF_NORMED matchTemplate method 2010-12-14 08:00:53 +00:00
Alexey Spizhevoy
b35aa77418 added CUDA kernel for CV_TM_CCOEFF matchTemplate method 2010-12-14 07:42:55 +00:00
Alexey Spizhevoy
39700c5d54 added some gpu::matchTemplate kernels (other parts after NPP Staging integration) 2010-12-13 16:48:34 +00:00
Alexey Spizhevoy
a81b41fb08 fixed some warning under Ubuntu in gpu module 2010-12-13 15:56:29 +00:00
Alexey Spizhevoy
c9f9f38777 added gpu::sqrSum function 2010-12-13 14:34:02 +00:00
Vladislav Vinogradov
b18a3a5f83 fixed errors in StereoBeliefPropogation under linux 2010-12-13 13:52:40 +00:00
Anatoly Baksheev
070d87fb7f fixed bug with prefilter_xsobel preset. 2010-12-13 13:42:35 +00:00
Alexey Spizhevoy
e193fa1165 fixed minor bug in gpu::sum 2010-12-13 12:50:52 +00:00
Alexey Spizhevoy
3997514b7c added tests for gpu::sum, it supports all data types, but single channel images only 2010-12-13 12:00:58 +00:00
Vladislav Vinogradov
442cd75c32 fixed compile error under linux 2010-12-13 09:50:32 +00:00
Vladislav Vinogradov
49ec8ba742 fixed bug in gpu filter engine (incorrect buffer type) and in vector's saturate_cast.
changed buffer type in linear filters to float.
added support of 1 channel image to linear filters.
added support of BORDER_REFLECT101, BORDER_REPLICATE and BORDER_CONSTANT border type to gpu linear filters.
minor fix in tests.
update comments in gpu.hpp.
2010-12-13 08:43:04 +00:00
Anatoly Baksheev
6c2cce6e28 some files rename 2010-12-11 15:21:11 +00:00
Anatoly Baksheev
ec4d2b6bb9 Fixed serious kernel crash IN StereoBM_GPU for some parameter. Will more rework for the code farther. 2010-12-11 15:07:14 +00:00
Alexey Spizhevoy
79b1eec3d3 minor refactoring of gpu module 2010-12-10 14:02:41 +00:00
Alexey Spizhevoy
9f80317ffa fixed minor bug in gpu module, added first version of sum 2010-12-10 13:36:00 +00:00
Alexey Spizhevoy
d557c800a7 refactored gpu module 2010-12-10 10:23:32 +00:00
Alexey Spizhevoy
97484089c5 added support of CV_8U & CV_TM_CCORR into gpu::matchTemplate 2010-12-10 08:06:54 +00:00
Alexey Spizhevoy
9801d07a46 added test for gpu:columnSum 2010-12-08 16:51:12 +00:00
Alexey Spizhevoy
fa322bf46f added gpu::columnSum, fixed compile error (if there is no cuda), refactored 2010-12-08 15:06:10 +00:00
Alexey Spizhevoy
b1c5b9293e added support of CV_32F & CV_TM_SQDIFF into gpu::matchTemplate 2010-12-08 13:12:12 +00:00
Alexey Spizhevoy
40304721a7 added support of CV_TM_CCORR (via FFT) into gpu::matchTemplate (versions both with block and without blocks) 2010-12-08 13:03:53 +00:00
Vladislav Vinogradov
905e5f1739 added support of 4-channels images to StereoConstantSpaceBP.
refactored transpose_gpu, made it non template function.
2010-12-08 07:23:59 +00:00
Vladislav Vinogradov
c18aa438ec added support of 4 channels images to StereoBeliefPropagation, minor code refactoring. 2010-12-07 10:20:03 +00:00
Alexey Spizhevoy
349a9fac86 fixed compile-time warning under g++ 2010-12-07 07:16:13 +00:00
Anatoly Baksheev
652fb1212e module reorganization: added folder with pure device functions, cuda_shared.hpp renamed to internal_shared.hpp 2010-12-06 16:37:32 +00:00
Vladislav Vinogradov
fadd19b976 added assertion on Compute Capability >= 1.1 to BruteForceMatcher_GPU_base::radiusMatch 2010-12-06 14:51:47 +00:00
Alexey Spizhevoy
50167f6c26 added first version of gpu::matchTemplate, currently it works only with 8UC1 images and SQDIFF method 2010-12-06 14:19:41 +00:00
Vladislav Vinogradov
8891acb67a added BruteForceMatcher_GPU 2010-12-06 12:06:51 +00:00
Alexey Spizhevoy
77027f6075 refactored border interpolation in gpu module 2010-12-06 09:44:51 +00:00
Vladislav Vinogradov
17d9014373 added per-element min/max to gpu module.
fixed compile error in transform.
2010-12-06 08:10:11 +00:00
Anatoly Baksheev
d96c5ebb7d misprint 2010-12-06 08:03:33 +00:00
Alexey Spizhevoy
6ef4d9b1dd added sipport of BORDER_REPLICATE mode into gpu::corner* functions 2010-12-06 07:47:26 +00:00
Alexey Spizhevoy
103bbaf09c updated BrdReflect101 in gpu module 2010-12-03 13:28:49 +00:00
Alexey Spizhevoy
e75ca4b662 replaced global memory reads with texture memory reads in GPU's corner detectors 2010-12-03 13:11:14 +00:00
Alexey Spizhevoy
df48ad76b0 added support of gamama_correction parameter into gpu::HOGDescriptor, updated tests 2010-12-03 11:11:44 +00:00
Alexey Spizhevoy
2a2590bae2 replaced filter (from the nearest to linear) mode when resizing image in gpu::HOGDescriptor 2010-12-03 10:14:01 +00:00
Alexey Spizhevoy
68c41a6c6c minor fix in gpu's header 2010-12-02 09:29:49 +00:00
Alexey Spizhevoy
ab04a03621 added support of BORDER_REFLECT into gpu::cornerHarris and etc, added support of border extr. int linear filters 2010-12-02 09:07:13 +00:00
Alexey Spizhevoy
d9489bbe0c fixed bug in gpu::cornerHarris and gpu::cornerMinEigenVal 2010-12-01 10:00:47 +00:00
Vladislav Vinogradov
57f7678db1 added gpu transpose for CV_8UC4, CV_8SC4, CV_16SC2, CV_16UC2, CV_32SC1 and CV_32FC1 types 2010-12-01 07:00:50 +00:00
Alexey Spizhevoy
d0dfae5143 fixed bug in ms segmentation 2010-11-30 13:02:24 +00:00
Alexey Spizhevoy
32a9b63d2f added better threads configuration estimator for the minMax, minMaxLoc, countNonZero functions in gpu module 2010-11-30 12:27:21 +00:00
Alexey Spizhevoy
493d7a6e45 added min eigen val based corner detector into gpu module 2010-11-30 08:44:04 +00:00
Alexey Spizhevoy
9adfc2cadc added Harris corner detector into gpu module 2010-11-30 08:04:37 +00:00
Alexey Spizhevoy
b260382840 added masks support into gpu::minMaxLoc 2010-11-29 13:56:43 +00:00
Alexey Spizhevoy
678f392569 added is_signed into numeric_limits_gpu, fixed incorrect min max finding for floating values 2010-11-29 13:21:43 +00:00
Alexey Spizhevoy
6ad158dbe1 refactored gpu::minMaxLoc 2010-11-29 10:44:25 +00:00
Alexey Spizhevoy
437ac1a2f1 added mask support into gpu::minMax 2010-11-29 10:29:21 +00:00
Alexey Spizhevoy
8c48f3be9d moved GPU's global counter reset from caller to the kernel's end 2010-11-29 08:09:54 +00:00
Alexey Spizhevoy
3da253a259 replaced one-threads tail reduce with one-block tail reduce in functions gpu: minMax, minMaxLoc 2010-11-29 08:04:39 +00:00
Alexey Spizhevoy
1066bd2fa2 removed double support (gpu: minMax, minMaxLoc, countNonZero) for CC which doesn't have native double support 2010-11-29 07:31:09 +00:00
Alexey Spizhevoy
72f020a8f3 added gpu::count_non_zero version for CC1.0, refactored gpu module a little 2010-11-29 07:18:11 +00:00
Alexey Spizhevoy
7e2cc1be1b added first version of gpu::countNonZero for all data types, it doesn't support compute capability 1.0 yet, also fixed some little bugs 2010-11-26 17:12:48 +00:00
Alexey Spizhevoy
bdaad9e1fe added support of buffers into gpu::minMaxLoc, reduced memory requirements, refactored 2010-11-26 07:50:11 +00:00
Alexey Spizhevoy
9ad7a1c927 fixed compile time errors in gpu module (if one compiles without CUDA) 2010-11-25 11:52:15 +00:00
Anatoly Baksheev
36ebf4ef0d cuda+npp version check at configure time. 2010-11-25 10:42:09 +00:00
Alexey Spizhevoy
fa5422a2ad added buf support into gpu::minMax 2010-11-25 10:19:06 +00:00
Alexey Spizhevoy
48183f10e7 optimized memory requirements for gpu::minMax's buffers, added support of compute capability 1.0 2010-11-25 09:57:02 +00:00