Alexander Alekhin
13c4a02157
ocl: low-level API to support OpenCL binary programs
2017-12-05 22:25:14 +03:00
Vadim Pisarevsky
5ce38e516e
Merge pull request #10223 from vpisarev:ocl_mac_fixes
...
* fixed OpenCL functions on Mac, so that the tests pass
* fixed compile warnings; temporarily disabled OCL branch of TV L1 optical flow on mac
* fixed other few warnings on macos
2017-12-05 13:32:28 +03:00
woody.chow
611cf8d86f
Use Eigen::SelfAdjointEigenSolver in cv::eigen
2017-12-05 02:40:55 +03:00
Alexander Alekhin
0595ab3eef
ocl: fix usage of invalid OpenCL cache on mixed 64/32-bit platforms
...
Observed during launch of 32/64-bit applications on Windows.
Added '32-bit' prefix for 32-bit OpenCL devices. No prefix on 64-bit configurations.
2017-12-01 14:20:18 +03:00
Vadim Pisarevsky
f5dba12762
Merge pull request #10180 from alalek:ocl_avoid_unnecessary_initialization
2017-11-29 11:42:22 +00:00
Roman Cattaneo
d381e499ea
Fixes Issue #10181
...
This PR fixes incorrect division by zero handling in template
specialization of `Div_SIMD` for type `double`.
2017-11-28 13:48:40 +01:00
Alexander Alekhin
0ed3209b00
ocl: avoid unnecessary loading/initializing OpenCL subsystem
...
If there are no OpenCL/UMat methods calls from application.
OpenCL subsystem is initialized:
- haveOpenCL() is called from application
- useOpenCL() is called from application
- access to OpenCL allocator: UMat is created (empty UMat is ignored) or UMat <-> Mat conversions are called
Don't call OpenCL functions if OPENCV_OPENCL_RUNTIME=disabled
(independent from OpenCL linkage type)
2017-11-28 14:02:42 +03:00
Alexander Alekhin
abad8977b6
Merge pull request #9247 from paroj:wrap_alog_rw
2017-11-28 10:35:33 +00:00
Pavel Rojtberg
6fbf0758bc
Python: wrap Algorithm::read and Algorithm::write
2017-11-27 17:04:56 +01:00
Alexander Alekhin
c4b158ff91
Merge pull request #10167 from alalek:ocl_fix_issue_contrib1467
2017-11-27 11:05:07 +00:00
Vadim Pisarevsky
a83c12c3d5
Merge pull request #10154 from alalek:ocl_cleanup_obsolete_cache
2017-11-27 10:04:04 +00:00
Alexander Alekhin
92b35e6758
ocl: fix null pointer access crash
2017-11-27 12:43:29 +03:00
Alexander Alekhin
c38620e966
ocl: update OpenCL runtime loader
...
- fix compilation on Apple (undefined 'oclpath')
- don't warn on OPENCV_OPENCL_RUNTIME=disabled
2017-11-24 19:18:22 +03:00
Alexander Alekhin
b6abf0d3f9
ocl: drop obsolete cache directories after upgrade of OpenCL driver
...
Entries with the same platform name, the same device name and with different driver versions
are assumed obsolete.
2017-11-24 17:02:28 +03:00
Alexander Alekhin
e5d1790b7b
Merge pull request #10018 from alalek:ocl_binary_cache
2017-11-23 13:37:32 +00:00
Alexander Alekhin
9db5cbf9a4
dnn: sync output/internals blobs back
2017-11-22 14:00:58 +03:00
Alexander Alekhin
8e6280fc8e
ocl: binary program cache
2017-11-22 12:56:38 +03:00
Alexander Alekhin
b389c1cfe7
core(ocl): syntax fix
2017-11-20 19:19:35 +03:00
Tomoaki Teshima
3cbe60cca2
Merge pull request #9753 from tomoaki0705:universalMatmul
...
* add accuracy test and performance check for matmul
* add performance tests for transform and dotProduct
* add test Core_TransformLargeTest for 8u version of transform
* remove raw SSE2/NEON implementation from matmul.cpp
* use universal intrinsic instead of raw intrinsic
* remove unused templated function
* add v_matmuladd which multiply 3x3 matrix and add 3x1 vector
* add v_rotate_left/right in universal intrinsic
* suppress intrinsic on some function and platform
* add pure SW implementation of new universal intrinsics
* add test for new universal intrinsics
* core: prevent memory access after the end of buffer
* fix perf tests
2017-11-20 15:56:53 +03:00
Maksim Shabunin
751cee8e67
Merge pull request #9907 from seiko2plus:vsxFixesImproves
2017-11-16 15:20:16 +00:00
Alexander Alekhin
981009ac1f
Merge pull request #9999 from mshabunin:fix-gcc72-warnings
2017-11-07 13:37:25 +00:00
Maksim Shabunin
184daa155f
Fixed minor issues reported by GCC 7.2
2017-11-03 18:06:39 +03:00
Alexander Alekhin
9c4f0a984f
ocl: drop CV_OclDbgAssert
2017-11-03 13:31:37 +03:00
Alexander Alekhin
8fb48c09f7
ocl: improve debug information
2017-11-03 13:31:37 +03:00
Alexander Alekhin
7db612a545
core(ocl): fix parameters for 'intelblas_gemm_buffer_NT' kernel
2017-11-02 12:50:32 +03:00
Alexander Alekhin
bc9c9abab0
Merge pull request #9877 from mapreri:non-linux
2017-10-30 15:41:36 +00:00
Alexander Alekhin
7809c4156f
core(ocl): workaround CL_OUT_OF_RESOURCES error
...
Flush deallocation queue before calling map/unmap
2017-10-30 17:54:56 +03:00
Sayed Adel
def444d99f
core: Several improvements to Power/VSX
...
- changed behavior of vec_ctf, vec_ctu, vec_cts
in gcc and clang to make them compatible with XLC
- implemented most of missing conversion intrinsics in gcc and clang
- implemented conversions intrinsics of odd-numbered elements
- ignored gcc bug warning that caused by -Wunused-but-set-variable in rare cases
- replaced right shift with algebraic right shift for signed vectors
to shift in the sign bit.
- added new universal intrinsics v_matmuladd, v_rotate_left/right
- avoid using floating multiply-add in RNG
2017-10-28 17:46:12 +00:00
Maksim Shabunin
bf418ba342
Merge pull request #9917 from alalek:ocl_cache_program_failures
2017-10-23 12:25:25 +00:00
Alexander Alekhin
734ea77c9a
ocl(macosx): fix CL_INVALID_BUILD_OPTIONS for gemm programs
...
MacOSX OpenCL compiler is very strict to whitespace issues
2017-10-23 13:56:11 +03:00
Alexander Alekhin
d96cac1341
ocl: cache program build failures
...
To prevent unnecessary compiler invocations
2017-10-23 13:46:56 +03:00
Alexander Alekhin
185faf99bd
ocl: simplify ocl::Timer interface
2017-10-18 16:01:21 +03:00
Mattia Rizzolo
d026d7dcfb
Fix build for non-linux architectures but still glibc-based
...
Exampls of these are gnu/kfreebsd and gnu/hurd, both available as
unofficial Debian ports.
They don't define __linux__ (as they are non-linux…) but still define
__GLIBC__, so check on that.
Signed-off-by: Mattia Rizzolo <mattia@mapreri.org>
2017-10-17 00:45:27 +02:00
Vadim Pisarevsky
1563300197
Merge pull request #9833 from tomoaki0705:universalMathFuncs
2017-10-16 10:46:56 +00:00
Gregory Morse
d30a0c6f03
Merge pull request #9856 from GregoryMorse:patch-1
...
* Update OpenCVCompilerOptimizations.cmake
Neon not supported on MSVC ARM breaking build fix
* Update OpenCVCompilerOptimizations.cmake
Whitespace
* Update intrin.hpp
Many problems in MSVC ARM builds (at least on VS2017) being fixed in this PR now.
C:\Users\Gregory\DOCUME~1\MYLIBR~1\OPENCV~3\opencv\sources\modules\core\include\opencv2/core/hal/intrin.hpp(444): error C3861: '_tzcnt_u32': identifier not found
* Update hal_replacement.hpp
Passing variadic expansion in a macro to another macro does not work properly in MSVC and a famous known workaround is hereby applied. Discussion of it: https://stackoverflow.com/questions/5134523/msvc-doesnt-expand-va-args-correctly
Only needed the fix for ARM builds: TEGRA_ macros are used for cv_hal_ functions in the carotene library.
C:\Users\Gregory\Documents\My Libraries\opencv330\opencv\sources\modules\core\src\arithm.cpp(2378): warning C4003: not enough actual parameters for macro 'TEGRA_ADD'
C:\Users\Gregory\Documents\My Libraries\opencv330\opencv\sources\modules\core\src\arithm.cpp(2378): error C2143: syntax error: missing ')' before ','
C:\Users\Gregory\Documents\My Libraries\opencv330\opencv\sources\modules\core\src\arithm.cpp(2378): error C2059: syntax error: ')'
* Update hal_replacement.hpp
All hal_replacement's using carotene\hal\tegra_hal.hpp TEGRA_ functions as macros preprocessed by variadic macros should be changed, identical as was done in core.
C:\Users\Gregory\Documents\My Libraries\opencv330\opencv\sources\modules\imgproc\src\color.cpp(9604): warning C4003: not enough actual parameters for macro 'TEGRA_CVTBGRTOBGR'
C:\Users\Gregory\Documents\My Libraries\opencv330\opencv\sources\modules\imgproc\src\color.cpp(9604): error C2059: syntax error: '=='
* Update OpenCVCompilerOptimizations.cmake
* Update hal_replacement.hpp
* Update hal_replacement.hpp
2017-10-16 12:12:35 +03:00
Tomoaki Teshima
2a781bb616
remove raw SSE2/NEON implementation from mathfuncs.cpp
...
* replace the implementation by universal intrinsic
* make sure no degradation happens on ARM platform
2017-10-15 00:24:31 +09:00
Alexander Alekhin
b0c6bd0a5b
build: resolve naming issue
2017-10-12 13:28:30 +03:00
Vadim Pisarevsky
7d55c09a9f
Merge pull request #9763 from seiko2plus:addVsxCore
2017-10-10 10:00:31 +00:00
Vadim Pisarevsky
bfb12acde6
Merge pull request #9792 from alalek:port_9776
2017-10-09 12:46:06 +00:00
Vadim Pisarevsky
44699c59b3
Merge pull request #9799 from alalek:ocl_program
2017-10-09 12:43:46 +00:00
Wu Zhiwen
dbe9ee0924
ocl: simplify ocl::Timer
...
Use clFinish to gurantee commands completed, instead of waiting for events.
Signed-off-by: Wu Zhiwen <zhiwen.wu@intel.com>
2017-10-09 13:48:38 +08:00
Sayed Adel
d077778074
Added support for VSX
2017-10-09 00:32:29 +00:00
Alexander Alekhin
6be25727ec
ocl: refactor program compilation
2017-10-08 19:55:01 +03:00
Alexander Alekhin
04b4495493
ocl: define ProgramSource before program
...
no changes in code
2017-10-08 19:55:01 +03:00
Igor Wodiany
ffb9554787
Extract code from scalarToRawData
...
The same code was repeated several time for different data types, so
it was extracted as a templated function to improve maintability and
make a code more clear.
2017-10-07 19:46:45 +01:00
Igor Wodiany
b638aa74d7
Fix a memory leak in the Mat copying constructor
...
Exception may be rasied inside the body of a copying constructor after
refcount has been increased, and beacause in the case of the exception
destrcutor is never called what causes memory leak. This commit adds a
workaround that calls the release() function before the exception is
thrown outside the contructor.
2017-10-07 10:50:17 +03:00
pengli
e340ff9c3a
Merge pull request #9114 from pengli:dnn_rebase
...
add libdnn acceleration to dnn module (#9114 )
* import libdnn code
Signed-off-by: Li Peng <peng.li@intel.com>
* add convolution layer ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* add pooling layer ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* add softmax layer ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* add lrn layer ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* add innerproduct layer ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* add HAVE_OPENCL macro
Signed-off-by: Li Peng <peng.li@intel.com>
* fix for convolution ocl
Signed-off-by: Li Peng <peng.li@intel.com>
* enable getUMat() for multi-dimension Mat
Signed-off-by: Li Peng <peng.li@intel.com>
* use getUMat for ocl acceleration
Signed-off-by: Li Peng <peng.li@intel.com>
* use CV_OCL_RUN macro
Signed-off-by: Li Peng <peng.li@intel.com>
* set OPENCL target when it is available
and disable fuseLayer for OCL target for the time being
Signed-off-by: Li Peng <peng.li@intel.com>
* fix innerproduct accuracy test
Signed-off-by: Li Peng <peng.li@intel.com>
* remove trailing space
Signed-off-by: Li Peng <peng.li@intel.com>
* Fixed tensorflow demo bug.
Root cause is that tensorflow has different algorithm with libdnn
to calculate convolution output dimension.
libdnn don't calculate output dimension anymore and just use one
passed in by config.
* split gemm ocl file
split it into gemm_buffer.cl and gemm_image.cl
Signed-off-by: Li Peng <peng.li@intel.com>
* Fix compile failure
Signed-off-by: Li Peng <peng.li@intel.com>
* check env flag for auto tuning
Signed-off-by: Li Peng <peng.li@intel.com>
* switch to new ocl kernels for softmax layer
Signed-off-by: Li Peng <peng.li@intel.com>
* update softmax layer
on some platform subgroup extension may not work well,
fallback to non subgroup ocl acceleration.
Signed-off-by: Li Peng <peng.li@intel.com>
* fallback to cpu path for fc layer with multi output
Signed-off-by: Li Peng <peng.li@intel.com>
* update output message
Signed-off-by: Li Peng <peng.li@intel.com>
* update fully connected layer
fallback to gemm API if libdnn return false
Signed-off-by: Li Peng <peng.li@intel.com>
* Add ReLU OCL implementation
* disable layer fusion for now
Signed-off-by: Li Peng <peng.li@intel.com>
* Add OCL implementation for concat layer
Signed-off-by: Wu Zhiwen <zhiwen.wu@intel.com>
* libdnn: update license and copyrights
Also refine libdnn coding style
Signed-off-by: Wu Zhiwen <zhiwen.wu@intel.com>
Signed-off-by: Li Peng <peng.li@intel.com>
* DNN: Don't link OpenCL library explicitly
* DNN: Make default preferableTarget to DNN_TARGET_CPU
User should set it to DNN_TARGET_OPENCL explicitly if want to
use OpenCL acceleration.
Also don't fusion when using DNN_TARGET_OPENCL
* DNN: refine coding style
* Add getOpenCLErrorString
* DNN: Use int32_t/uint32_t instread of alias
* Use namespace ocl4dnn to include libdnn things
* remove extra copyTo in softmax ocl path
Signed-off-by: Li Peng <peng.li@intel.com>
* update ReLU layer ocl path
Signed-off-by: Li Peng <peng.li@intel.com>
* Add prefer target property for layer class
It is used to indicate the target for layer forwarding,
either the default CPU target or OCL target.
Signed-off-by: Li Peng <peng.li@intel.com>
* Add cl_event based timer for cv::ocl
* Rename libdnn to ocl4dnn
Signed-off-by: Li Peng <peng.li@intel.com>
Signed-off-by: wzw <zhiwen.wu@intel.com>
* use UMat for ocl4dnn internal buffer
Remove allocateMemory which use clCreateBuffer directly
Signed-off-by: Li Peng <peng.li@intel.com>
Signed-off-by: wzw <zhiwen.wu@intel.com>
* enable buffer gemm in ocl4dnn innerproduct
Signed-off-by: Li Peng <peng.li@intel.com>
* replace int_tp globally for ocl4dnn kernels.
Signed-off-by: wzw <zhiwen.wu@intel.com>
Signed-off-by: Li Peng <peng.li@intel.com>
* create UMat for layer params
Signed-off-by: Li Peng <peng.li@intel.com>
* update sign ocl kernel
Signed-off-by: Li Peng <peng.li@intel.com>
* update image based gemm of inner product layer
Signed-off-by: Li Peng <peng.li@intel.com>
* remove buffer gemm of inner product layer
call cv::gemm API instead
Signed-off-by: Li Peng <peng.li@intel.com>
* change ocl4dnn forward parameter to UMat
Signed-off-by: Li Peng <peng.li@intel.com>
* Refine auto-tuning mechanism.
- Use OPENCV_OCL4DNN_KERNEL_CONFIG_PATH to set cache directory
for fine-tuned kernel configuration.
e.g. export OPENCV_OCL4DNN_KERNEL_CONFIG_PATH=/home/tmp,
the cache directory will be /home/tmp/spatialkernels/ on Linux.
- Define environment OPENCV_OCL4DNN_ENABLE_AUTO_TUNING to enable
auto-tuning.
- OPENCV_OPENCL_ENABLE_PROFILING is only used to enable profiling
for OpenCL command queue. This fix basic kernel get wrong running
time, i.e. 0ms.
- If creating cache directory failed, disable auto-tuning.
* Detect and create cache dir on windows
Signed-off-by: Li Peng <peng.li@intel.com>
* Refine gemm like convolution kernel.
Signed-off-by: Li Peng <peng.li@intel.com>
* Fix redundant swizzleWeights calling when use cached kernel config.
* Fix "out of resource" bug when auto-tuning too many kernels.
* replace cl_mem with UMat in ocl4dnnConvSpatial class
* OCL4DNN: reduce the tuning kernel candidate.
This patch could reduce 75% of the tuning candidates with less
than 2% performance impact for the final result.
Signed-off-by: Zhigang Gong <zhigang.gong@intel.com>
* replace cl_mem with umat in ocl4dnn convolution
Signed-off-by: Li Peng <peng.li@intel.com>
* remove weight_image_ of ocl4dnn inner product
Actually it is unused in the computation
Signed-off-by: Li Peng <peng.li@intel.com>
* Various fixes for ocl4dnn
1. OCL_PERFORMANCE_CHECK(ocl::Device::getDefault().isIntel())
2. Ptr<OCL4DNNInnerProduct<float> > innerProductOp
3. Code comments cleanup
4. ignore check on OCL cpu device
Signed-off-by: Li Peng <peng.li@intel.com>
* add build option for log softmax
Signed-off-by: Li Peng <peng.li@intel.com>
* remove unused ocl kernels in ocl4dnn
Signed-off-by: Li Peng <peng.li@intel.com>
* replace ocl4dnnSet with opencv setTo
Signed-off-by: Li Peng <peng.li@intel.com>
* replace ALIGN with cv::alignSize
Signed-off-by: Li Peng <peng.li@intel.com>
* check kernel build options
Signed-off-by: Li Peng <peng.li@intel.com>
* Handle program compilation fail properly.
* Use std::numeric_limits<float>::infinity() for large float number
* check ocl4dnn kernel compilation result
Signed-off-by: Li Peng <peng.li@intel.com>
* remove unused ctx_id
Signed-off-by: Li Peng <peng.li@intel.com>
* change clEnqueueNDRangeKernel to kernel.run()
Signed-off-by: Li Peng <peng.li@intel.com>
* change cl_mem to UMat in image based gemm
Signed-off-by: Li Peng <peng.li@intel.com>
* check intel subgroup support for lrn and pooling layer
Signed-off-by: Li Peng <peng.li@intel.com>
* Fix convolution bug if group is greater than 1
Signed-off-by: Li Peng <peng.li@intel.com>
* Set default layer preferableTarget to be DNN_TARGET_CPU
Signed-off-by: Li Peng <peng.li@intel.com>
* Add ocl perf test for convolution
Signed-off-by: Li Peng <peng.li@intel.com>
* Add more ocl accuracy test
Signed-off-by: Li Peng <peng.li@intel.com>
* replace cl_image with ocl::Image2D
Signed-off-by: Li Peng <peng.li@intel.com>
* Fix build failure in elementwise layer
Signed-off-by: Li Peng <peng.li@intel.com>
* use getUMat() to get blob data
Signed-off-by: Li Peng <peng.li@intel.com>
* replace cl_mem handle with ocl::KernelArg
Signed-off-by: Li Peng <peng.li@intel.com>
* dnn(build): don't use C++11, OPENCL_LIBRARIES fix
* dnn(ocl4dnn): remove unused OpenCL kernels
* dnn(ocl4dnn): extract OpenCL code into .cl files
* dnn(ocl4dnn): refine auto-tuning
Defaultly disable auto-tuning, set OPENCV_OCL4DNN_ENABLE_AUTO_TUNING
environment variable to enable it.
Use a set of pre-tuned configs as default config if auto-tuning is disabled.
These configs are tuned for Intel GPU with 48/72 EUs, and for googlenet,
AlexNet, ResNet-50
If default config is not suitable, use the first available kernel config
from the candidates. Candidate priority from high to low is gemm like kernel,
IDLF kernel, basick kernel.
* dnn(ocl4dnn): pooling doesn't use OpenCL subgroups
* dnn(ocl4dnn): fix perf test
OpenCV has default 3sec time limit for each performance test.
Warmup OpenCL backend outside of perf measurement loop.
* use ocl::KernelArg as much as possible
Signed-off-by: Li Peng <peng.li@intel.com>
* dnn(ocl4dnn): fix bias bug for gemm like kernel
* dnn(ocl4dnn): wrap cl_mem into UMat
Signed-off-by: Li Peng <peng.li@intel.com>
* dnn(ocl4dnn): Refine signature of kernel config
- Use more readable string as signture of kernel config
- Don't count device name and vendor in signature string
- Default kernel configurations are tuned for Intel GPU with
24/48/72 EUs, and for googlenet, AlexNet, ResNet-50 net model.
* dnn(ocl4dnn): swap width/height in configuration
* dnn(ocl4dnn): enable configs for Intel OpenCL runtime only
* core: make configuration helper functions accessible from non-core modules
* dnn(ocl4dnn): update kernel auto-tuning behavior
Avoid unwanted creation of directories
* dnn(ocl4dnn): simplify kernel to workaround OpenCL compiler crash
* dnn(ocl4dnn): remove redundant code
* dnn(ocl4dnn): Add more clear message for simd size dismatch.
* dnn(ocl4dnn): add const to const argument
Signed-off-by: Li Peng <peng.li@intel.com>
* dnn(ocl4dnn): force compiler use a specific SIMD size for IDLF kernel
* dnn(ocl4dnn): drop unused tuneLocalSize()
* dnn(ocl4dnn): specify OpenCL queue for Timer and convolve() method
* dnn(ocl4dnn): sanitize file names used for cache
* dnn(perf): enable Network tests with OpenCL
* dnn(ocl4dnn/conv): drop computeGlobalSize()
* dnn(ocl4dnn/conv): drop unused fields
* dnn(ocl4dnn/conv): simplify ctor
* dnn(ocl4dnn/conv): refactor kernelConfig localSize=NULL
* dnn(ocl4dnn/conv): drop unsupported double / untested half types
* dnn(ocl4dnn/conv): drop unused variable
* dnn(ocl4dnn/conv): alignSize/divUp
* dnn(ocl4dnn/conv): use enum values
* dnn(ocl4dnn): drop unused innerproduct variable
Signed-off-by: Li Peng <peng.li@intel.com>
* dnn(ocl4dnn): add an generic function to check cl option support
* dnn(ocl4dnn): run softmax subgroup version kernel first
Signed-off-by: Li Peng <peng.li@intel.com>
2017-10-02 15:38:00 +03:00
Vadim Pisarevsky
f646f61dad
Merge pull request #9752 from alalek:eigenNonSymmetric
2017-10-02 11:51:13 +00:00
Alexander Alekhin
1f18282f04
build: fix MSVS2017 warning
2017-10-01 10:48:35 +00:00
Alexander Alekhin
529632f8d0
core: cv::eigenNonSymmetric() via EigenvalueDecomposition
2017-10-01 07:45:32 +00:00
Alexander Alekhin
69103a9fab
Merge pull request #9740 from tomoaki0705:universalArithm
2017-09-29 14:17:01 +00:00
Tomoaki Teshima
e187219699
use universal intrinsic in arithm.cpp
...
* remove raw NEON/SSE2 implementation as much as possible
* replace them to universal intrinsic in InRange/Compare/AddWeighted
2017-09-29 08:33:15 +09:00
Vadim Pisarevsky
af8fc3e209
Merge pull request #9604 from alalek:ocl_kernel_profiling
2017-09-28 13:48:34 +00:00
Alexander Alekhin
0f0f5652fb
Merge pull request #9717 from alalek:fix_fma
2017-09-27 13:48:09 +00:00
Deric Crago
a0d7cf89f2
Fixed 'boundary' spelling
2017-09-26 23:42:48 -04:00
Alexander Alekhin
3dee92ec50
fix usage of CV_FMA3 macro
2017-09-26 17:23:54 +03:00
Rostislav Vasilikhin
945706a8e8
softfloat f64_sqrt bug fix
2017-09-25 15:52:56 +03:00
Alexander Alekhin
e5890fc92a
ocl: fix build
2017-09-19 15:34:36 +03:00
Alexander Alekhin
1283d62e49
ocl: Kernel::runProfiling()
2017-09-19 15:34:35 +03:00
Alexander Alekhin
d9ab31490c
ocl: profiling queue
2017-09-19 15:32:15 +03:00
Alexander Alekhin
2e68f89225
ocl: update kernel global size adjustment
...
Prevents 10000x1 => 10000x8 transformation after getContinuousSize() call
2017-09-18 19:04:46 +03:00
Vadim Pisarevsky
f4136679ea
Merge pull request #9551 from ChristofKaufmann:MultiChannelMask
2017-09-18 09:28:34 +00:00
Vadim Pisarevsky
73298ea875
Merge pull request #9642 from saskatchewancatch:psnr-checks
2017-09-18 09:24:03 +00:00
Christof Kaufmann
7ec59fc097
Revert changes of mean and meanStdDev
2017-09-17 21:00:28 +02:00
RAJ NATARAJAN
0e44ad5543
Removed unnecessary assertion
2017-09-16 19:55:45 -07:00
RAJ NATARAJAN
6da62f56fa
Add some useful input assertions to PSNR function
...
Input arrays must be depth CV_8U and of identical size.
2017-09-15 23:36:58 -06:00
Vadim Pisarevsky
32bb71d686
Merge pull request #9603 from alalek:ocl_device_extensions
2017-09-13 14:43:36 +00:00
Vadim Pisarevsky
622318aeae
Merge pull request #9561 from dkurt:fix_8693
2017-09-13 13:34:49 +00:00
Alexander Alekhin
169add5aa6
ocl: added cv::ocl::Device::isExtensionSupported() method
2017-09-10 20:32:30 +00:00
Alexander Alekhin
597b3f4456
Merge pull request #9580 from mshabunin:fix-static-6
2017-09-09 19:31:21 +00:00
Alexander Alekhin
26ad229bd6
Merge pull request #9590 from alalek:ocl_runtime_fix
2017-09-09 19:30:07 +00:00
Alexander Alekhin
44b75eb116
core: fix OpenCL runtime compilation with HAVE_OPENCL_STATIC
2017-09-08 12:43:33 +03:00
Maksim Shabunin
248e2c7d47
Fixed some issues found by static analysis
2017-09-08 12:22:12 +03:00
Pavel Vlasov
37ab318657
Compatibility improvement with old IPP versions (tested on 9.0.1);
...
Manual IPP dispatcher simplification;
2017-09-08 11:08:24 +03:00
Vadim Pisarevsky
d25cbaaba8
Merge pull request #9293 from sovrasov:assert_improvement
2017-09-07 11:17:42 +00:00
Dmitry Kurtaev
8b094755fa
Fix https://github.com/opencv/opencv/issues/8693
2017-09-05 23:33:18 +03:00
Christof Kaufmann
46a668c565
Add multi-channel mask support to mean, meanStdDev and setTo
...
This adds the possibility to use multi-channel masks for the functions
cv::mean, cv::meanStdDev and the method Mat::setTo. The tests have now a
probability to use multi-channel masks for operations that support them.
This also includes Mat::copyTo, which supported multi-channel masks
before, but there was no test confirming this.
2017-09-04 19:40:27 +02:00
Alexander Alekhin
0451629e22
core(persistence): resolve DMatch/KeyPoint problem
2017-08-31 19:35:48 +03:00
Vadim Pisarevsky
518c6ae8c6
Merge pull request #9327 from sovrasov:fs_free_on_error_fix
2017-08-28 20:25:34 +00:00
Zoltán Mizsei
6258ff36bc
Haiku build fix
2017-08-26 11:37:59 +02:00
Alexander Alekhin
603fa03ac6
Merge pull request #9441 from wzw-intel:delete_program
2017-08-25 12:03:27 +00:00
Wu Zhiwen
da3da84a20
ocl: Add a function to unload a run-time cached program
...
This function is the counterpart of "Context::getProg".
With this function, users have chance to unload a program
from global run-time cached programs, and save resource.
2017-08-25 08:42:11 +08:00
Alexander Alekhin
9c14a2f0aa
Merge pull request #9395 from lupustr3:pvlasov/icv2017u3_update
2017-08-24 11:48:53 +00:00
Alexander Alekhin
d0509f6702
Merge pull request #9449 from ribalda:ocv
2017-08-23 19:40:36 +00:00
Ricardo Ribalda Delgado
6fc5697950
ocl: Fix OpenCL library detection in Linux
...
OpenCL runtime does not require OpenCL development file (libOpenCL.so),
just the "run" library (so.1).
This patch searches for the run library (so.1) if the dev library (.so)
is not found.
Web search shows that this error has been present since at least 2015
http://answers.opencv.org/question/80532/haveopencl-return-false/
Signed-off-by: Ricardo Ribalda Delgado <ricardo.ribalda@gmail.com>
2017-08-23 16:38:06 +02:00
Pavel Vlasov
a57718e1ac
ICV2017u3 package update;
...
- Optimizations set change. Now IPP integrations will provide code for SSE42, AVX2 and AVX512 (SKX) CPUs only. For HW below SSE42 IPP code is disabled.
- Performance regressions fixes for IPP code paths;
- cv::boxFilter integration improvement;
- cv::filter2D integration improvement;
2017-08-23 14:24:43 +03:00
KUANG Fangjun
97ec91ad67
fix cv::CommandLineParser.
...
It should handle bool value not only of "true" but also of "TRUE" and "True".
2017-08-23 11:38:58 +02:00
Rostislav Vasilikhin
66b0651607
Merge pull request #9329 from savuor:softfloat_sincos
...
SoftFloat: added sin, cos and docs (#9329 )
* softfloat: comparison operators made inline, min() max() eps() isSubnormal() added
* softfloat: get/set sign/exp
* softfloat: get/set frac
* softfloat: tests rewritten with new tools
* softfloat: added pi(), sin(), cos()
* softfloat: more comments
* softfloat: updated sincos arg reduction
* softfloat: initial tests for sincos added
* softfloat: test works, code cleanup is pending
* softfloat: sincos argreduce rewritten
* softfloat: sincos refactored and simplified
* softfloat sincos: epsilons calibrated
* softfloat: junk code removed from tests
* softfloat: docs added
* inline comparisons undone; warning fixed
2017-08-15 09:23:26 +00:00
Alexander Alekhin
a048cb9f0d
Merge pull request #9338 from dkurt:fix_ocl
2017-08-14 12:56:07 +00:00
Alexander Alekhin
ca9a88785e
core(build): fix compilation of parallel.cpp (OpenMP configuration)
2017-08-14 11:42:49 +03:00
Alexander Alekhin
0ebabe17df
core: fix flag processing for nested regions in cv::parallel_for_()
2017-08-10 08:37:47 +00:00
Dmitry Kurtaev
41519d3ac0
Fixed some OpenCL interface bugs
2017-08-09 11:54:55 +03:00
Vladislav Sovrasov
9a10bdbae5
core: use new assert in matmul.cpp
2017-08-08 23:00:11 +03:00
Vladislav Sovrasov
5e68b28ad3
core: fix file not closed when exception in FS
2017-08-07 21:03:59 +03:00
Alexander Alekhin
9ca39821c8
core: divUp function
2017-08-03 19:51:45 +03:00
Alexander Alekhin
dcc63d7408
Merge pull request #9248 from alalek:alloc_refactoring
2017-08-03 16:25:29 +00:00
Alexander Alekhin
16fb74425e
ocl: fix program cache key
2017-07-31 17:24:08 +03:00
Alexander Alekhin
e58a778bd5
core(stat): disable IPP optimization in meanStdDev (cn > 1)
2017-07-31 14:09:18 +03:00
Alexander Alekhin
fffd0f5b68
Merge pull request #9241 from alalek:tlsSlotsSize
2017-07-30 09:53:39 +00:00
Alexander Alekhin
b46e741c95
core(alloc): drop unused code, use memalign() functions instead of hacks
...
valgrind provides better detection without memory buffer hacks
2017-07-27 18:10:41 +03:00
Alexander Alekhin
34f9c039c5
Merge pull request #9238 from alalek:valgrind_fixes
2017-07-27 14:33:01 +00:00
Alexander Alekhin
d35422b523
core(tls): hide assertions from Thread Sanitizer
2017-07-27 17:31:51 +03:00
Alexander Alekhin
68ef903a7c
core(tls): don't use tlsSlots without synchronization
2017-07-26 22:45:55 +03:00
Alexander Alekhin
bf0173bf38
ts: update valgrind suppressions
2017-07-26 17:21:45 +03:00
Alexander Alekhin
b4e300b78b
Merge pull request #9236 from dkurt:fix_json_bool
2017-07-26 13:08:13 +00:00
Alexander Alekhin
402a77e7f7
Merge pull request #9237 from alalek:fix_winrt_build
2017-07-26 10:42:49 +00:00
dkurt
583b327523
Fix JSON booleans without quotes
2017-07-26 12:51:06 +03:00
Alexander Alekhin
c512bf6c66
Merge pull request #9232 from dkurt:json_named_nodes
2017-07-25 15:56:03 +00:00
dkurt
3515f6ec33
Missed NAMED bit of JSON nodes tag
2017-07-25 13:39:32 +03:00
Alexander Alekhin
602f047fe8
build: replace WIN32 => _WIN32
2017-07-25 13:30:48 +03:00
Alexander Alekhin
7f3eea6325
core: fix Mat/UMat cleanup on exceptions in deallocate()
2017-07-25 12:27:30 +03:00
Alexander Alekhin
bc3c7e80a6
Merge pull request #9209 from alalek:fix_persistence_format
2017-07-21 10:55:40 +00:00
Alexander Alekhin
544eb4be1f
IPP: update minMaxIdx, disable some AVX optimizations with mask
2017-07-21 12:56:36 +03:00
Alexander Alekhin
ec7ce81401
core: fix FileStorage format detection in case of .gz archives
2017-07-20 19:58:36 +03:00
Alexander Alekhin
5bc291937f
test: FileStorage format regression test
2017-07-20 19:58:10 +03:00
Alexander Alekhin
e5ed9cc612
Merge pull request #8498 from savuor:bit_exact_lab
2017-07-17 14:01:05 +00:00
Alexander Alekhin
b4716b1d92
core: fix convertTo() AVX2 optimization
2017-07-17 15:02:14 +03:00
Rostislav Vasilikhin
4b75be801e
initial version of Lab2RGB_f tetrahedral interpolation written
...
RGB2Lab_f added, bugs fixed, moved to float
several bugs fixed
LUT fixed, no switch in tetraInterpolate()
temporary code; to be removed and rewritten
before refactoring
extra interpolations removed, some things to do left
added Lab2RGB_b +XYZ version, etc.
basic version is done, to be sped up
tetra refactored
interpolations: LUT for weights, refactor., etc.
address arithm optimized
initial version of vectorized code added (not compiling now)
compilation fixed, now segfaults
a lot of fixes, vectorization temp. disabled
fixed trilinear shift size, max error dropped from 19 to 10
fixed several bugs (255 vs 256, signed vs unsigned, bIdx)
minor changes
packed: address arithmetics fixed
shorter code
experiments with pure integer calculations
Lab2RGB max error decreased to 2; need to clean the code
ready for vectorization; need cleaning
vectorized, to be debugged
precision fixed, max error is 2
Lab->XYZ shortened
minor fixes
Lab2RGB_f version fixed, to be completely rewritten using _b code
RGB2Lab_f vectorized
minors
moved to separate file
refactored Lab2RGB to float and int versions
minor fix
Lab2RGB_f vectorized
minor refactoring
Lab2RGBint refactored: process methods, vectorize by 4 pix
Lab2RGB_f int version is done
cleanup extra code
code copied to color.cpp
fixed blue idx bug
optimizations enabled when testing; mulFracConst introduced
divConst -> mulFracConst
calc min time in perf instead of avg
minors
process() slightly sped up
Lab2RGB_f: disabled int version
reinterpret added, minor fixes in names
some warnings fixed
changes transferred to color.cpp
RGB2Lab_f code (and trilinear interpolation code) moved to rgb2lab_faster
whitespace
shift negative fixed
more warnings fixed
"constant condition" warnings fixed, little speed up
minor changes
test_photo decolor fixed
changes copied to test_lab.cpp
idx bounds checking in LUT init
several fixes
WIP: softfloat almost integrated
test_lab partially rewritten to SoftFloat
color.cpp rewritten to SoftFloat
test_lab.cpp: accuracy code added
several fixes
RGB2Lab_b testing fixed
splineBuild() rewritten to SoftFloat
accuracy control improved
rounding fixed
Luv <=> RGB: rewritten to SoftFloat
OCL cvtColor Lab and Lut rewritten to SoftFloat
minor fixes
refactored to new SoftFloat interface
round() -> cvRound, etc.
fixed OCL tests
softfloat.cpp: internal functions made static, unused ones removed
meaningful constants
extra lines removed
unused function removed
unfinished work
it works, need to fix TODOs
refactoring; more calls rewritten
mulFracConst removed
constants made bit exact; minors
changes moved to color.cpp
fixed 1 bug and 4 warnings
OCL: fixed constants
pow(x, _1_3f) replaced by cubeRoot(x)
fixed compilation on MSVC32
magic constants explained
file with internal accuracy&speed tests moved to lab_tetra branch
2017-07-17 00:32:30 +03:00
Alexander Alekhin
5a54acef4e
Merge pull request #9130 from alalek:android_define
2017-07-14 17:17:24 +00:00
Alexander Alekhin
11626fe32c
Merge pull request #8975 from sovrasov:fs_additional_errors
2017-07-14 17:13:50 +00:00
Alexander Alekhin
4e39d0371d
Merge pull request #9074 from alalek:cpu_dispatch_core_hamming
...
cpu dispatch(core): hamming
2017-07-14 16:48:07 +00:00
Alexander Alekhin
eef78f5664
Merge pull request #9061 from terfendail:convert_avx
...
AVX and SSE4.1 optimized conversion migrated
2017-07-14 16:43:54 +00:00
Alexander Alekhin
5ebfb52a4a
ipp(minmaxIdx): disable SSE4.2 optimizations for 32f datatype
...
NaN values handling issue
2017-07-12 16:06:18 +03:00
PkLab.net
6dd9e18b2e
add std::string overload for cv::read()
2017-07-12 15:51:11 +03:00
Vladislav Sovrasov
5b833db558
core: forbid conversion real->int in some cases in FileStorage
2017-07-12 15:50:57 +03:00
Maksim Shabunin
02db592014
Fixed several issues found by static analysis (Windows specific)
2017-07-10 23:14:02 +03:00
Alexander Alekhin
a4a47b538c
build: detect Android via '__ANDROID__' macro
...
https://sourceforge.net/p/predef/wiki/OperatingSystems
2017-07-10 12:43:59 +03:00
Alexander Alekhin
da8dbf6cf5
ocl: async cl_buffer cleanup queue (for event callback)
2017-07-07 13:41:20 +03:00
Alexander Alekhin
daee982106
ocl: rework events handling with clSetEventCallback
2017-07-06 13:25:32 +03:00
Vitaly Tuzov
5448d9186a
AVX and SSE4.1 optimized conversion implementations migrated to separate files
2017-07-04 14:48:01 +03:00
Alexander Alekhin
b66c349bba
core(stat): add required CV_AVX_GUARD
...
Added guard with 'vzeroupper' instruction
2017-07-02 22:45:10 +00:00
Alexander Alekhin
c45d3568ae
core(stat): register dispatched code, fix build
2017-07-02 22:45:10 +00:00
Alexander Alekhin
6a6222d21c
core(stat): remove useless checks
2017-07-02 22:45:10 +00:00
Alexander Alekhin
880052d3f3
core(stat): create dispatch.cpp file
2017-07-02 22:45:10 +00:00
Alexander Alekhin
85afbd409b
core(stat): move implementations into .hpp file w/o changes
2017-07-02 22:45:09 +00:00
Alexander Alekhin
03c3e0edcf
core(stat): stat.cpp minor refactoring
...
- remove unused code
- added: #if CV_ENABLE_UNROLLED in Hamming's functions
2017-07-02 22:45:09 +00:00
Maksim Shabunin
1f23202ad8
Issues found by static analysis (5th round)
2017-07-01 18:56:24 +03:00
Maksim Shabunin
e0393f8557
Fixed some issues found by static analysis (4th round)
2017-06-30 12:26:53 +03:00
Vadim Pisarevsky
2ac819018d
Merge pull request #9014 from sovrasov:compare_scalars_fix
2017-06-29 11:14:44 +00:00
Maksim Shabunin
a769d69a9d
Fixed several issues found by static analysis
2017-06-28 18:06:18 +03:00
Vladislav Sovrasov
35a1ecef2a
core: fix infinite recursion in compare
2017-06-28 15:00:52 +03:00
Maksim Shabunin
32d4af36e2
Fixing some static analysis issues
2017-06-27 14:30:26 +03:00
Alexander Alekhin
650830b9d6
build: eliminate warning
2017-06-27 08:16:40 +03:00
Vadim Pisarevsky
ef2e5a9f82
Merge pull request #8988 from sovrasov:repeat_src_eq_dst_fix
2017-06-26 21:58:26 +00:00
Rostislav Vasilikhin
e63feba8e2
fixed typo
2017-06-26 20:19:18 +03:00
Alexander Alekhin
006966e629
trace: initial support for code trace
2017-06-26 17:07:13 +03:00
Vladislav Sovrasov
4f9871817a
core: forbid handling of the case when src=dst in cv::repeat
2017-06-26 14:02:52 +03:00
Vadim Pisarevsky
fa7e7e0ff9
Merge pull request #8900 from alalek:update_android_build
2017-06-23 10:58:53 +00:00
Alexander Alekhin
3e3e2dd512
android: make optional "cpufeatures", build fixes for NDK r15
2017-06-21 13:34:19 +03:00
Alexander Alekhin
d3ebe665e0
core: fix IPP optimization for sortIdx
2017-06-21 03:04:16 +00:00
Rostislav Vasilikhin
939c8e8a99
float constant replaced by int hex representations
2017-06-15 15:10:41 +03:00
Rostislav Vasilikhin
29593635ed
licence updated
2017-06-14 21:20:10 +03:00
Alexander Alekhin
9fa90c8851
Merge pull request #8899 from tomoaki0705:fixSuppressWarningsUnreachable
2017-06-14 01:07:47 +00:00
Vadim Pisarevsky
fbafc700ea
added v_reduce_sum4() universal intrinsic; corrected number of threads in cv::getNumThreads() in the case of GCD
2017-06-13 18:04:00 +03:00
Tomoaki Teshima
94848a3e1f
suppress unreachable code warning
...
- fix the define condition based on the comment
2017-06-13 08:11:04 +09:00
Alexander Alekhin
a3189e36c0
Merge pull request #8753 from RyuheiMori:fix-cpu-feature-detection-on-android
2017-06-12 16:08:08 +00:00
Alexander Alekhin
3dee87b697
update CPU detection on ANDROID patch
2017-06-11 05:06:49 +00:00
Alexander Alekhin
0213b508dc
Merge pull request #8868 from alalek:fix_build_softfloat
2017-06-08 20:22:53 +00:00
Alexander Alekhin
e3c0d11b55
Merge pull request #8876 from alalek:fix_build_msvs
2017-06-08 19:53:29 +00:00
Maksim Shabunin
f71ea4dfe9
Merge pull request #8816 from mshabunin:sprintf-fix
...
Fixed snprintf for VS 2013 (#8816 )
* Fixed snprintf for VS 2013
* snprintf: removed declaration from header, changed implementation
* cv_snprintf corrected according to comments
* update snprintf patch
2017-06-08 21:53:16 +02:00
Alexander Alekhin
5c0a287ce8
build: fix warning
...
C4189: 'clImageUV' : local variable is initialized but not referenced
2017-06-08 20:40:36 +03:00
Alexander Alekhin
71517a910a
build: fix errors for MSVS2010-2013, reduce default softfloat scope
2017-06-08 01:09:21 +00:00
Alexander Alekhin
125abe2fe4
Merge pull request #8838 from tomoaki0705:dispatchFp16
2017-06-06 15:31:42 +00:00
Tomoaki Teshima
e269ef96cb
update convertFp16 using CV_CPU_CALL_FP16
...
* avoid link error (move the implementation of software version to header)
* make getConvertFuncFp16 local (move from precomp.hpp to convert.hpp)
* fix error on 32bit x86
2017-06-06 22:26:51 +09:00
Rostislav Vasilikhin
c6a3a18894
SoftFloat integrated ( #8668 )
...
* everything is put into softfloat.cpp and softfloat.hpp
* WIP: try to integrate softfloat into OpenCV
* extra functions removed
* softfloat made stateless
* CV_EXPORTS added
* operators fixed
* exp added, log: WIP
* log32 fixed
* shorter names; a lot of TODOs
* log64 rewritten
* cbrt32 added
* minors, refactoring
* "inline" -> "CV_INLINE"
* cast to bool warnings fixed
* several warnings fixed
* fixed warning about unsigned unary minus
* fixed warnings on type cast
* inline -> CV_INLINE
* special cases processing added (NaNs, Infs, etc.)
* constants for NaN and Inf added
* more macros and helper functions added
* added (or fixed) tests for pow32, pow64, cbrt32
* exp-like functions fixed
* minor changes
* fixed random number generation for tests
* tests for exp32 and exp64: values are compared to SoftFloat-based naive implementation
* minor warning fix
* pow(f, i) 32/64: special cases handling added
* unused functions removed
* refactoring is in progress (not compiling)
* CV_inline added
* unions {uint_t, float_t} removed
* tests compilation fixed
* static const members -> static methods returning const
* reinterpret_cast
* warning fixed
* const-ness fixed
* all FP calculations (even compile-time) are done in SoftFloat + minor fixes
* pow(f, i) removed from interface (can cause incorrect cast) to internals of pow(f, f), tests fixed
* CV_INLINE -> inline
* internal constants moved to .cpp file
* toInt_minMag() methods merged into toInt() methods
* macros moved to .cpp file
* refactoring: types renamed to softfloat and softdouble; explicit constructors, etc.
* toFloat(), toDouble() -> operator float(), operator double()
* removed f32/f64 prefixes from functions names
* toType() methods removed, round() and trunc() functions added
* minor change
* minors
* MSVC: warnings fixed
* added int cvRound(), cvFloor, cvCeil, cvTrunc, saturate_cast<T>()
* typo fixed
* type cast fixed
2017-05-29 17:07:25 +03:00
Alexander Alekhin
36918b3bb8
Merge pull request #8814 from woodychow:openmp_num_threads
2017-05-29 13:17:41 +00:00
Woody Chow
6e00c7651b
Use num_threads clause of #pragma omp parallel instead to avoid calling omp_set_num_threads for the entire application
2017-05-29 14:16:10 +09:00
Vadim Pisarevsky
ee257ffe9e
Merge pull request #8455 from terfendail:ovxhal_skipsmall
2017-05-26 12:10:18 +00:00
Vitaly Tuzov
1d62a025b3
Moved size restrictions for OpenVX processed images to corresponding cpp files
2017-05-25 19:25:17 +03:00
Matthias Grundmann
cf4e9e5ce2
Update matrix.cpp
...
Fix race condition in getDefaultAllocator and setDefaultAllocator interaction / not threadsafe currently
2017-05-24 13:55:18 +03:00
Vadim Pisarevsky
7c3577f7ae
Merge pull request #8779 from vpisarev:empty_cmp_fix
2017-05-23 19:06:57 +00:00
Alexander Alekhin
15a2c7724d
Merge pull request #8743 from tomoaki0705:featureConvertFp16UMat
2017-05-23 15:32:12 +00:00
Vadim Pisarevsky
4eda8efd42
resolves https://github.com/opencv/opencv/issues/7792
2017-05-23 18:16:40 +03:00
Vadim Pisarevsky
a065e4b9aa
Merge pull request #8769 from mshabunin:kw-fixes
2017-05-23 14:59:36 +00:00
Tomoaki Teshima
d81cdb8e1c
add OpenCL version of convertFp16 and test
...
* disable vector operation for now
* brush up the implementation based on comment
2017-05-23 20:00:21 +09:00
Maksim Shabunin
f23b6ba652
Fixed multidimensional count non-zero IPP implementation
2017-05-23 13:23:59 +03:00
Maksim Shabunin
b04ed5956e
Fixed several issues found by static analysis in core module
2017-05-23 12:35:31 +03:00
Vadim Pisarevsky
17b89b2a35
Merge pull request #8770 from alalek:fix_pthreads_default
2017-05-22 22:18:26 +00:00
Alexander Alekhin
16ea72e6b9
build: fix snprintf() usage
2017-05-22 22:24:17 +03:00
Alexander Alekhin
900c406541
core: fix threads count in pthreads
2017-05-22 21:45:25 +03:00
Ryuhei Mori
bb3a416320
Fix cpu features detection on android
2017-05-19 21:19:13 +08:00
Alexander Alekhin
17eef4d8a9
Merge pull request #8596 from nnorwitz:nnorwitz
2017-05-12 19:48:28 +00:00
Philipp Hasper
dcd8589b67
Fixed exp32f() compilation on MSVC
2017-05-10 18:25:39 +02:00
Vadim Pisarevsky
b683e68223
Merge pull request #8398 from woodychow:normL2Sqr_avx2
2017-05-03 14:49:56 +00:00
nnorwitz
256b6bb3db
Don't blow out the stack. Use a smaller buffer and prevent buffer overruns with snprintf.
2017-05-03 16:56:09 +03:00
Vadim Pisarevsky
925594d1e3
Merge pull request #7894 from alalek:ocl_program
2017-05-03 13:48:58 +00:00
Alexander Alekhin
75f28245a8
core: fix persistence bug in RAW I/O code
...
- persistence.cpp code expects special sizeof value for passed structures
- this assumption is lead to memory corruption problems
- fixed/workarounded test to prevent memory corruption on Linux 32-bit systems
2017-04-26 17:19:26 +03:00
Alexander Alekhin
26be2402a3
Merge pull request #8629 from lupustr3:pvlasov/icv2017u2_update2
2017-04-26 10:45:37 +00:00
Pavel Vlasov
11c2ffaf1c
Update for IPP for OpenCV 2017u2 integration;
...
Updated integrations for:
cv::split
cv::merge
cv::insertChannel
cv::extractChannel
cv::Mat::convertTo - now with scaled conversions support
cv::LUT - disabled due to performance issues
Mat::copyTo
Mat::setTo
cv::flip
cv::copyMakeBorder - currently disabled
cv::polarToCart
cv::pow - ipp pow function was removed due to performance issues
cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance
cv::countNonZero
cv::minMaxIdx
cv::norm
cv::canny - new integration. Disabled for threaded;
cv::cornerHarris
cv::boxFilter
cv::bilateralFilter
cv::integral
2017-04-25 15:53:12 +03:00
Vadim Pisarevsky
96aaac186d
Merge pull request #8616 from vpisarev:dnn4
2017-04-25 06:32:16 +00:00
Alexander Alekhin
f1c8094f5f
Merge pull request #8575 from lupustr3:pvlasov/icv2017u2_initial_update
2017-04-21 10:55:29 +00:00
Pavel Vlasov
35c7216846
IPP for OpenCV 2017u2 initial enabling patch;
2017-04-20 20:26:30 +03:00
Vadim Pisarevsky
dd54f7a22a
got rid of Blob and BlobShape completely; use cv::Mat and std::vector<int> instead
2017-04-19 23:20:17 +03:00
Arnaud Brejeon
636ab095b0
Merge pull request #8535 from arnaudbrejeon:std_array
...
Add support for std::array<T, N> (#8535 )
* Add support for std::array<T, N>
* Add std::array<Mat, N> support
* Remove UMat constructor with std::array parameter
2017-04-19 13:13:39 +03:00
insoow
2922738b6d
Merge pull request #8104 from insoow:master
...
Gemm kernels for Intel GPU (#8104 )
* Fix an issue with Kernel object reset release when consecutive Kernel::run calls
Kernel::run launch OCL gpu kernels and set a event callback function
to decreate the ref count of UMat or remove UMat when the lauched workloads
are completed. However, for some OCL kernels requires multiple call of
Kernel::run function with some kernel parameter changes (e.g., input
and output buffer offset) to get the final computation result.
In the case, the current implementation requires unnecessary
synchronization and cleanupMat.
This fix requires the user to specify whether there will be more work or not.
If there is no remaining computation, the Kernel::run will reset the
kernel object
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* GEMM kernel optimization for Intel GEN
The optimized kernels uses cl_intel_subgroups extension for better
performance.
Note: This optimized kernels will be part of ISAAC in a code generation
way under MIT license.
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* Fix API compatibility error
This patch fixes a OCV API compatibility error. The error was reported
due to the interface changes of Kernel::run. To resolve the issue,
An overloaded function of Kernel::run is added. It take a flag indicating
whether there are more work to be done with the kernel object without
releasing resources related to it.
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* Renaming intel_gpu_gemm.cpp to intel_gpu_gemm.inl.hpp
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* Revert "Fix API compatibility error"
This reverts commit 2ef427db91
.
Conflicts:
modules/core/src/intel_gpu_gemm.inl.hpp
* Revert "Fix an issue with Kernel object reset release when consecutive Kernel::run calls"
This reverts commit cc7f9f5469
.
* Fix the case of uninitialization D
When C is null and beta is non-zero, D is used without initialization.
This resloves the issue
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* fix potential output error due to 0 * nan
Signed-off-by: Woo, Insoo <insoo.woo@intel.com>
* whitespace fix, eliminate non-ASCII symbols
* fix build warning
2017-04-19 12:57:54 +03:00
Yuriy Solovyov
26ccc09c46
Fix zlib issue on iOS
2017-04-14 17:16:00 +03:00
Vitaly Tuzov
bf5b7843e8
Extended set of OpenVX HAL calls disabled for small images
2017-04-06 18:17:32 +03:00
Alexander Alekhin
297ba85323
Merge pull request #8441 from alalek:dispatch_mathfuncs_core
2017-04-03 14:03:49 +00:00
Alexander Alekhin
1e6ce1d2f8
core(mathfuncs_core): cpu optimization dispatched code
2017-03-23 16:17:10 +03:00
KUANG, Fangjun
03c4c37969
fix issue 8189.
2017-03-22 22:24:20 +01:00
Vadim Pisarevsky
0b3d13645f
Merge pull request #8364 from csukuangfj:patch-2
2017-03-22 14:13:13 +00:00
Alexander Alekhin
ba104b61bf
Merge branch 'pr8392'
2017-03-22 13:45:24 +03:00
Vadim Pisarevsky
8abd163464
Merge pull request #8404 from khnaba:stream-with-custom-allocator
2017-03-21 20:06:56 +00:00
Vadim Pisarevsky
e5dbd2c3a5
Merge pull request #8406 from khnaba:dft-as-algorithm
2017-03-21 20:05:54 +00:00
Vadim Pisarevsky
a57d144076
Merge pull request #7462 from alalek:cpu_multi_target
2017-03-21 19:51:32 +00:00
Naba Kumar
29680100ac
Support for creating streams with custom allocator
2017-03-21 14:50:14 +02:00
Naba Kumar
00f3ad7217
Implement DFT as cv::Algorithm to support concurrent streams
2017-03-21 13:55:13 +02:00
Naba Kumar
cdcf44b3ef
Expose BufferPool class for external use also
2017-03-21 13:50:02 +02:00
Woody Chow
c370cc10e9
Optimize normL2Sqr_ with AVX2
2017-03-16 14:20:41 +09:00
Woody Chow
a8763c1fec
Optimize exp32f with AVX2
2017-03-15 17:03:36 +09:00
KUANG, Fangjun
debc1c4c95
fix an error while setting kernel argument for a 3-D matrix.
2017-03-12 18:29:49 +01:00
KUANG, Fangjun
be7d4608fb
Add more comments to the members of CoreTLSData related to OpenCL.
2017-03-12 16:13:00 +01:00
Alexander Alekhin
5d31d6ebbb
Merge pull request #8306 from chacha21:portability
2017-03-03 04:46:05 +00:00
chacha21
74abbd0898
Fix compilation when USE_ZLIB is false
...
create a dummy gzFile type
2017-03-02 16:58:51 +01:00
chacha21
aa1b031274
get rid of warning C4800 under VS2010
...
the "std::basic_ios::operator bool" differs between C++98 and C++11. The
"double not" syntax is portable and covers both cases with equivalent
meaning
2017-03-02 16:56:20 +01:00
Vadim Pisarevsky
c7049ca627
Merge pull request #8293 from alalek:update_rng_in_parallel_for
2017-03-02 05:51:01 +00:00
Vadim Pisarevsky
ddfe688be6
Merge pull request #8299 from sovrasov:fs_fix_kpts_dmatch_output
2017-03-02 05:46:38 +00:00
Alexander Alekhin
47c4dcc8a3
Merge pull request #8204 from terfendail:ovx_tlcontext
2017-03-01 12:36:37 +00:00
Vladislav Sovrasov
c321d025c4
Fix DMatch and Keypoint I/O in FileStorage
2017-03-01 15:07:38 +03:00
Alexander Alekhin
649bb7ac04
core: parallel_for_(): update RNG state of the main thread
2017-02-28 18:28:15 +03:00
Alexander Alekhin
b28fd79fdc
core: parallel_for_(): propagate RNG state from the main thread
2017-02-28 18:22:46 +03:00
Alexander Alekhin
eee638fd81
Merge pull request #8244 from sovrasov:adjust_roi_fix
2017-02-24 11:18:35 +00:00
Vadim Pisarevsky
12d7429ff0
Merge pull request #8064 from terfendail:sgbm_bigbuffer
2017-02-23 20:11:26 +00:00
Vladislav Sovrasov
595437bdd1
hal: replace round() with cvRound()
2017-02-22 14:08:38 +03:00
Vladislav Sovrasov
14451f3f06
core: fix adjustROI behavior on indexes overflow
2017-02-22 14:05:51 +03:00
Vitaly Tuzov
9a4b5a4545
OpenVX calls updated to use single common OpenVX context per thread
2017-02-21 16:08:23 +03:00
Alexander Alekhin
ec7f74f7b4
core(TLS): add cleanup() method
2017-02-21 16:08:23 +03:00
Fangjun KUANG
526220a171
Fix typos in the documentation ( #8226 )
...
* fix typos.
* Fix typos.
* Fix typos.
* Fix typos.
* Fix typos.
2017-02-21 12:48:15 +03:00
Vadim Pisarevsky
ca0b6fbb95
Merge pull request #8203 from alalek:ocl_kernelarg_validate
2017-02-16 14:58:25 +00:00
Jinay Patel
f710d96602
Update ocl.cpp function Device::OpenCLVersion()
...
Instead of CL_DEVICE_EXTENSIONS in Querying OpenCLVersion, it should be CL_DEVICE_VERSION flag which needs to be passed to the function.
2017-02-16 15:20:57 +05:30
Fangjun KUANG
5e4dfb9546
Change the argument type to cl_platform_info.
...
Although both `cl_platform_info` and `cl_device_info` are defined as macro `cl_uint`, it needs to use `cl_platform_info` to get
the platform information.
2017-02-15 16:44:40 +01:00
Alexander Alekhin
4c7aa8645a
ocl: validate arguments in KernelArgs constructor
...
- don't use undefined flag=0. It should be CONSTANT instead.
- don't allow 'UMat* m=NULL' argument (except LOCAL/CONSTANT flags).
This case is not handled well to provide NULL __global pointers.
It is better to use '-D' macro defines instead (at least for performance)
2017-02-14 16:10:32 +03:00
Alexander Alekhin
e16227b53c
cmake: support multiple CPU targets
2017-02-13 19:52:59 +03:00
Vitaly Tuzov
4950f542d1
Fix for SGBM compute() buffer allocation failure on big images
2017-02-08 12:49:21 +03:00
Alexander Alekhin
3476440ec3
Merge pull request #8078 from tomoaki0705:universalIntrinsicLapack
2017-01-31 18:46:54 +00:00
Vadim Pisarevsky
dfb348ef0b
Merge pull request #8081 from mshabunin:fix-kmeans-compactness
2017-01-31 12:19:46 +00:00
Maksim Shabunin
b417b4dbee
KMeans improvement
...
- fixed returned compactness value
- added centers drawing to the example app
- added compactness test
2017-01-31 12:05:08 +03:00
Tomoaki Teshima
fd711219a2
use universal intrinsic in VBLAS
...
- brush up v_reduce_sum of SSE version
2017-01-31 05:36:27 +09:00
Tomoaki Teshima
d0bdf99501
check correct flag
2017-01-27 18:42:58 +09:00
Tomoaki Teshima
820fdbff23
brush up AVX optimization in popcount
...
- make sure SIMD optimization works even when AVX is not available
2017-01-26 18:37:56 +09:00
Alexander Alekhin
a22f03e749
Merge pull request #7863 from tomoaki0705:universalIntrinsicPopcount
2017-01-18 17:25:02 +00:00
Alexander Alekhin
7c91700cc6
Merge pull request #7999 from alalek:fix_lapack
2017-01-18 13:20:30 +00:00
Vladislav Sovrasov
896c34fab3
Add support of type headings from YAML1.2
2017-01-17 16:40:38 +03:00
Alexander Alekhin
3668a01fca
eliminate warnings
...
hal_internal.cpp(101): warning C4267: 'initializing': conversion from 'size_t' to 'int', possible loss of data
2017-01-17 15:56:42 +03:00
Tomoaki Teshima
8b22099da2
use universal intrinsic and SSE4 popcount instruction in normHamming
...
- add v_popcount in universal intrinsic
- add test for v_popcount
- add wrapper of popcount for both MSVC and GCC
2017-01-12 09:09:22 +09:00
atinfinity
b3c77e664d
added to check the support of AVX2
2017-01-09 19:51:33 +09:00
Matt Bennett
c3a8db6d6c
Merge pull request #7952 from mattmyne:JSONWriteFixTrailingDecimalPoint
...
Append zero to trailing decimal place for FileStorage JSON write of a float or double value (#7952 )
* Fix for FileStorage JSON write of a float or double value that has no fractional part; appends a zero character after the trailing decimal place to meet JSON standard.
* strlen return to size_t type rather than unnecessary cast to int
2017-01-02 20:28:07 +00:00
Steve
8a6fabed83
Avoid segmentation fault
...
Change `<=` to `<` to avoid accessing the first character of an empty string
2016-12-29 17:23:40 -08:00
Vadim Pisarevsky
ad74fdd7cc
moved BLAS/LAPACK detection scripts from opencv_contrib/dnn to the main repository ( #7918 )
...
* moved BLAS/LAPACK detection scripts from opencv_contrib/dnn to the main repository.
* trying to fix the bug with undefined symbols sgesdd_ and dgesdd_
* removed extra whitespaces; disabled LAPACK on IOS
2016-12-22 22:57:44 +03:00
Alexander Alekhin
c3e409f950
ocl: update program cache defaults
2016-12-22 04:52:43 +03:00
Alexander Alekhin
7a95e654eb
ocl: update compiled programs
...
- minimize library initialization time (lazy calculations of program hash)
- LRU cache of in-memory compiled programs
2016-12-19 17:17:20 +03:00
Alexander Alekhin
86bb523991
Merge pull request #7871 from savuor:fix/openvx_cvt_on
2016-12-19 13:51:53 +00:00
Alexander Alekhin
05d187ec90
ocl: move deprecated code from ocl.cpp
2016-12-19 00:34:50 +03:00
Rostislav Vasilikhin
fcdbe16252
openvx_cvt disabled for Khronos, fixed sstep and dstep usage
2016-12-16 23:00:55 +03:00
Alexander Alekhin
862c3aa6e1
Merge pull request #7873 from addisonElliott:Mat_Range_InitializerList
2016-12-16 16:45:17 +00:00
Alexander Alekhin
0e4dde1781
Merge pull request #7872 from alalek:merge-2.4
2016-12-16 16:03:14 +02:00
Rostislav Vasilikhin
cf5e976fad
OpenVX convert enabled
2016-12-16 15:57:21 +03:00
Alexander Alekhin
c038d1be60
Merge pull request #7858 from addisonElliott:master
2016-12-16 10:57:27 +00:00
Addison Elliott
eb04b2bfa9
Added N-dim submat selection with vectors
...
Currently, to select a submatrix of a N-dimensional matrix, it requires
two lines of code while only one line of code is required if using a 2D
array.
I added functionality to be able to select an N-dim submatrix using a
vector list instead of a Range pointer. This allows initializer lists to
be used for a one-line selection.
2016-12-15 09:16:40 -06:00
Addison Elliott
fa6692afcf
Added new overloaded functions for Mat and UMat that accepts std::vector<int> instead of int * for the sizes on a N-dimensional array.
...
This allows for an N-dimensional array to be setup in one line instead of two when using C++11 initializer lists. cv::Mat(3, {zDim, yDim, xDim}, ...) can be used instead of having to create an int pointer to hold the size array.
2016-12-14 13:52:03 -06:00
Rostislav Vasilikhin
8b9422a052
OpenVX wrappers rewritten with CV_OVX_RUN, VX_DbgThrow
2016-12-14 17:49:41 +03:00
Alexander Alekhin
fbf2383d5d
Merge pull request #7787 from alalek:ocl_explicit_only
2016-12-13 10:22:33 +00:00
Alexander Alekhin
cb1d4e692b
Merge pull request #7801 from alalek:fix_7795
2016-12-13 10:20:31 +00:00
apavlenko
76c38f0c80
trying to enable canny_vx adding a new test comparing canny_cv vs canny_vx
2016-12-09 14:53:06 +03:00
apavlenko
1e2ddc30b1
Canny via OpenVX, Node wrapper extended (query/set attribute), some naming fixes
2016-12-09 14:53:06 +03:00
Alexander Alekhin
beea04cc89
Merge pull request #7826 from mshabunin:fix-base64-issue
2016-12-08 17:01:33 +00:00
Alexander Alekhin
48bff3bfd3
Merge pull request #7748 from LaurentBerger:Normalize3d
2016-12-08 16:20:11 +00:00
mshabunin
e941259434
Persistence: fixed valgrind warning in base64 decoder
2016-12-08 14:58:18 +03:00
Rostislav Vasilikhin
695b20172b
Merge pull request #7794 from savuor:fix/ovx_cvt_continuous
...
Fixed OpenVX wrapper for Mat::convertTo() (#7794 )
* fixed for cases of unrolled (w*h x 1) matrices
* more error handling
2016-12-06 18:29:44 +02:00
Alexander Alekhin
fef94315af
core: exp/log workaround for MSVS
2016-12-06 16:57:35 +03:00
Alexander Alekhin
44d9d59f08
ocl: stop using of OpenCL without explicit UMat arguments
2016-12-04 02:34:17 +03:00
mshabunin
a0e771f545
Fixed valgrind issues in persistence functions
2016-12-02 18:41:25 +03:00
Vitaly Tuzov
ced81f72bc
Added OpenVX based processing to LUT
2016-12-02 14:36:47 +03:00
Vadim Pisarevsky
924ac10159
Merge pull request #7649 from terfendail:ovx_meanstddev
2016-12-01 17:46:30 +00:00
Vadim Pisarevsky
0a775e43e1
Merge pull request #7659 from savuor:openvx_convert_depth
2016-12-01 17:44:49 +00:00
Vadim Pisarevsky
ddc5155a85
Merge pull request #7677 from terfendail:ovx_minmaxloc
2016-12-01 17:42:18 +00:00
mshabunin
695c518384
Updated TBB search script and code checks
2016-12-01 16:58:38 +03:00
Rostislav Vasilikhin
2b56b174e8
fixed: data types, empty input case
2016-11-29 17:52:50 +03:00
Rostislav Vasilikhin
0a6958813c
added OpenVX call to Mat::convertTo() (w/o scaling)
2016-11-29 17:52:36 +03:00
Vitaly Tuzov
a4b9eb23d1
Added OpenVX based processing to meanStdDev
2016-11-29 16:59:25 +03:00
Vitaly Tuzov
e9cc0f6a4c
Added OpenVX based processing to minMaxLoc
2016-11-29 15:33:48 +03:00
LaurentBerger
c56c0e140b
Solve exception for 3D Mat
2016-11-29 12:10:33 +01:00
Vadim Pisarevsky
be0865406f
Merge pull request #7548 from tomoaki0705:fixRngAvoidFmadd
2016-11-28 21:04:03 +00:00
Alexander Alekhin
0bdea2b714
core: fix absdiff (non-optimized, fp) to prevent "-0" results
2016-11-23 13:55:38 +03:00
Philippe Renon
e79f83e573
calib3d: fix missing cv::redirectError symbol error
...
happens on msys2 with gcc 6.2.0
see also http://stackoverflow.com/questions/38552221/undefined-reference-to-cvredirecterror-while-creating-shared-build-of-opencv-3
2016-11-15 08:56:01 +01:00
Vadim Pisarevsky
6b7d570c34
Merge pull request #7539 from Tetragramm:90DegreeRotations
2016-11-11 20:30:25 +00:00
Tetragramm
24379fcb5f
Use transpose() as suggested, because it works on pre-existing destination Mats.
2016-11-10 21:35:00 -06:00
Pavel Vlasov
349d5ba012
--perf_instrument parameter now has int type and 0, 1, 2 modes (1 - simple trees, 2 - expanded trees for functions with same name but different calling address);
...
Maximum depth limit var was added to the instrumentation structure;
Trace names output console output fix: improper tree formatting could happen;
Output in case of error was added;
Custom regions improvements;
Improved timing and weight calculation for parallel regions; New TC (threads counter) value to indicate how many different threads accessed particular node;
parallel_for, warnings fixes and ReturnAddress code from Alexander Alekhin;
2016-11-08 10:18:05 +03:00
Tomoaki Teshima
676736bdb2
avoid using fmadd in RNG
2016-11-03 23:11:52 +09:00
Tetragramm
ad5c50a923
Improve the efficiency as suggested by vpisarev.
...
Alter the Rotation enum to be unambiguous as to direction.
2016-11-02 17:44:13 -05:00
Vadim Pisarevsky
280cdb5b94
Merge pull request #7595 from sovrasov:fs_utf8_bom
2016-11-02 15:27:48 +00:00
Vladislav Sovrasov
e955d17c51
Add new error messages
2016-11-02 14:53:56 +03:00
Vladislav Sovrasov
38483fe7a7
Skip UTF-8 BOM in FileStorage
2016-11-02 14:51:07 +03:00
Tomoaki Teshima
cba22349b7
add universal hardware support check function
...
* use hasSIMD128 rather than calling checkHardwareSupport
* add SIMD check in spartialgradient.cpp
* add SIMD check in stereosgbm.cpp
* add SIMD check in canny.cpp
2016-10-29 13:24:31 +09:00
fegorsch
ddf0554fff
Use colon in "%YAML:1.0" directive when persisting to YAML-files
...
In YAML 1.0 the colon is mandatory. See http://yaml.org/spec/1.0/#id2558635 .
This also allows prior releases to read YAML files created with the current version.
2016-10-25 18:18:35 +02:00
Tetragramm
6f7bf653f7
Add 90 degree rotation methods. This provides a quick simple way of doing 90 degree rotations.
...
Also fix warnings that show up on other compilers in test builds.
2016-10-22 12:48:52 -05:00
Vincent Rabaud
a89aa8c90a
Fix imageSize overflow in IplImage
2016-10-20 11:23:45 +02:00
Alexander Alekhin
fe8501c931
ocl: fix SVM code
2016-10-10 20:52:30 +03:00
Vadim Pisarevsky
fad0598634
normHamming done with AVX 2 intrinsics (ver 2) ( #7405 )
...
* normHamming done with AVX 2 intrinsics.
* fixed compiling AVX2-optimized norm hamming with MSVC
2016-10-05 21:23:32 +04:00
Vadim Pisarevsky
83f2eb79f1
make sure that the empty mat is copied to UMat properly - i.e. UMat becomes empty. Before the patch such copy operation crashed
2016-10-05 14:07:50 +03:00
Alexander Alekhin
411e551335
ocl: autogenerated code
2016-10-02 02:43:02 +03:00
Alexander Alekhin
a0add98b30
ocl: eliminate build warning
...
Templates are replaced by macro
2016-10-02 02:43:02 +03:00
Alexander Alekhin
a9ab629f60
build: fix aarch64 build with aarch64-linux-gnu-g++-4.8
2016-09-29 17:26:19 +03:00
Tomoaki Teshima
c7cb116dc0
check FP16 build condition correctly
...
* use __GNUC_MINOR__ in correct place to check the version of GCC
* check processor support of FP16 at run time
* check compiler support of FP16 and pass correct compiler option
* rely on ENABLE_AVX on gcc since AVX is generated when mf16c is passed
* guard correctly using ifdef in case of various configuration
* use v_float16x4 correctly by including the right header file
2016-09-23 11:04:22 +09:00
Pavel Vlasov
0a9470bf09
Compatibility fix for Visual Studio 2012;
2016-09-16 10:03:42 +03:00
Alexander Alekhin
10d18a315c
instrumentation: minor refactoring, wrap OpenCL
...
- calculate ticksTotal instead of ticksMean
- local / global width is based on ticksTotal value
- added instrumentation for OpenCL program compilation
- added instrumentation for OpenCL kernel execution
2016-09-08 16:13:44 +03:00
Alexander Alekhin
f6208e3dea
Merge pull request #7247 from valeriyvan:matrix
2016-09-08 09:49:02 +00:00
Alexander Alekhin
125e2dcbba
Merge pull request #7156 from sovrasov:hal_qr_decomp
2016-09-08 09:47:52 +00:00
Alexander Alekhin
c8d77fd961
Merge pull request #7233 from tomoaki0705:featureUniversalIntrinsicFp16
2016-09-07 14:42:46 +00:00
Valeriy Van
c562f0ad38
Minor fix in MatAllocator::download
...
Minor fix in MatAllocator::upload
Minor fix in MatAllocator::copy
Minor fix in setSize function
Minor fix in Mat::Mat
Minor fix in cvMatNDToMat function
Minor fix in _InputArray::getMatVector
Minor fix in _InputArray::getUMatVector
Minor fix in cv::hconcat
Minor fix in cv::vconcat
Minor fix in cv::setIdentity
Minor fix in cv::trace
Minor fix in transposeI_ template function
Minor fix in reduceC_ template function
Minor fix in sort_ template function
Minor fix in sortIdx_ template function
Minor fix in cvRange function
Minor fix in MatConstIterator::seek
Minor fix in SparseMat::create
Minor fix in SparseMat::copyTo
Minor fix in SparseMat::convertTo
Minor fix in SparseMat::convertTo
Minor fix in SparseMat::ptr
Minor fix in SparseMat::resizeHashTab
Fixes indentation
2016-09-07 17:25:36 +03:00
Vladislav Sovrasov
dfe4519c07
Add QR decomposition to HAL
2016-09-05 18:20:04 +03:00
Sergey Alexandrov
765dd30981
Add support for GpuMat in InputArray::isContinuous()
2016-09-05 09:27:35 +02:00
Tomoaki Teshima
903789f7af
use universal intrinsic for FP16
...
* use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
* define v_load_f16/v_store_f16 since v_load can't be distinguished when short pointer passed
* brush up implementation on old compiler (guard correctly)
* add test for v_load_f16 and round trip conversion of v_float16x4
* fix conversion error
2016-09-05 08:13:52 +09:00
Alexander Alekhin
4e8f8922e2
Merge pull request #7170 from alalek:fix_build
2016-08-29 10:15:58 +00:00
Alexander Alekhin
5383a625a7
fix Win build, change default WINNT macro to WinXP
...
Returning to Win2k value is possible via this CMake option:
cmake -DOPENCV_EXTRA_FLAGS="/D_WIN32_WINNT=0x0500" ...
2016-08-25 16:21:52 +03:00
Alexander Alekhin
da5ead2c23
Merge pull request #7166 from tomoaki0705:brushUpFp16
2016-08-25 11:49:23 +00:00
Tomoaki Teshima
c5d7791b67
brush up fp16 implementation
...
* DRY
* switch to Cv32suf and remove fp32Int32
* add Cv16suf
2016-08-25 05:31:25 +09:00
Vadim Pisarevsky
b03e3abd62
Merge pull request #7088 from wiryls:FileStorageJSON
2016-08-24 16:49:06 +00:00
MYLS
47f2e618ee
Add two tests.
...
- one test for type_id;
- another for comments in JSON;
2016-08-24 17:01:57 +08:00
Alexander Alekhin
b3b434e221
Merge pull request #7141 from lupustr3:pvlasov/instrumentation_extension
2016-08-23 16:48:25 +00:00
mpkh
01d2e7af71
fix building on FreeBSD
...
resolve undeclared identifier close error
2016-08-22 17:17:06 +04:00
Pavel Vlasov
30a6cee2fe
Instrumentation for OpenCV API regions and IPP functions;
2016-08-19 18:10:03 +03:00
Pavel Vlasov
680ca88ce0
Outdated ICV restrictions were removed;
2016-08-19 15:08:39 +03:00
Alexander Alekhin
93e5947f55
Merge pull request #7101 from lupustr3:pvlasov/instrumentation
2016-08-18 14:56:41 +00:00
Alexander Alekhin
5d7ee48031
Merge pull request #7085 from sovrasov:hal_doc_fix
2016-08-17 15:15:08 +00:00
Pavel Vlasov
a462d76a95
Implementations instrumentation framework for OpenCV performance system;
2016-08-17 16:26:03 +03:00
Maksim Shabunin
cd943cc60d
Merge pull request #7106 from lupustr3:pvlasov/ipp_hal_fix
2016-08-15 10:08:44 +00:00
Maksim Shabunin
031076ab93
Merge pull request #7077 from LaurentBerger:I7063
2016-08-15 09:08:44 +00:00
Pavel Vlasov
3860b8db02
IPP was enabled in mathfuncs_core;
...
Exp and Log IPP implementations are changed to hal interface;
2016-08-12 18:16:04 +03:00
Pavel Vlasov
8bcec7d77f
IPP enabling fix and typo fix;
2016-08-12 10:11:47 +03:00
Vladislav Sovrasov
13d2154f9a
Fix HAL documentation
2016-08-11 14:41:49 +03:00
LaurentBerger
b75bac7975
Solve Issue 7063
...
consequences of changes
accuracy test
Solve issue 7063
2016-08-11 10:56:50 +02:00
MYLS
d99832163a
solve warnings and do some cleaning.
2016-08-11 03:31:35 +08:00
MYLS
8596e82d98
Add JSON support.
...
a JSON emitter, a parser, tests and some basic doc.
2016-08-11 00:53:15 +08:00
Vadim Pisarevsky
96d48331da
Merge pull request #7076 from sovrasov:mkl_lapack
2016-08-10 10:25:17 +00:00
Vadim Pisarevsky
5eee757658
Merge pull request #6949 from wiryls:FileStorageBase64DocsTests
2016-08-10 09:26:13 +00:00
Vladislav Sovrasov
a113d9bcc1
Add MKL support
2016-08-10 11:20:23 +03:00
Tomoaki Teshima
3debc78a5f
fix build error on JetsonTK1
...
* avoid using vld1_f16 and vst1_f16 on gcc 4 series (Ubuntu 14.04)
* guard correctly with #if
* use static inline
2016-08-09 17:12:22 +09:00
Tomoaki Teshima
2db2d137ce
follow the naming rule as other API
...
* stop using 'CUDA' prefix
* add explanation
2016-08-05 22:14:55 +09:00
MYLS
1da8a19af3
fixed two bugs that I caused.
2016-08-05 16:15:03 +08:00
MYLS
86238d2748
make some changes according to the code review.
...
- use `std::string` instead of `new char`;
- reserve 1 MB for buffer;
2016-08-05 15:17:21 +08:00
Alexander Alekhin
b2698f24b0
Merge pull request #7033 from tomoaki0705:brushUpFp16
2016-08-04 09:31:52 +00:00
Vadim Pisarevsky
0a7e53da0a
Merge pull request #7010 from K-Shinotsuka:issue10
2016-08-03 18:44:02 +00:00
Vadim Pisarevsky
03d6c5e473
Merge pull request #7011 from K-Shinotsuka:issue11
2016-08-03 18:34:08 +00:00
k-shinotsuka
1a18a33688
improve SumSqr_SIMD<schar, int, int>()
2016-08-03 19:17:51 +09:00
Tomoaki Teshima
87ca607fd4
brush up convertFp16
...
* raise an error when wrong bit depth passed
* raise an build error when wrong depth is specified for cvtScaleHalf_
* remove unnecessary safe check in cvtScaleHalf_
* use intrinsic instead of direct pointer access
* update the explanation
2016-08-03 17:27:45 +09:00
k-shinotsuka
0b4e06ad3c
improve SumSqr_SIMD<uchar, int, int>()
2016-08-02 23:46:03 +09:00
Tomoaki Teshima
2974b049e7
cudev: add feature to convert FP32(float) from/to FP16(half) on GPU
...
* add feature of Fp16 on GPU (cudev)
* add test
* leave template function as unimplemented to raise error
2016-08-01 00:55:16 +09:00
MYLS
3935855365
Merge remote-tracking branch 'refs/remotes/opencv/master' into FileStorageBase64DocsTests
...
# Conflicts:
# modules/core/test/test_io.cpp
2016-07-30 01:08:27 +08:00
MYLS
08911cbfae
change code
to snippet
and do some cleaning.
...
- use `@snippet` instead of `@code` in docs.
- remove some functions that were not used.
2016-07-30 00:35:41 +08:00
Tomoaki Teshima
c57f8780e9
show CPU feature correctly when FP16 is available
...
* make sure that CV_FP16 has the correct meaning
* check FP16 feature correctly
2016-07-29 14:10:33 +09:00
Vadim Pisarevsky
167a4e6592
Merge pull request #6942 from alalek:fix_android_pack
2016-07-21 07:30:15 +00:00
Alexander Alekhin
2acfb263bb
Merge pull request #6910 from ilya-lavrenov:clone
2016-07-20 15:32:22 +00:00
Alexander Alekhin
2ec63e4dd1
fix android pack build
2016-07-20 16:49:57 +03:00
Vadim Pisarevsky
c4cc54e5da
Merge pull request #6953 from ilya-lavrenov:arithm
2016-07-20 13:42:30 +00:00
Ilya Lavrenov
71cbd6f02e
fixed bug #5775
2016-07-20 11:38:15 +03:00
Rostislav Vasilikhin
96edb270a6
fixed memory corruption when normal dist. params have less channels than target matrix; test added
2016-07-19 17:26:39 +03:00
MYLS
cf2d6f6721
solve errors and warnings
...
Modified performance test and solve a problem caused by an enum type.
2016-07-19 21:18:41 +08:00
MYLS
78ca5ddd45
solve errors and warnings
2016-07-19 19:56:57 +08:00
Ilya Lavrenov
87ae4e0564
fixed clone of empty matrix
2016-07-19 11:31:53 +03:00
MYLS
617df09143
Modify Base64 functions and add test and documentation
...
Major changes:
- modify the Base64 functions to compatible with `cvWriteRawData` and so
on.
- add a Base64 flag for FileStorage and outputs raw data in Base64
automatically.
- complete all testing and documentation.
2016-07-19 15:54:38 +08:00
Vadim Pisarevsky
48b747903b
Merge pull request #6830 from tomoaki0705:featureSupportFp16
2016-07-18 15:56:00 +00:00
Ilya Lavrenov
957bf1452c
added some tests for push_back
2016-07-15 14:48:22 +03:00
Ilya Lavrenov
c4b57d0ded
disable ippiMinMaxIndx_32f_C1R usage since it crashes on Nans
2016-07-14 14:52:58 +03:00
Alexander Alekhin
53f4eaa7a7
Merge pull request #6892 from ioxp:fsSpace
2016-07-13 11:46:50 +00:00
Alexander Alekhin
a6b33c3590
Merge pull request #6846 from mbarnach:6086-cuda-initialization
2016-07-11 14:01:51 +00:00
Alexander Alekhin
7176c1327e
Merge pull request #6850 from alalek:bigdata_test
2016-07-11 14:01:06 +00:00
Alexander Alekhin
9b3d65c5e4
Merge pull request #6867 from alalek:fix_lapack
2016-07-11 14:00:22 +00:00
Alexander Alekhin
5f269d08b4
bigdata: add test, resolve split/merge issue
2016-07-08 18:05:53 +03:00
Philipp Hasper
00112bbe10
persistence: fixing crash with space-only values
2016-07-08 12:41:05 +02:00
Alexander Alekhin
b23527bf02
core: fix arguments types for cblas calls
2016-06-30 20:25:08 +03:00
Vadim Pisarevsky
7fea7e0629
Merge pull request #6697 from wiryls:FileStorageBase64
2016-06-30 17:17:03 +00:00
MYLS
7c92ee2e6e
Split cvWriteRawData_Base64
into three functions
...
The three new functions:
```cpp
void cvStartWriteRawData_Base64(::CvFileStorage * fs, const char* name,
int len, const char* dt);
void cvWriteRawData_Base64(::CvFileStorage *
fs, const void* _data, int len);
void
cvEndWriteRawData_Base64(::CvFileStorage * fs);
```
Test is also updated. (And it's remarkable that there is a bug in
`cvWriteReadData`.)
2016-06-24 22:27:42 +08:00
MYLS
29921d055d
change the parameter to CvMat
and CvMatND
...
```cpp
cvWriteMat_Base64(::cv::FileStorage & fs, ::cv::String const & name,
::cv::Mat const & mat)
```
becomes:
```cpp
CV_EXPORTS void cvWriteMat_Base64(::CvFileStorage* fs, const char* name,
const ::CvMat* mat);
CV_EXPORTS void
cvWriteMatND_Base64(::CvFileStorage* fs, const char* name, const
::CvMatND* mat);
```
2016-06-20 16:59:58 +08:00
MYLS
9faa2a7fd0
solve warning for IOS
...
Two test are still needed:
1. Verify the Base64 data.
2. Read an old YML file for compatibility test.
2016-06-19 02:44:39 +08:00
MYLS
958263d245
Solve warnings, and adjusted the test case.
2016-06-19 02:00:32 +08:00
MYLS
882e4221e7
fix errors from test.
...
Two other test are still needed.
1. Verify the Base64 data.
2. Read an old YML file for compatibility test.
2016-06-19 00:45:51 +08:00
MYLS
d1b097f409
fix most coding style warnings and errors
2016-06-18 23:28:12 +08:00
MYLS
ecd827fc8e
Add Base64 support for FileStorage
...
[GSoC] FileStorage:
Add base64 support for reading and writting XML\YML file.
The two new functions:
```
void cvWriteRawData_Base64(cv::FileStorage & fs, const void* _data, int
len, const char* dt);
void cvWriteMat_Base64(cv::FileStorage & fs, cv::String const & name,
cv::Mat const & mat);
```
2016-06-18 21:40:29 +08:00
Tomoaki Teshima
87d0c91dcf
fix warning of build
2016-06-09 18:24:00 +09:00
Tomoaki Teshima
d0a8390963
fix run time error on Mac
...
* integrate HW version and SW version to same function
2016-06-09 08:41:37 +09:00
Tomoaki Teshima
fd76ed5c0f
fix to support wider compiler
...
* check compiler more strictly
* use gcc version of fp16 conversion if it's possible (gcc 4.7 and later)
* use current SW implementation in other cases
2016-06-07 18:32:47 +09:00
Tomoaki Teshima
6f6eebbcb9
fix warning
2016-06-07 18:31:18 +09:00
Vladislav Sovrasov
055f5c7395
Change names of some internal functions
2016-06-07 10:58:12 +03:00
Tomoaki Teshima
fbfd3158a7
fix corner case when number is small
2016-06-07 08:59:28 +09:00
Tomoaki Teshima
eccf2fa4c3
follow other interface
...
* remove useHW option
* update test
2016-06-06 08:56:37 +09:00
Vladislav Sovrasov
a2d0cc878c
Implement internal HAL for GEMM and matrix decompositions
2016-06-03 10:38:30 +03:00
Alexander Alekhin
d3930cdee1
Merge pull request #6482 from paroj:filestorage_py
2016-05-31 11:53:34 +00:00
Pavel Rojtberg
bf688da50b
FileStorage: add simplified API for bindings
...
at least it is possible to read/ write calibration files. Fixes #4282 .
Also add CPP method for writing comments.
2016-05-25 16:01:55 +02:00
Maksim Shabunin
64a5126ff8
Merge pull request #6018 from sturkmen72:patch-2
2016-05-23 11:57:26 +00:00
Tomoaki Teshima
b2ad7cd9c0
add feature to convert FP32(float) to FP16(half)
...
* check compiler support
* check HW support before executing
* add test doing round trip conversion from / to FP32
* treat array correctly if size is not multiple of 4
* add declaration to prevent warning
* make it possible to enable fp16 on 32bit ARM
* let the conversion possible on non-supported HW, too.
* add test using both HW and SW implementation
2016-05-21 21:31:33 +09:00
Suleyman TURKMEN
c6e6d4c822
Update pca.cpp
2016-05-21 14:23:45 +03:00
Julien Nabet
a29c814bd8
cppcheck: fix some reports
...
All of these: (performance) Prefer prefix ++/-- operators for non-primitive types.
[modules/calib3d/src/fundam.cpp:1049] -> [modules/calib3d/src/fundam.cpp:1049]: (style) Same expression on both sides of '&&'.
2016-04-29 15:41:39 +02:00
Vadim Pisarevsky
904381c602
Merge pull request #6020 from mshabunin:hal_dxt
2016-04-25 11:37:18 +00:00
Maksim Shabunin
5a938309c1
More compilation warnings fixed
2016-04-19 16:08:48 +03:00
Maksim Shabunin
11378fcb17
Fixed compiation problems
2016-04-19 14:50:07 +03:00
Maksim Shabunin
5e9192287d
Added HAL cvtColor interface
2016-04-18 16:34:25 +03:00
Alexander Alekhin
96937bac74
Merge pull request #6581 from mshabunin:hal_mag
2016-06-21 13:16:17 +00:00
Maksim Shabunin
1e667de1f3
HAL math interfaces: fastAtan2, magnitude, sqrt, invSqrt, log, exp
2016-05-31 11:54:52 +03:00
Maksim Shabunin
1307bb1d03
Merge pull request #6314 from mvukad:bugfix_dtreeswrite
2016-04-12 13:26:25 +00:00
Philipp Hasper
47c4ee974a
operator<< handles keys starting with underscore
2016-04-11 09:55:33 +02:00
Maksim Shabunin
233612efd7
Reworked HAL dft/dct interface, added replacement documentation
2016-04-08 16:03:51 +03:00
Maksim Shabunin
f40d701427
DFT: renamed HAL functions
2016-04-08 11:19:28 +03:00
Maksim Shabunin
008abd28fd
Extracted HAL interfaces for DFT/DCT, added new test
2016-04-08 11:19:28 +03:00
mvukad
695e33b25b
Fix missing format when writing Algorithm-based objects
...
Added a writeFormat() method to Algorithm which must be called by the
write() method of derived classes.
2016-04-07 13:49:42 -07:00
Maksim Shabunin
5877debb6f
HAL resize, warpAffine, warpPerspective interface
...
- added HAL documentation support
- added documentation to HAL replacement interface
- updated several HAL functions in imgproc module
2016-04-07 18:47:24 +03:00
Vladislav Sovrasov
72ed4173df
Change setRNGSeed implementaion
2016-03-29 10:54:42 +03:00
Vladislav Sovrasov
a490b64e9b
Add function setRNGSeed and seed setup in python tests
2016-03-25 16:17:58 +03:00
Dan
42c36c0c80
Linux bug fix for ThrustAllocator.
2016-03-14 11:09:52 -04:00
Suleyman TURKMEN
826b5a2ef1
Update lda.cpp
...
typo correction
2016-03-13 21:11:34 +02:00
Dan
95608b1b84
free -> deallocate
2016-02-18 15:05:13 -05:00
Dan
c0e34934c8
Merge branch 'thrust_allocator' of https://github.com/dtmoodie/opencv
2016-02-18 00:43:18 -05:00
Dan
da93a5527d
Thrust allocator for temporary storage used with thrust algorithms.
2016-02-18 00:23:20 -05:00
Alexander Alekhin
c7bdbef504
ocl: fix OpenGL sharing detection (6052)
...
Apple OpenCL framework hasn't OpenGL sharing extension
2016-02-11 12:46:22 +03:00
Alexander Alekhin
87bbaa2c27
ocl: OpenCL headers are located in "CL" subfolder (3rdparty/include)
2016-02-11 12:44:45 +03:00
Mathieu Barnachon
6a0d3b3e42
Called after setDevice.
...
Wrap in a cudaSafeCall.
2016-02-09 14:40:09 +01:00
Mathieu Barnachon
b030ac0433
Ensure the Cuda context is initialized correctly as long as the setDevice is not called in a multi-thread environment.
2016-02-08 13:17:08 +01:00
Vadim Pisarevsky
56c89e0a98
Merge pull request #6014 from alalek:fix_5991
2016-02-04 09:54:27 +00:00
Alexander Alekhin
7e472fbf68
ocl: thread-safe OpenCL loading (6056)
2016-02-03 18:30:40 +03:00
Alexander Alekhin
2978a16c85
core: preserve sizes values ( fixes #5991 )
...
_sizes can point to internal structure which is destroyed
by release() call
2016-01-26 14:11:53 +03:00
Alexander Alekhin
c7ccde69a8
fix TLS implementation ( closes #5967 )
2016-01-14 14:38:37 +03:00
Alexander Alekhin
6997d423c8
fix normalize in case of inplace operations
...
fixes #5876
2015-12-25 15:33:06 +03:00
jiakai
22599a2812
define g_haveOpenCL only when HAVE_OPENCL is set
2015-12-21 16:42:38 +08:00
jiakai
387214a3c5
do not open libopencl by dlopen if HAVE_OPENCL is not set
2015-12-21 12:04:00 +08:00
Vadim Pisarevsky
d02627ea89
Merge pull request #5825 from GregoryMorse:master
2015-12-18 07:17:29 +00:00
Gregory Morse
af5bec70cd
merge Win10/VS2015 changes
2015-12-17 21:15:59 +01:00
Vadim Pisarevsky
9aeb8c8d5a
Merge pull request #5810 from mshabunin:hal_interface
2015-12-17 16:48:02 +00:00
Maksim Shabunin
84f37d352f
HAL moved back to core
2015-12-17 12:33:23 +03:00
Alexander Alekhin
6f2632ca2e
ocl: fix buffer pool small allocations issue ( fixes #5815 )
2015-12-16 14:31:26 +03:00
Vadim Pisarevsky
b2bb7d075a
Merge pull request #5781 from jet47:fix-cuda-createContinuous
2015-12-16 10:37:56 +00:00
Alexander Alekhin
1836d41b5c
warning fix (MSVS2015)
2015-12-12 15:23:31 +03:00
Vadim Pisarevsky
d2e169929c
Merge pull request #5767 from dtmoodie:cpu_mat_memory_allocator
2015-12-10 14:32:04 +00:00
Vladislav Vinogradov
5576c0866a
fix #5780 : cv::cuda::createContinuous
effectively ignores its cols
argument
2015-12-10 14:20:28 +03:00
Alexander Alekhin
b26580cc7b
checkRange fixes
...
1) fix multichannel support
2) remove useless bad_value, read value from original Mat directly
3) add more tests
4) fix docs for cvCeil and checkRange
2015-12-09 18:31:27 +03:00
Vadim Pisarevsky
d6f8a75e91
Merge pull request #5230 from PhilLab:patch-4
2015-12-08 18:46:34 +00:00
Dan
237f33d4f0
nullptr -> NULL
2015-12-08 12:19:15 -05:00
Vadim Pisarevsky
a15707bc57
Merge pull request #5765 from alalek:pthread_mutex_init
2015-12-08 16:49:41 +00:00
Dan Moodie
4f373a4284
Clarified default allocator interface.
...
Conflicts:
modules/core/src/matrix.cpp
2015-12-08 09:45:13 -05:00
Vadim Pisarevsky
0de6165863
Merge pull request #5660 from jet47:cuda-wrap-stream-3.0
2015-12-08 12:28:57 +00:00
Alexander Alekhin
78376c0f63
pthreads: remove usage of non-POSIX PTHREAD_RECURSIVE_MUTEX_INITIALIZER* defines
...
related issues: #5423 #5476
2015-12-08 13:55:02 +03:00
Vadim Pisarevsky
d19897b734
Merge pull request #5651 from hoangviet1985:fix_solvePoly_3.0.0
2015-12-07 10:12:54 +00:00
Maksim Shabunin
b4bcdd10a1
HAL: improvements
...
- added new functions from core module: split, merge, add, sub, mul, div, ...
- added function replacement mechanism
- added example of HAL replacement library
2015-12-03 14:43:37 +03:00
Vadim Pisarevsky
313dc32606
Merge pull request #5644 from hoangviet1985:splitError
2015-11-23 13:43:38 +00:00
hoangviet1985
e679d97100
remove redundant code
2015-11-22 14:32:18 -05:00
Maksim Shabunin
ddf293a081
Merge pull request #5649 from hoangviet1985:solve_pow(x,3)=0_opencv300
2015-11-22 18:02:40 +00:00
hoangviet1985
3e96b724c2
squash
2015-11-20 15:03:32 -05:00
hoangviet1985
b96def885f
squash
2015-11-20 14:48:29 -05:00
Bert
d3b0bda011
Update system.cpp
...
FreeBSD compilation fix
2015-11-13 11:03:34 +03:00
Vladislav Vinogradov
39854ceda4
cuda::StreamAccessor::wrapStream and cuda::EventAccessor::wrapEvent
...
to import existed CUDA stream or CUDA event to OpenCV
2015-11-12 13:38:29 +03:00
Vadim Pisarevsky
b4112a5878
Merge pull request #5480 from dtmoodie:vecgpumat
2015-11-10 17:02:21 +00:00
Vadim Pisarevsky
3942b1f362
Merge pull request #5340 from alalek:ocl_off
2015-11-10 16:53:36 +00:00
Pavel Vlasov
4d7f9e06e8
Fix for copy with mask 32s and 8u;
2015-11-02 18:00:22 +03:00
Alexander Alekhin
debe99f1a4
Merge pull request #5531 from mshabunin:fix-vs2015
2015-10-22 16:28:42 +00:00
Maksim Shabunin
6e9d0d9a0c
Visual Studio 2015 warning and test fixes
2015-10-20 12:48:37 +03:00
grundman
f260b13a6f
Fix racy modification of ForThread::m_state during invocation of stop()
2015-10-16 11:19:27 -07:00
Dan
1e1f20cfee
Improved consistency.
2015-10-12 09:21:53 -04:00
Pavel Vlasov
89eee6ca99
Fixes for IPP integration:
...
dotProd_16s - disabled for IPP 9.0.0;
filter2D - fixed kernel preparation;
morphology - conditions fix and disabled FilterMin and FilterMax for IPP 9.0.0;
GaussianBlur - disabled for CV_8UC1 due to buffer overflow;
integral - disabled for IPP 9.0.0;
IppAutoBuffer class was added;
2015-10-12 10:51:28 +03:00
Dan Moodie
3d1355efc1
Updated input / output array to support std::vector<cv::cuda::GpuMat>
...
Expanded support for std::vector<cv::cuda::GpuMat>.
Whitespace fix.
2015-10-09 15:12:57 -04:00
Pavel Vlasov
6a0d3b06f2
IPPInitSingelton initialization guards;
2015-10-01 17:48:49 +03:00
Pavel Vlasov
e837d69f8f
IPPInitSingelton was added to contain IPP related global variables;
...
OPENCV_IPP env var now allows to select IPP architecture level for IPP9+;
IPP initialization logic was unified across modules;
2015-10-01 09:58:48 +03:00
Pavel Vlasov
62854dcc0d
Enables support of IPP 9.0.0;
...
HAVE_IPP_ICV_ONLY will be undefined if OpenCV was linked against ICV packet from IPP9 or greater. ICV9+ packets will be aligned with IPP in OpenCV APIs
This will ease code management between IPP and ICV
2015-09-29 17:27:13 +03:00
Pavel Vlasov
14b006e808
IPP_VERSION_X100 was changed to:
...
IPP_VERSION_MAJOR * 100 + IPP_VERSION_MINOR*10 + IPP_VERSION_UPDATE
to manage changes between updates more easily.
IPP_DISABLE_BLOCK was added to ease tracking of disabled IPP functions;
2015-09-25 17:50:15 +03:00
Pavel Vlasov
2177c7c5a8
Some IPP functions were encapsulated;
...
Minor changes to IPP implementations;
2015-09-25 17:30:26 +03:00
Pavel Rojtberg
46ada3886f
fix wrong parsing of values containing '='
...
fixes #5074
2015-09-24 17:27:07 +02:00
Pavel Rojtberg
1dd7f546af
CommandLineParser: add special <none> value to disallow empty strings
...
some mandatory string keys like paths must not be empty. Add the special
default value `<none>` so the CommandLineParser can enforce this and
generate an according error message for us.
2015-09-23 15:43:46 +02:00
Pavel Rojtberg
2ced3ba276
CommandLineParser: throw on programmer error
...
requesting a previously undeclared key is most likely an programming
error. e.g. a typo "--unused vs --unsued".
So throw in those cases.
Add an according failure testcase.
2015-09-22 18:35:49 +02:00
Alexander Alekhin
d430e802f2
cmdparser: allow empty strings
2015-09-22 16:34:37 +03:00
Vadim Pisarevsky
9f697510ca
Merge pull request #5320 from berak:lda_fix
2015-09-22 10:38:46 +00:00
Alexander Alekhin
4bb4c92f68
Merge pull request #5376 from lupustr3:pvlasov/compare_16s_fix
2015-09-22 10:19:52 +00:00
Pavel Vlasov
33e659786f
Fix for cmp16s condition during IPP compare type convert;
2015-09-21 18:11:20 +03:00
Gleb Gladilov
344d9fd83f
Fixed minMaxLoc and test functions
2015-09-21 13:44:29 +02:00
Elena Shipunova
5de01fde53
do not proceed with removing zero-length slice
2015-09-21 13:44:26 +02:00
Alexander Alekhin
05b1636780
Merge pull request #5329 from paroj:cliparser
2015-09-21 09:46:54 +00:00
berak
2f7c926670
remove usage of obsolete _dataAsRows flag
2015-09-21 07:59:09 +02:00
Pavel Rojtberg
96cc618410
has() should only test for the presence of the argument
...
and not consider its value. treat arguments with a set default value as
present.
2015-09-18 18:42:14 +02:00
Pavel Rojtberg
08dd126f08
do not parse empty default values and improve error messages
2015-09-18 18:42:14 +02:00
Alexander Alekhin
0051208684
Merge pull request #5349 from lupustr3:pvlasov/tls_update
2015-09-18 14:39:08 +00:00
Alexander Alekhin
af0942c78f
Merge pull request #5335 from Dikay900:ports_to_master
2015-09-18 11:06:08 +00:00
Pavel Vlasov
aa485ccd75
TLS keys leak fix;
...
Disables TLS copy constructor and operator, as they can lead to errors and reservation of too much keys in TLS storage;
gather method was added to TLS to gather data from all threads;
2015-09-17 15:49:20 +03:00
Vadim Pisarevsky
9533982729
Merge pull request #5272 from avershov:opencl-vaapi-fallback
2015-09-14 11:54:20 +00:00
Alexander Alekhin
1a8d7f9587
ocl: temporary restore OpenCL tls data (ocl.cpp without fix)
2015-09-13 20:28:24 +03:00
Alexander Alekhin
7213e5f68a
ocl: correct disabling of OpenCL code
2015-09-13 20:28:23 +03:00
Ilya Lavrenov
df21a24f81
initialize padding of CvString with zeros
2015-09-13 12:38:10 +02:00
Alexey Ershov
933ad0e710
changed call to getDefault: false to true in fallback initialization
2015-09-10 14:36:40 +03:00
Alexander Alekhin
ad70ab404c
ocl: workaround for getUMat()
2015-09-09 18:56:14 +03:00
Andrey Pavlenko
cea2dafa0f
man/unmap, preventing getMat/getUMat from temp object, fix thread-unsafe code in UMat::getMat()
2015-09-09 18:05:29 +03:00
Alexey Ershov
5937e58a11
fixed argument in call to {Input|Output}Array.getMat()
2015-09-08 19:29:19 +03:00
Alexey Ershov
949410b347
separated UMat & Mat usage in interop/copy paths
2015-09-07 21:11:05 +03:00
Maksim Shabunin
5618f4a0ac
Fixed: Concurrency static scheduler release failed
2015-09-07 18:59:07 +03:00
Alexey Ershov
68a0347ff9
fallback path & sample timing implemented
...
fixed trailing whitespaces
renamed files *vaapi* to *va_intel*
changed names vaapi* to va_intel*
2015-09-04 02:42:49 +03:00
Maksim Shabunin
771af4f32d
Some changes to support mingw-w64
...
- IPP is disabled by default when compiler is mingw (couldn't make it
work)
- fixed some warnings
- fixed some `__GNUC__` version checks (for correctness and convenience)
- removed UTF-8 BOM from hough.cpp (fixes #5253 )
2015-09-01 00:59:08 +03:00
Vadim Pisarevsky
a78da45fec
Merge pull request #5242 from apavlenko:fix_releasing_temp_umat
2015-08-27 14:29:52 +00:00
Andrey Pavlenko
b78eb49178
removing a garbage comment
2015-08-26 14:15:18 +03:00
Andrey Pavlenko
3656a1aa66
fixing 'map_unmap_counting'
2015-08-25 17:25:03 +03:00
Vadim Pisarevsky
edcd2c0408
Merge pull request #4176 from alalek:issue_4461
2015-08-25 07:46:28 +00:00
Andrey Pavlenko
f6b758699b
Revert merging of pull request #5176 from vladimir-dudnik:master (reverted from commit 5ee00ec5c6
)
2015-08-24 19:22:22 +03:00
Vadim Pisarevsky
bd8bac27d5
Merge pull request #4212 from ludv1x:master
2015-08-24 14:20:54 +00:00
Philippe FOUBERT
0629add3f7
Fix issue #5234 (UMat::convertTo when noScale)
2015-08-22 13:16:59 +02:00
Vitaliy Lyudvichenko
85cc11e3b2
Changed behaviour of Mat/UMat::reshape() to accept n-dim shapes
2015-08-21 19:26:29 +03:00
Alexander Alekhin
b8956d0111
tls: fix access to array (out of bounds)
2015-08-21 16:43:15 +03:00
Alexander Alekhin
44f8670719
Merge pull request #5171 from avershov:opencl-vaapi-interop
2015-08-21 12:47:14 +00:00
Philipp Hasper
14c0bfce2f
kmeans and one-dimensional vectors
...
Clustering one-dimensional data was only possible when storing it column-based. But a std::vector is interpreted as matrix with height=1
Additionally, made conditionals more readable
2015-08-21 14:20:00 +02:00
Alexey Ershov
a8656ea20f
implemented core support & sample
...
fixed whitespaces
fixed issues
fixed issue
fixed module statement issues
fixed access mode
added initialization check
fixed warning
2015-08-21 02:44:03 +03:00
Alexander Alekhin
887d8d091b
Merge pull request #5177 from lupustr3:pvlasov/tls_fixes
2015-08-19 14:18:19 +00:00
Pavel Vlasov
a33d98c13a
TLS memory leaks were fixed;
...
TLS was redesigned in more straightforward way;
OPENCV_ABI_COMPATIBILITY define was added;
2015-08-19 13:31:44 +03:00
Vladimir Dudnik
58934f1e16
add condition to skip unmapping for internally allocated host memory. this should fix map_unmap_counting UMat test.
2015-08-17 14:36:34 +03:00
Vladimir Dudnik
6bb8afde96
fixed typo in OpenCLAllocator.upload method (wrong step used)
2015-08-11 16:01:05 +03:00
Vladimir Dudnik
ef0d1c1c3f
fixed download/upload for non contiguous umat
2015-08-11 01:33:46 +03:00
Vadim Pisarevsky
3f65a86977
Merge pull request #4233 from alalek:fix_4006
2015-08-03 05:46:29 +00:00
Alexander Alekhin
fe0a17e78f
fix build with Win7 SDK
2015-07-30 23:23:01 +03:00
Alexander Alekhin
2b2bc83b61
Merge pull request #4238 from vladimir-dudnik:d3d11-nv12-interop
2015-07-30 10:36:25 +00:00
Vladimir Dudnik
6bd01a96d9
finished with NV12 support for D3D11-interop. Now, if texture is in NV12 format then it will be converted to/from BGR UMat.
2015-07-29 19:52:05 +03:00
Vladimir Dudnik
d4774ead43
d3d11-nv12 interop
...
fixed issues with ocl nv12 cvt kernel
finisged ocl nv12-to-rgba kernel, update dx-interop samples. (ocl rgba-to-nv12 kernel will be added later)
an attempt to fix build issue
fix for non opencl build issue
fix typo
fix compilation warnings
fix compile issue for Mac (OpenCL)
add convertion from rgba to nv12 (still need to debug kernel)
remove empty line at the EOF
fixed compilation warning
2015-07-29 19:52:03 +03:00
Alexander Alekhin
9bcccb028b
fixes
2015-07-28 11:49:46 +03:00
Alexander Alekhin
cd5c70691a
ocl: add map tests
2015-07-28 11:49:46 +03:00
Alexander Alekhin
b36f565d13
fix OpenCV code (bug 4006: #4862 )
2015-07-28 11:49:41 +03:00
Alexander Alekhin
c94945f892
Merge pull request #4210 from wangyan42164:ocl_umat_convert
2015-07-27 15:05:57 +00:00
Yan Wang
132416ebe9
It is unnecessary to use fma() if no scaling.
...
Signed-off-by: Yan Wang <yan.wang@linux.intel.com>
2015-07-23 10:18:11 +08:00
Alexander Alekhin
c0ee03fab2
Merge pull request #4192 from avershov:opencl-opengl-buffer
2015-07-21 14:10:00 +00:00
Alexey Ershov
89392b2a6d
added convertToGLBuffer() & convertFromGLBuffer() functions; added OpenGL interop sample comment
...
rewrite & change convertFromGLBuffer() & convertToGLBuffer() into acquireGLBuffer() & releaseGLBuffer(), respectively
opengl sample: added buffer support
tested and fixed buffer support on Windows
change glFlush() call to glFinish()
added UMat::release() call; fixed functions' names
adopted & implemented API suggestion(s) from Alexander
fixed unreachable code warning
added more info to the mapGLBuffer/unmapGLBuffer description
2015-07-21 13:42:17 +03:00
Alexander Alekhin
88e66697a7
t-api: try to fix allocator fallback code paths
...
issue: http://code.opencv.org/issues/4461
2015-07-09 13:26:59 +03:00
Alexander Alekhin
06617392e0
workaround issue with MatExpr
2015-07-08 16:22:59 +03:00
Vadim Pisarevsky
743160b6ac
Merge pull request #4158 from alalek:fix-bigdata-copyto
2015-07-08 10:00:46 +00:00
Alexander Alekhin
40fcc9d149
Merge pull request #4159 from alalek:vector_processing
2015-07-06 13:22:57 +00:00
Alexander Alekhin
c0d61964d6
ocl: fix unaligned memory access
...
http://code.opencv.org/issues/4462
2015-07-06 13:58:17 +03:00
Alexander Alekhin
f4cb8c0dfb
Merge pull request #4127 from avershov:opengl-opencl-sample
2015-07-02 11:24:10 +00:00
Alexander Alekhin
885114ed2c
fix copyTo memory corruption, but the main problem is still here
2015-07-02 13:47:59 +03:00
Vadim Pisarevsky
c81b064727
Merge pull request #4164 from kalistratovag:parallel_mjpeg
2015-07-01 14:15:52 +00:00
Alexey Ershov
9dfa9a81e9
eliminated unused function __OpenCLinitializeGL(), simplified query device logic
2015-06-30 21:27:15 +03:00
kalistratovag
65e0387aa5
Implementing parallel mjpeg encoder.
...
Fixed errors in parallel_for based on pthreads
Fixing compiler errore & removing whitespaces
Fixing prallel_for_pthreads error and warnings on win
2015-06-30 13:57:37 +03:00
Alexander Alekhin
50b453e5c2
Merge pull request #4150 from alalek:opencv_org_issue_4428
2015-06-29 17:08:54 +00:00
Alexey Ershov
3f68787051
OpenGL OpenCL sample
...
057cd52 first versions: cv::ogl::convertFromGLTexture2D & cv::ogl::convertToGLTexture2D
5656e94 added autogenerated stuff for cl_gl.h
765f1fd resolved CL functions in opengl.cpp
9f9fee3 implemented function cv::ogl::ocl::initializeContextFromGLTexture2D()
a792adb cv::ogl::ocl::initializeContextFromGLTexture2D() - added linux support (glx.h)
51c2869 added missing error message in function cv::ogl::ocl::initializeContextFromGLTexture2D()
513b887 fixed extension call in function cv::ogl::ocl::initializeContextFromGLTexture2D()
475a3e9 added CL-GL interop Windows sample (gpu/opengl_interop.cpp)
07af28f added building of CL-GL interop sample - Windows only
befe3a2 fixed whitespace errors & doxygen warnings (precommit_docs)
551251a changed function name to cv::ogl::ocl::initializeContextFromGL(), removed unused argument
4d5f009 changed CL_DEVICES_FOR_GL_CONTEXT_KHR to CL_CURRENT_DEVICE_FOR_GL_CONTEXT_KHR
9fc3055 changed CL_DEVICES_FOR_GL_CONTEXT_KHR to CL_CURRENT_DEVICE_FOR_GL_CONTEXT_KH
6d31cee Revert "changed CL_DEVICES_FOR_GL_CONTEXT_KHR to CL_CURRENT_DEVICE_FOR_GL_CONTEXT_KH"
cc6a025 added texture format check in cv::ogl::convertFromGLTexture2D()
063a2c1 CL-GL sample: added Linux implementation (Xlib/GLX)
c392ae9 fixed trailing whitespace
85a80d0 fixed include files
ae23628 excluded samples/opengl from build case 2
9870ea5 added android EGL support
530b64c added doxygen documentation comments to CL-GL interop functions
2015-06-26 14:15:12 +03:00
Alexander Alekhin
04b2edcc8c
ocl: autogenerated files for cl_gl.h
2015-06-26 14:08:27 +03:00
Alexander Alekhin
ee68d26f99
ocl: update generator scripts
2015-06-26 14:08:20 +03:00
Alexander Alekhin
24e2a008c3
copyTo: fix for big data (4Gb+)
2015-06-25 21:08:37 +03:00
Vadim Pisarevsky
56e637d5f4
Merge pull request #4135 from lupustr3:ipp_code_refactoring
2015-06-24 16:18:55 +00:00
Pavel Vlasov
e57609836c
Warning fix;
2015-06-24 14:43:42 +03:00
Pavel Vlasov
101607a7d0
Imgproc_Hist_MinMaxVal.accuracy fix;
...
Some code style corrections;
2015-06-24 13:50:17 +03:00
Alexander Alekhin
53fc5440d7
implement singleton lazy initialization
2015-06-23 14:38:45 +03:00
Vadim Pisarevsky
bd90425f71
Merge pull request #4136 from alalek:fix_matallocator_destruction
2015-06-22 11:49:36 +00:00
Vladimir Dudnik
217dd63e02
OpenCV-OpenCL interop (PR #4072 ):
...
Commits:
added new function, cv::ocl::attachContext(String& platformName, void* platformID, void* context, void* deviceID) which allow to attach externally created OpenCL context to OpenCV.
add definitions of clRetainDevice, clRetainContext funcs
removed definitions for clRetainContext, clRetainDevice
fixed build issue under Linux
fixed uninitialized vars, replace dbgassert in error handling
remove function which is not ready yet
add new function, cv::ocl::convertFromBuffer(int rows, int cols, int type, void* cl_mem_obj, UMat& dst, UMatUsageFlags usageFlags = cv::USAGE_DEFAULT) which attaches user allocated OpenCL clBuffer to UMat
uncommented clGetMemObjectInfo definition (otherwise prevent opencv build)
fixed build issue on linux and android
add step parameter to cv::ocl::convertFromBuffer func
suppress compile-time warning
added sample opencl-opencv interoperability (showcase for cv::ocl::convertFromBuffer func)
CMakeLists.txt modified to not create sample build script if OpenCL SDK not found in system
fixed build issue (apple opencl include dir and spaces in CMake file)
added call to clRetainContext for attachContext func and call to clRetainMemObject for convertFromBuffer func
uncommented clRetainMemObject definition
added comments and cleanup
add local path to cmake modules search dirs (instead of replacing)
remove REQUIRED for find_package call (sample build together with opencv). need to try standalone sample build
opencl-interop sample moved to standalone build
set minimum version requirement for sample's cmake to 3.1
put cmake_minimum_required under condition, so do not check if samples not builded
remove code dups for setSize, updateContinuityFlag, and finalizeHdr
commented out cmake_minimum_required(VERSION 3.1)
add safety check for cmake version
add convertFromImage func and update opencl-interop sample
uncommented clGetImageInfo defs
uncommented clEnqueueCopyImageToBuffer defs
fixed clEnqueueCopyImageToBuffer defs
add doxygen comments
remove doxygen @fn tag
try to restart buildbot
add doxygen comments to directx interop funcs
remove internal header, use fwd declarations in affected compile units instead
2015-06-19 13:33:22 +03:00
Alexander Alekhin
032b6322fc
fix MatAllocator creation/destruction issues
2015-06-18 14:02:01 +03:00
Dmitry Budnikov
a5a21019b2
ipp_countNonZero build fix;
...
Removed IPP port for tiny arithm.cpp functions
Additional warnings fix on various platforms.
Build without OPENCL and GCC warnings fixed
Fixed warnings, trailing spaces and removed unused secure_cpy.
IPP code refactored.
IPP code path implemented as separate static functions to simplify future work with IPP code and make it more readable.
2015-06-18 12:47:07 +03:00
Alexander Alekhin
a482dcce46
fix support for pthreads parallel_for
2015-06-11 16:53:07 +03:00
Maksim Shabunin
1f983ec39c
Fixed compilation of pthread-based parallel_for with gcc 4.4.3
2015-06-09 13:59:48 +03:00
Vadim Pisarevsky
a4073ed676
made improvements in ECC code (comments from https://github.com/Itseez/opencv/pull/3845 ), as well as minor tweak in pthread's based parallel for.
2015-06-02 01:30:00 +03:00
kalistratovag
bf5393ae58
parallel for on pthreads initial commit
...
removing trailing whitespaces
Compilation error on Mac fix & warning on android
Warnings fixed on iOs
2015-06-01 17:48:21 +03:00
Vadim Pisarevsky
dc587992a0
Merge pull request #4053 from vpisarev:calib3d_fixes
2015-05-26 11:23:50 +00:00
Vadim Pisarevsky
85149f8686
hack solvePoly to finds roots of polynoms with zero higher-order coefficients. The roots are populated in this case, which is not valid, strictly speaking, but good enough for function like correctMatches. This solves http://code.opencv.org/issues/4330
2015-05-25 23:43:39 +03:00
Camille
55a5e6be2c
bug fix 3696
2015-05-23 19:09:31 +02:00
Alexander Smorkalov
acbc7362db
Bug #3611 Initializing static cv::Mat with cv::Mat::zeros causes segmentation fault fixed.
...
fix MatOpInitializer
2015-05-23 17:36:35 +02:00
Vadim Pisarevsky
d2aaa70e93
removed HAL calls from public OpenCV headers; put IPP calls back to hall:sqrt() and such (but they are disabled for now)
2015-05-22 16:04:10 +03:00
Vadim Pisarevsky
432cf1152a
added read/write functions for vector<DMatch> + the test ( http://code.opencv.org/issues/4308 )
2015-05-18 18:59:37 +03:00
Vadim Pisarevsky
a85bb091c8
Merge pull request #4018 from vpisarev:imgproc_video_fixes
2015-05-15 16:15:01 +00:00
Vadim Pisarevsky
4a9b244577
hopefully, fixed crash in sparse mat test
2015-05-15 16:15:10 +03:00
Vadim Pisarevsky
66eb270cf7
significantly reduced sparse matrix footprint:
...
http://code.opencv.org/issues/2206 ,
http://code.opencv.org/issues/2924
2015-05-14 23:29:09 +03:00
Vadim Pisarevsky
7dd67f4997
use _mm_sad_epu8 instead of _mm_movemask_epi8 in countNonZero. Therefore, we do not need any popcount table and this solves http://code.opencv.org/issues/4333
2015-05-14 20:33:53 +03:00
Maksim Shabunin
e22c09c601
Documentation for CommandLineParser
2015-05-13 18:46:21 +03:00
Vadim Pisarevsky
d280205245
fixed compile errors on ARM, as well as failures in OCL_Dft* regression tests
2015-05-06 10:00:10 +03:00
Vadim Pisarevsky
0ee8634b2f
fixed random failures in Core_DFT.complex_output2 test (the case of input_mat.cols == 1)
2015-05-05 20:31:30 +03:00
Vadim Pisarevsky
73f760fdf0
some more compile warnings fixed
2015-05-05 18:03:40 +03:00
Vadim Pisarevsky
931a519969
fixed warning in mathfuncs
2015-05-05 17:49:36 +03:00
Vadim Pisarevsky
63a63e3eaa
extended MinProblemSolver::Function to 1) report the space dimensionality, 2) compute gradient if needed
2015-05-05 15:56:06 +03:00
Vadim Pisarevsky
5a31f6b4e1
ok, so probably the failure in downhill simplex has been finally solved
2015-05-03 11:31:49 +03:00
Vadim Pisarevsky
01e351de37
refactored downhill simplex implementation a bit; hopefully, fixed the bug with random failures in the tests
2015-05-03 02:29:15 +03:00
Vadim Pisarevsky
a33d7928a4
Fixed getContinuousSize() to handle huge matrices properly. This should solve http://code.opencv.org/issues/3232
2015-05-02 22:12:12 +03:00
Vadim Pisarevsky
9c0b740b84
improved randShuffle as suggested by John Stark in http://code.opencv.org/issues/3730
2015-05-02 21:31:47 +03:00
Vadim Pisarevsky
96b5ec14d4
fixed bug with simplex initialization in Downhill solver (thanks to Toby Rule for the patch): http://code.opencv.org/issues/3990 .
...
re-enabled Downhill solver test, let's see how it goes
2015-05-02 20:59:57 +03:00
Vadim Pisarevsky
9fbd1d68ad
refactored div & pow funcs; added tests for special cases in pow() function.
...
fixed http://code.opencv.org/issues/3935
possibly fixed http://code.opencv.org/issues/3594
2015-05-01 21:49:11 +03:00
Vadim Pisarevsky
74e2b8cbcb
fixed invalid output of cv::dft when using DFT_ROWS + DFT_COMPLEX_OUTPUT ( http://code.opencv.org/issues/3428 )
2015-04-29 23:08:22 +03:00
Vadim Pisarevsky
7918267d02
fixed U non-orthogonality in SVD ( http://code.opencv.org/issues/3801 )
2015-04-29 16:09:58 +03:00
Vadim Pisarevsky
135fa85e8f
added vector<bool> support ( http://code.opencv.org/issues/3826 )
2015-04-29 15:36:45 +03:00
Vadim Pisarevsky
d8f360316d
fixed http://code.opencv.org/issues/3828 , added test. thanks to berak for the patch
2015-04-29 13:12:53 +03:00
Vadim Pisarevsky
0746ec7035
fixed http://code.opencv.org/issues/4278
2015-04-29 11:58:49 +03:00
Vadim Pisarevsky
a8226814c5
fixed http://code.opencv.org/issues/4189
2015-04-29 11:47:12 +03:00
Boris Daskalov
3b5530fb67
Fix a bad free space check in icvGrowSeq.
...
A difference of two pointers was casted to unsigned which can lead to
overflow on 64-bit systems.
2015-04-25 18:54:51 +02:00
Vadim Pisarevsky
7dd19f6d97
Merge pull request #3939 from paroj:fmtfix
2015-04-22 12:15:17 +00:00
Vadim Pisarevsky
063e4004ba
Merge pull request #3935 from vpisarev:extending_hal_part1
2015-04-21 14:02:02 +00:00
Pavel Rojtberg
b02fc039cd
fix python and numpy output formatters
2015-04-17 17:42:24 +02:00
Vadim Pisarevsky
ee11a2d266
fully implemented SSE and NEON cases of intrin.hpp; extended the HAL with some basic math functions
2015-04-16 23:00:26 +03:00
Ilya Lavrenov
17bedd72f0
workaround for stange internal compiler error when cross-compiling with 4.6
2015-04-14 16:55:11 +03:00
Vadim Pisarevsky
5b3f89df0a
Merge pull request #3911 from vpisarev:core_tweaks
2015-04-07 19:03:48 +00:00
Vadim Pisarevsky
259e41da73
reverted "typedef _InputArray InputArray", fixed warning from doxygen
2015-04-07 19:34:51 +03:00
Maksim Shabunin
35b2139929
Basic HAL module
2015-04-07 18:16:22 +03:00
Vadim Pisarevsky
052593c760
1. Input/OutputArray optimizations;
...
2. Algorithm::load/save added (moved from StatModel)
3. copyrights updated; added copyright/licensing info for ffmpeg
4. some warnings from Xcode 6.x are fixed
2015-04-07 16:44:26 +03:00
Dmitry-Me
75a65542ba
Reduce variables scope
2015-03-25 17:17:06 +03:00
Vadim Pisarevsky
f7f10a7d2c
Merge pull request #3854 from Dmitry-Me:reduceVariableScope8
2015-03-24 15:13:34 +00:00
Dmitry-Me
ce167e233b
Reduce variable scope
2015-03-20 13:27:08 +03:00
Alexander Alekhin
b5529aef9c
ocl/bufferpool: fix cleanup
2015-03-20 13:21:10 +03:00
Vadim Pisarevsky
9683f93a76
Merge pull request #3840 from MSOpenTech:concurrency-contrib
2015-03-19 15:11:01 +00:00
Vadim Pisarevsky
9997e6d337
Merge pull request #3823 from lupustr3:pvlasov/implementation_detector_update
2015-03-19 12:47:22 +00:00
Maxim Kostin
d1a8171011
Added WinRT concurrency support.
2015-03-18 15:13:08 +03:00
Dmitry-Me
8ed4bae4dd
Reduce variable scope, make formatting consistent with surrounding code
2015-03-14 12:50:42 +03:00
Pavel Vlasov
085409340e
Implementation collector update;
...
Moved out of TLS to properly collect data from different threads;
Concurrent access guards were added to collection functions;
2015-03-12 17:58:03 +03:00
Roman Donchenko
6274e1fc5f
cvOpenFileStorage: reduce the scope of xml_buf and make sure it's freed...
...
... before any exceptions occur.
(cherry picked from commit 08da247a87
)
2015-03-10 12:38:45 +03:00
Roman Donchenko
6b8ebf3e94
Fix memory leaks appearing when cvOpenFileStorage throws
...
(cherry picked from commit 16ce114e0c
)
2015-03-10 12:38:45 +03:00
Vadim Pisarevsky
64bd77cf4d
Merge pull request #3794 from vpisarev:ubuntu_x86_fixes
2015-03-09 21:34:17 +00:00
Vadim Pisarevsky
5d8a1bf813
explained reason why IPP is disabled on 32-bit Linux. disable IPP's minMaxLoc_32f on 32-bit OSes only
2015-03-06 15:02:22 +03:00
Dmitry-Me
2fb4178f7c
Reduce variable scope
2015-03-06 10:17:37 +03:00
Vadim Pisarevsky
7a52c3d9f1
1) make sure Python 2.7 is detected on Ubuntu (including libs) when it's there.
...
2) disable IPP on 32-bit Linux when OpenCV is built as shared libs. Otherwise we get linker errors
3) disable IPP's minMaxIdx 32-bit floating-point flavor in a hope that it fixes some test failures
2015-03-05 18:58:54 +03:00
Vadim Pisarevsky
aa113c5a85
Merge pull request #3787 from Dmitry-Me:fixNullPointerDereference2
2015-03-05 12:19:02 +00:00
Dmitry-Me
d164d51380
Fix potential null pointer dereference
2015-03-05 10:31:51 +03:00
Vadim Pisarevsky
f9b8ce1d25
Merge pull request #3784 from ilya-lavrenov:nvidia_android
2015-03-05 06:43:03 +00:00
Ilya Lavrenov
cf4c79ebdd
disabled some kernels for Android && NVidia
2015-03-04 14:46:49 +03:00
Vadim Pisarevsky
ebac739a35
Merge pull request #3781 from Dmitry-Me:dangerousStringManipulation
2015-03-04 10:45:25 +00:00
Dmitry-Me
f8c454733b
Remove dangerous string manipulation, merge declaration with assignment
2015-03-04 10:35:06 +03:00
Dmitry-Me
c587bae997
Uniform computation of variable size
2015-03-03 16:11:07 +03:00
Vadim Pisarevsky
6820292256
Merge pull request #3700 from MSOpenTech:winrt-cmake-contrib
2015-03-03 12:52:03 +00:00
Vadim Pisarevsky
d3b74cdcb2
Merge pull request #3773 from ilya-lavrenov:warning
2015-03-03 12:33:17 +00:00
Dmitry-Me
1b7066fc95
Reduce variable scope, only compute values that are used later
2015-03-03 11:04:22 +03:00
Maxim Kostin
22a30af353
Fixing buildbot job:
...
- explicitly turning OCL off since WinRT does not support it
- fixing macro definitions in core/ocl.cpp
Signed-off-by: Maxim Kostin <v-maxkos@microsoft.com>
2015-03-02 22:26:45 +03:00
Maxim Kostin
cdd23440c9
Adding support for WinRT(WinPhone 8/8.1 and Win Store) via CMake 3.1
...
- Substituted HAVE_WINRT with WINRT
- Fixed compilation issues in ocl.cpp and parallel.cpp
- Fixed compiler issue for WP8: "C2678: binary '+' : no operator found which takes a left-hand - Fixed gitignore
- Added #ifdef HAVE_OPENCL to remove compiler warnings in ocl.cpp
- Used NO_GETENV similar to '3rdparty\libjpeg\jmemmgr.c;
- Added ole32.lib for core module (for WindowsStore 8.0 builds)
- Made OpenCV_ARCH aware of ARM
Signed-off-by: Maxim Kostin <v-maxkos@microsoft.com>
2015-03-02 22:26:43 +03:00
Ilya Lavrenov
ffc453ebfd
fixed [-Wsign-compare] warning
2015-03-02 16:41:06 +03:00
Vadim Pisarevsky
8e37a693e5
Merge pull request #3758 from ilya-lavrenov:aarch64
2015-03-02 10:50:31 +00:00
Vadim Pisarevsky
845d3edb67
Merge pull request #3757 from ilya-lavrenov:popcnp
2015-03-02 10:50:08 +00:00
Vladislav Vinogradov
4254e35f6a
merge new setUseOptimizations function with existed setUseOptimized
2015-02-27 14:24:51 +03:00
Vladislav Vinogradov
c10f1851cb
rename setOptimizations -> setUseOptimizations
2015-02-27 12:52:28 +03:00
Vladislav Vinogradov
cda6fed41f
move tegra namespace out of cv to prevent conflicts
2015-02-27 12:52:11 +03:00
Ilya Lavrenov
c5a80193a8
aarch64 support
2015-02-26 19:43:32 +03:00
Vladislav Vinogradov
12180bd7b1
add generic method to disable/enable all optimizations (IPP, OpenCL, Tegra)
2015-02-26 19:36:04 +03:00
Vladislav Vinogradov
44e41baffe
use new functions before all tegra:: calls
2015-02-26 19:34:58 +03:00
Vladislav Vinogradov
f8484e3266
add fucntion to control tegra optimizations in run-time
2015-02-26 19:34:20 +03:00
Ilya Lavrenov
b963d84b41
fixed bug #4215
2015-02-25 21:57:17 +03:00
Ilya Lavrenov
5f7a994ad1
fixed internal compiler error
2015-02-20 15:46:22 +03:00
Vadim Pisarevsky
7b270f4c69
Merge pull request #3679 from mshabunin:remove-algorithm-factory
2015-02-19 18:43:58 +00:00
Dmitry-Me
103687871d
Reduce variable scope
2015-02-16 15:39:52 +03:00
Maksim Shabunin
da383e65e2
Remove deprecated methods from cv::Algorithm
2015-02-16 15:28:54 +03:00
Vladislav Vinogradov
a932d8aba1
use full qualified name for all internal namespaces to prevent ambiguous symbol errors
2015-02-16 11:31:21 +03:00
Dmitry-Me
ea48fcfc9f
Fix possible null pointer dereference
2015-02-09 12:38:02 +03:00
Alexander Alekhin
2054123890
coreTlsData -> getCoreTlsData()
2015-02-04 13:03:27 +03:00
Alexander Alekhin
f282fd0ebf
ocl: print missing error message only if OPENCV_OPENCL_RUNTIME is used
2015-01-29 13:16:31 +03:00
Vadim Pisarevsky
97aa8d3341
Merge pull request #3603 from alalek:ocl_svm
2015-01-29 05:35:46 +00:00
Alexander Alekhin
0a07d780e0
ocl: OpenCL SVM support
2015-01-23 20:37:45 +03:00
Ilya Lavrenov
dcb0c68ed3
fix for -m32
2015-01-23 17:26:48 +03:00
Vadim Pisarevsky
f93be25cc8
Merge pull request #3629 from nstiurca:matx-fix
2015-01-22 09:43:33 +00:00
Vadim Pisarevsky
03fc3d1ceb
Merge pull request #3591 from ilya-lavrenov:sse_avx
2015-01-21 10:46:23 +00:00
Nicu Știurcă
7a3ca99894
fix _OutputArray::assign() for _OutputArrays of type MATX when data types differ
2015-01-20 23:49:04 -05:00
Ilya Lavrenov
e9a6c5db21
sqsum
2015-01-20 21:10:06 +03:00
Vadim Pisarevsky
21a9a17478
Merge pull request #3568 from ilya-lavrenov:neon_runtime
2015-01-20 12:47:51 +00:00
Vadim Pisarevsky
e501c61c04
Merge pull request #3611 from jet47:fix-gpumat-setto
2015-01-20 12:37:45 +00:00
Nicu Știurcă
e97d18ba81
implement _OutputArray::assign() for _OutputArrays of type MATX
2015-01-19 16:20:10 -05:00
Vadim Pisarevsky
5e92a777ef
Merge pull request #3569 from ilya-lavrenov:sse_mul
2015-01-19 13:42:36 +00:00
Vladislav Vinogradov
f676bfb3d7
fix GpuMat::setTo method in case if mask is empty:
...
it might be called from _OutputArray::setTo
2015-01-15 19:33:27 +03:00
Vadim Pisarevsky
713aa5c5b9
Merge pull request #3557 from ilya-lavrenov:sse_add_weighted
2015-01-15 13:14:06 +00:00
Vadim Pisarevsky
16b56e71d9
Merge pull request #3561 from jet47:cuda-arithm-refactoring
2015-01-15 11:44:35 +00:00
Adam Borowski
7b0d1c932e
Get rid of sysctl includes on Linux.
...
The 'sysctl' syscall has been strongly deprecated on Linux for ages.
Currently, on old architectures it will spam syslog whenever used, and on
newer ones it's missing from the headers altogether. Opencv has migrated
away on Linux already, but #includes were left lingering. This commit
removes them on non-__APPLE__, unbreaking x32 (and probably others).
2015-01-13 01:52:56 +01:00
Ilya Lavrenov
81f786393a
NEON detection in runtime
2015-01-13 01:35:53 +03:00
Ilya Lavrenov
8c94568cc3
cv::sum
2015-01-12 10:59:31 +03:00
Ilya Lavrenov
6bce6ee34a
checks
2015-01-12 10:59:31 +03:00
Ilya Lavrenov
1d3c860411
SinCos_32f
2015-01-12 10:59:31 +03:00
Ilya Lavrenov
ef29b15c9a
reciprocal
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
44d89638fd
divide
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
f2cd65cf1e
fixes
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
25e99c453f
avx2 in arithm
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
70933ea999
convert from f64
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
612b8ce2cb
fixed compilation issues
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
d87457a025
split/merge
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
fc0869735d
used popcnt
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
bc394e7516
detection of other CPU features
2015-01-12 10:59:30 +03:00
Ilya Lavrenov
63fc6ef316
convertTo from 64f
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
8870ef4159
convertTo from 16u, 16s, 32s, 32f
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
116fb275a8
convertTo from 8s
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
19e77e4787
convertTo from 8u
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
b758dbd384
convertTo AVX2
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
3a78a22733
convertScaleAbs for s8, f64
2015-01-12 10:59:29 +03:00
Ilya Lavrenov
5578088983
countNonZero
2015-01-12 10:59:28 +03:00
Ilya Lavrenov
972ff1d0c4
polarToCart
2015-01-12 10:59:28 +03:00
Ilya Lavrenov
0a5c9cf145
magnitude 64f
2015-01-12 10:59:28 +03:00
Ilya Lavrenov
6ab928fb39
phase 64f
2015-01-12 10:59:28 +03:00
Ilya Lavrenov
a2a8ba17fc
compare
2015-01-12 10:59:28 +03:00
Ilya Lavrenov
8d48632ebe
avx2
2015-01-12 10:59:28 +03:00
Joe Howse
379de5708f
Fix shadowed variable warning
2015-01-05 10:56:46 -04:00
Ilya Lavrenov
68962adc54
SSE mul
2014-12-31 17:58:54 +03:00
Ilya Lavrenov
60f2f7898a
SSE4.1 addWeighted fo 16u
2014-12-31 17:55:30 +03:00
Vadim Pisarevsky
2f6db4dfac
Merge pull request #3547 from ilya-lavrenov:ocl_setto
2014-12-31 09:30:40 +00:00
Vadim Pisarevsky
f792fdc3e0
Merge pull request #3559 from ilya-lavrenov:sse_dot_s8
2014-12-31 08:06:06 +00:00
Vladislav Vinogradov
b4e7ee46c6
fix compilation without CUDA
2014-12-30 11:06:33 +03:00
Vladislav Vinogradov
00e7816c1b
add auxiliary functions to work with Input/Output arrays:
...
they allow to perform asynchronous upload/download into temporary buffer
to get valid GpuMat object
2014-12-30 11:06:32 +03:00
Ilya Lavrenov
f57136fd79
SSE2 cv::Mat::dot
2014-12-30 00:34:09 +03:00
Ilya Lavrenov
f6b3bc01e5
addWeighted
2014-12-29 22:01:53 +03:00
Ilya Lavrenov
1af7d397d4
optimization of UMat::setTo
2014-12-29 13:34:21 +03:00
Vadim Pisarevsky
0ff67253f7
Merge pull request #3531 from jet47:cuda-core-refactoring
2014-12-26 12:12:42 +00:00
Vladislav Vinogradov
f36546dbd2
improve error reporting in _InputArray methods
2014-12-26 12:03:25 +03:00
Vladislav Vinogradov
f054d6316a
add cuda::HostMem::getAllocator method
...
it allows to use cudaHostAlloc methods for cv::Mat objects
2014-12-23 17:42:49 +03:00
Vladislav Vinogradov
53862687d5
rename CudaMem -> HostMem to better reflect its purpose
2014-12-23 17:42:49 +03:00
Vladislav Vinogradov
9210d8e542
move allocMatFromBuf function to farneback.cpp:
...
* it is the only place, where it is used
* no need to make this function public
2014-12-23 17:42:49 +03:00
Vladislav Vinogradov
68e08bbecd
fix null stream initialization for multi-gpu systems
2014-12-23 17:41:24 +03:00
Vladislav Vinogradov
05d40946f3
move StackAllocator to cpp file
...
it is internal class, no need to export it
2014-12-23 17:41:24 +03:00
Vladislav Vinogradov
7ed38b97c3
fix cuda::BufferPool deinitialization
...
The deinitialization of BufferPool internal objects is controled by global
object, but it depends on other global objects, which leads to errors
caused by undefined deinitialization order of global objects.
I merge global objects initialization into single class, which performs
initialization and deinitialization in correct order.
2014-12-23 17:41:24 +03:00
Chuanbo Weng
2d8c89c40b
Remove unnecessary kercn limitation of 4.
...
When accessing global memory by DWORD4, memory bandwidth
can be fully utilized on Intel platform. This patch will
make more image format(e.g. 8UC4) be processed in DWORD4
by work-item. After applying this patch, 3 subcase of
./opencv_perf_core --gtest_filter=OCL_RepeatFixture_Repeat.Repeat/*
can be speedup on HD4000 graphics card with Beignet:
OCL_RepeatFixture_Repeat.Repeat/2, 64% improvement.
OCL_RepeatFixture_Repeat.Repeat/6, 50% improvement.
OCL_RepeatFixture_Repeat.Repeat/8, 56% improvement.
Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
2014-12-04 11:15:13 +08:00
Dmitry-Me
4ff8a3ad92
Fix incorrect size computation
2014-11-26 12:24:53 +03:00
Alexander Alekhin
f50f249f80
Merge pull request #3138 from alalek:icv_update
2014-11-06 15:58:14 +00:00
Alexander Karsakov
462c3c25a9
Removed incorrect using of rootn() and powr() in ocl_pow
2014-11-06 16:23:02 +03:00
Alexander Alekhin
4eb16122c0
ocl: change processing of OpenCL failures
...
disable "unwanted" messages
2014-11-05 19:44:36 +03:00
Alexander Alekhin
1c9f590f0d
IPPICV: disable NormDiff_L1_16s_C1R for IPP/ICV 8.2/8.2.1
2014-11-05 13:26:23 +03:00
Ilya Lavrenov
5ca25ab8f0
cv::pow (integer power)
2014-11-01 13:19:51 +03:00
Ilya Lavrenov
ccdc71286c
cv::polarToCart
2014-11-01 13:19:51 +03:00
Ilya Lavrenov
d5f006eee5
cv::magnitude; cv::corner**
2014-11-01 13:19:51 +03:00
Ilya Lavrenov
fb97273b3c
cv::phase; cv::cartToPolar
2014-11-01 13:19:51 +03:00
Alexander Alekhin
fd59551ff0
Merge pull request #3354 from vbystricky:oclopt_convertScaleAbs
2014-10-29 13:53:56 +00:00
ElenaGvozdeva
d88fdd0378
use LOCAL_SIZE+1
2014-10-28 15:18:31 +03:00
ElenaGvozdeva
65b8a1cb37
Some small fixes
2014-10-27 14:38:22 +03:00
Elena Gvozdeva
c5a2879ce0
use vectors
2014-10-27 14:38:22 +03:00
Elena Gvozdeva
2d89df1804
use local memory
2014-10-27 14:38:21 +03:00
Elena Gvozdeva
d78bc3c321
naive implementation
2014-10-27 14:38:21 +03:00
Alexander Alekhin
dee56598e9
Merge pull request #3369 from vbystricky:fix_scaleAdd
2014-10-27 10:03:29 +00:00
Alexander Alekhin
1f08d8cb6f
Merge pull request #3367 from akarsakov:ocl_image2d
2014-10-24 16:01:21 +00:00
vbystricky
8466911ad0
Move _dst.create() to the begining of scaleAdd function
2014-10-24 18:27:47 +04:00
Alexander Karsakov
237cb93143
Added extra checks to ocl::Image2D
2014-10-24 15:04:42 +03:00
Alexander Alekhin
579a7fff6d
ocl: restore clFinish() in unmap() for AMD devices
...
This reverts commit 7d91b8efcd
.
2014-10-24 14:29:38 +04:00
Alexander Karsakov
3a263c6326
Added tests for Image2D
2014-10-23 14:23:37 +03:00
vbystricky
a8aa6381d9
Optimize OpenCL version of conversScaleAbs function
2014-10-21 19:20:20 +04:00
ElenaGvozdeva
070e5ec042
Changed predictOptimalVectorWidth function, now it is possible to choose vector size.
2014-10-21 13:13:15 +03:00
Vadim Pisarevsky
926b64fff7
Merge pull request #3292 from mshabunin:fix-ios-warnings
2014-10-20 06:41:51 +00:00
Vadim Pisarevsky
d2b9dc5530
quickly corrected the previous refactoring of features2d: moved from set(SOME_PROP, val) to setSomeProp(val)
2014-10-18 20:44:26 +04:00
Maksim Shabunin
ef3d02214b
Fixing iOS clang warnings, part 2
2014-10-17 18:14:54 +04:00
Vadim Pisarevsky
01d3848f17
all the tests now pass except for MSER
2014-10-17 14:56:58 +04:00
Pavel Vlasov
45958eaabc
Implementation detector and selector for IPP and OpenCL;
...
IPP can be switched on and off on runtime;
Optional implementation collector was added (switched off by default in CMake). Gathers data of implementation used in functions and report this info through performance TS;
TS modifications for implementations control;
2014-10-15 14:24:41 +04:00
Vadim Pisarevsky
a798386660
Merge pull request #3326 from ilya-lavrenov:neon_canny
2014-10-11 17:58:24 +00:00
Vadim Pisarevsky
a3916113b9
Merge pull request #3254 from ilya-lavrenov:neon_scale_add
2014-10-10 14:26:14 +00:00
Ilya Lavrenov
5f23d99918
the rest modes of cv::Mat::convertTo
2014-10-10 14:10:50 +00:00
Ilya Lavrenov
4babecf3b0
fixes for cv::addWeighted and cv::Mat::dot
2014-10-09 12:55:52 +00:00
vbystricky
1d280352f4
Add code for print errors of OpenCL kernels runing
2014-10-09 13:59:38 +04:00
Ilya Lavrenov
00f16e9178
neon
2014-10-03 08:43:02 +00:00
Ilya Lavrenov
be3efdf274
cv::sum refactoring
2014-09-30 14:36:21 +00:00
Ilya Lavrenov
a3e56114d1
cv::multiply
2014-09-30 14:20:22 +00:00
Ilya Lavrenov
1c491c42cd
fix for cornerHarris
2014-09-29 14:59:46 +00:00
Ilya Lavrenov
bbc161e1cb
fix for cv::Mat::convertTo with scale
2014-09-28 14:51:30 -07:00
Ilya Lavrenov
f50f0ba63e
cv::norm
2014-09-28 07:28:33 -07:00
Ilya Lavrenov
44ea50f1c4
cv::countNonZero
2014-09-28 07:06:53 -07:00
Ilya Lavrenov
34a571d37f
cv::Mat::dot
2014-09-28 05:00:22 -07:00
Ilya Lavrenov
e46332a183
cv::Mat::convertTo with scale and shift
2014-09-28 03:49:56 -07:00
Ilya Lavrenov
74e60e44ad
cv::compare
2014-09-28 02:41:08 -07:00
Ilya Lavrenov
857a2d5bfd
cv::addWeighted
2014-09-28 01:11:07 -07:00
Maksim Shabunin
047abb0050
Merge pull request #3258 from ilya-lavrenov:neon_convert
2014-09-26 09:27:16 +00:00
Ilya Lavrenov
345b1369be
correct neon rounding
2014-09-25 07:54:52 +00:00
Ilya Lavrenov
5d018c090f
Neon optimization of cv::scaleAdd (CV_32F)
2014-09-23 21:16:29 +04:00
Ilya Lavrenov
4b3f2c1972
Neon optimization of Mat::convertTo
2014-09-23 15:06:17 +00:00
Vadim Pisarevsky
281ce441a8
Merge pull request #3250 from ilya-lavrenov:neon_convert_scale_abs
2014-09-23 07:15:24 +00:00
Ilya Lavrenov
515be70867
Neon optimization of cv::convertScaleAbs
2014-09-22 15:47:46 +00:00
Ilya Lavrenov
27b933ba5a
Neon optimization of cv::sum
2014-09-22 09:22:03 +00:00
Vadim Pisarevsky
06e55ddf38
Merge pull request #2893 from ilya-lavrenov:tapi_vector_width_intel
2014-09-18 12:05:24 +00:00
Vadim Pisarevsky
4057e27539
Merge pull request #3126 from avdmitry:move_KDTree_to_ml
2014-09-14 18:57:23 +00:00
Alexander Karsakov
c942c6539a
Remove mul24 since id can be larger 2^23
2014-09-08 13:11:58 +04:00
Vadim Pisarevsky
26c284b225
Merge pull request #3167 from akarsakov:ocl_rm_clFinish
2014-09-04 17:00:10 +00:00
Vadim Pisarevsky
64a53de27d
Merge pull request #3185 from ElenaGvozdeva:ocl_norm
2014-09-04 08:53:47 +00:00
Ilya Lavrenov
98e7d4ceec
changed optimal vector width for Intel
2014-09-04 11:59:41 +04:00
Elena Gvozdeva
9fe11db7e2
disabled IPP acceleration for 3-channel norms and for CV_8S only for APPLE
2014-09-04 10:38:45 +04:00
Alexander Karsakov
7d91b8efcd
Removed redundant clFinish() after clEnqueueUnmapMemObject()
...
sss
2014-09-03 14:54:05 +04:00
Alexander Karsakov
f57a4bf87b
Disabled minMaxIdx for 32FC1 since it occasionally fails on AMD devices (e.g. A10-6800K)
2014-09-03 14:36:51 +04:00
Vadim Pisarevsky
0276cc90c2
Merge pull request #3184 from ilya-lavrenov:arm
2014-09-03 05:40:19 +00:00
Ilya Lavrenov
5d3a128cd3
NEON impl on cv::convertScaleAba CV_32f
2014-09-01 17:04:36 +00:00
Vadim Pisarevsky
3bafe64666
Merge pull request #3170 from ElenaGvozdeva:ocl_fix
2014-09-01 10:40:02 +00:00
Vadim Pisarevsky
1f85ffa11b
Merge pull request #3166 from akarsakov:ocl_native_sqrt
2014-09-01 10:36:50 +00:00
Alexander Alekhin
4d474d40e7
Merge pull request #3171 from akarsakov:amd_fft_fix
2014-08-29 16:28:31 +00:00
Ilya Lavrenov
71ec6144bd
attempt to fix compilation of OpenCL cv::transpose for AMD
2014-08-29 16:59:30 +04:00
Alexander Karsakov
d4e6812be2
Added check AmdFft version to be sure that AmdFft binaries are available
2014-08-29 14:23:18 +04:00
Elena Gvozdeva
31ac73c315
fix for cv::memopTypeToStr
2014-08-29 14:18:52 +04:00
Alexander Alekhin
57fec2f2da
OCL: enable clAmdFftGetVersion
2014-08-29 13:45:04 +04:00
Alexander Karsakov
491bf41356
Disabled native_sqrt for double, since it may be not implemented and gives compilation error.
2014-08-28 17:01:49 +04:00
Alexander Alekhin
b332152bef
Merge pull request #2956 from ilya-lavrenov:tapi_accumulate
2014-08-28 09:08:51 +00:00
Vadim Pisarevsky
4d9d7e6ded
Merge pull request #3160 from akarsakov:ocl_dft_double_support
2014-08-27 10:06:34 +00:00
Alexander Karsakov
a89ff402fc
Refactoring of OCL_FftPlan class
2014-08-27 10:33:25 +04:00
Alexander Karsakov
3ae95150c7
Added double support for OCL version of DFT
2014-08-25 18:08:43 +04:00
Ilya Lavrenov
7017b1250a
used new stratehy in cv::accumulate**
2014-08-25 11:25:54 +04:00
Ilya Lavrenov
2c6b7a52e9
improved cv::ocl::predictOptimalVectorWidth
2014-08-25 11:25:01 +04:00
Ilya Lavrenov
2f46cf5ddb
changed base types for cv::memopTypeToStr
2014-08-25 11:23:46 +04:00
Vadim Pisarevsky
de2a8d3706
Merge pull request #3135 from thorikawa:fix-arithmetic-exception-with-ipp
2014-08-21 05:41:18 +00:00
Poly Takahiro Horikawa
cb0ab87646
Fix arithmetic exception when pass an empty mat with IPP option on.
2014-08-19 19:19:53 -07:00
Vadim Pisarevsky
70679b49fc
Merge pull request #3121 from akarsakov:ocl_dft_opt
2014-08-19 08:57:53 +00:00
Dmitriy Anisimov
37b1a7560c
first version of moving KDTree from core to ml
2014-08-18 22:40:31 +04:00
Alexander Karsakov
fa818d03b8
Changed twiddle buffer creation to use OCL buffer pool (if possible)
2014-08-18 18:22:52 +04:00
Vadim Pisarevsky
0ffc53bafe
Merge pull request #3107 from vpisarev:fixed_win_build
2014-08-18 09:00:54 +00:00
Vadim Pisarevsky
e9729a9601
multiple yet minor fixes to make most of the tests pass on Mac with Iris graphics
2014-08-16 00:29:10 +04:00
Vadim Pisarevsky
daaa5a1824
removed duplicated Hershey fonts
2014-08-15 12:09:16 +04:00
Vadim Pisarevsky
dfbeaddc72
Merge pull request #3106 from LeszekSwirski:fix-gemm-buf-allocate
2014-08-15 05:44:54 +00:00
Leszek Swirski
de806330fd
Fix reallocation of D buffer in gemm
2014-08-14 15:54:54 +01:00
Alexander Alekhin
52ac61d87c
Merge pull request #3088 from vbystricky:ocl_enableNormEtc
2014-08-14 14:34:40 +00:00
vbystricky
942ff5be57
Disable OpenCL version of minMaxIdx with mask on AMD devices
2014-08-14 17:39:41 +04:00
Vadim Pisarevsky
95a4943762
Merge pull request #3103 from vpisarev:core_imgproc_optim_rearrangements
2014-08-14 13:39:01 +00:00
Vadim Pisarevsky
4530c7ad08
trying to fix builds
2014-08-14 13:18:04 +04:00
Vadim Pisarevsky
f7243be79c
Merge pull request #3090 from akarsakov:ocl_reduce_fix
2014-08-14 08:52:18 +00:00
Vadim Pisarevsky
5240a03217
Merge pull request #3100 from vpisarev:disabled_neon_split_merge_on_ios
2014-08-14 08:51:44 +00:00
Vadim Pisarevsky
257463719b
removed optim module; moved its functionality to core and photo modules; moved drawing functions from core to imgproc. Removed FilterEngine etc. from public API
2014-08-14 12:50:07 +04:00
vbystricky
aab6f6c56f
Set sync=true in ocl_minMaxIdx kernel call. For tests
2014-08-14 12:14:56 +04:00
Vadim Pisarevsky
5d8098a2c3
avoid compiler warnings
2014-08-14 10:06:10 +04:00
Vadim Pisarevsky
5f590ebed0
Merge pull request #3094 from Adil-Ibragimov:minor_fixes_in_Mat-data
2014-08-13 18:28:51 +00:00
Vadim Pisarevsky
71ff228ad3
Merge pull request #3089 from Adil-Ibragimov:fix_cuda_constness
2014-08-13 18:09:42 +00:00
Vadim Pisarevsky
a37876ab95
fixed build of iOS framework another time; clang does not like the new NEON-optimized split/merge
2014-08-13 19:59:26 +04:00
vbystricky
1fe403f461
Enable OpenCL version of norm and convertScaleAbs or 32F data
...
Fix error in minmaxloc.cl
Change test for convertScaleAbs
Fix minMaxIdx for _src2 align
Change epsilon on the tests
2014-08-13 18:33:01 +04:00
Vadim Pisarevsky
d97f9fc477
now the tests pass with loop_times==30
2014-08-13 16:32:01 +04:00
Adil Ibragimov
8a4a1bb018
Several type of formal refactoring:
...
1. someMatrix.data -> someMatrix.prt()
2. someMatrix.data + someMatrix.step * lineIndex -> someMatrix.ptr( lineIndex )
3. (SomeType*) someMatrix.data -> someMatrix.ptr<SomeType>()
4. someMatrix.data -> !someMatrix.empty() ( or !someMatrix.data -> someMatrix.empty() ) in logical expressions
2014-08-13 15:21:35 +04:00
Vadim Pisarevsky
3cd7941098
Merge pull request #3083 from akarsakov:ocl_transpose_check_locmem
2014-08-13 10:50:51 +00:00
Vadim Pisarevsky
01bd4961e3
Merge pull request #3081 from akarsakov:small_fixes
2014-08-13 10:50:33 +00:00
Alexander Karsakov
c3100eeb19
Fixed buffer initialization in reduce kernel. Enabled OCL version of reduce for SUM, MAX, MIN modes.
2014-08-13 12:03:06 +04:00
Adil Ibragimov
dfd0b297ce
datastart must be non-const in CUDA.
2014-08-13 11:11:33 +04:00
Alexander Karsakov
713ddb89bd
Set minimum matrix size for AmdBlas::gemm to 20 since it works incorrect for small sizes
2014-08-12 18:02:29 +04:00
Alexander Karsakov
3deea1c3ce
Added check required local memory size for non-inplace transpose kernel
2014-08-12 16:26:08 +04:00
Alexander Alekhin
da806cc3e7
Merge pull request #3073 from vbystricky:ocl_fixMinMaxLoc
2014-08-12 10:21:41 +00:00
vbystricky
6fb282aa39
Remove mul24, for CV8UC3 3840x2160 it generates implementation specific result
2014-08-12 11:25:23 +04:00
Alexander Alekhin
8fd61c9aa6
Merge pull request #3075 from akarsakov:ipp_imgproc_fix
2014-08-11 15:02:53 +00:00
Elena Gvozdeva
b73b28ab9c
fix for ocl_lut
2014-08-11 16:21:55 +04:00
vbystricky
dfe5fff2f5
Restore OpenCL branch for minmaxLoc on float data
2014-08-11 15:57:29 +04:00
Alexander Karsakov
835b5e2179
Disabled ippiConvert_32f16s_C1R since it breaks OCL accuracy tests of remap with test_loop_times=30.
2014-08-11 15:50:06 +04:00
Vadim Pisarevsky
9ef373f6a5
Merge pull request #3038 from yury-gorbachev:core_arithm_neon
2014-08-10 20:20:04 +00:00
Vadim Pisarevsky
4de4ff5682
Merge pull request #3067 from vpisarev:minor_fixes2
2014-08-10 10:52:57 +00:00
Vadim Pisarevsky
e2f24f43c9
added some basic functionality needed by the new face module (moved from the old "contrib")
2014-08-10 01:54:16 +04:00
Vadim Pisarevsky
05e7c29de5
fixed various warnings and obvious errors reported by clang compiler and the coverity tool.
2014-08-10 00:10:05 +04:00
Yury Gorbachev
9a233999a9
Fixed review comment from Vadim Pisarevsky
2014-08-09 13:37:53 +04:00
Vadim Pisarevsky
99eed2d885
Merge pull request #3057 from Adil-Ibragimov:adding-constness
2014-08-08 17:27:47 +00:00
Vadim Pisarevsky
74457cd3b5
Merge pull request #3055 from akarsakov:copyto_UMat_fix
2014-08-08 14:20:02 +00:00
Vadim Pisarevsky
88672f414b
Merge pull request #3056 from akarsakov:ocl_setIdentity_fix
2014-08-08 14:03:46 +00:00
Alexander Karsakov
6ad4521b78
Fixed typos
2014-08-08 13:11:35 +04:00
Adil Ibragimov
8dab2ffce7
fixing error, wrong template method param.
2014-08-07 17:43:32 +04:00
Adil Ibragimov
98d5731ad8
some formal changes (generally adding constness)
2014-08-07 15:49:14 +04:00
Alexander Karsakov
c5ad6de596
Fixed choice of kercn and rowsPerWI for non-Intel device.
2014-08-07 15:37:26 +04:00
Alexander Karsakov
00221e96fd
Fixed issue: Mat::copyTo(UMat) if device copy is obsolete. Added test.
2014-08-07 13:16:02 +04:00
Vadim Pisarevsky
ca183707f2
Merge pull request #3039 from alalek:ocv_world
2014-08-06 14:52:52 +00:00
Vadim Pisarevsky
d611684d9e
Merge pull request #3041 from adrians:optimize_split
2014-08-06 09:22:38 +00:00
Alexander Alekhin
55188fe991
world fix
2014-08-05 20:12:35 +04:00
vbystricky
774d277c1f
Fix error in OpenCl version of meanstddev for continues src and not continues mask
2014-08-05 17:30:06 +04:00
Adrian Stratulat
b2acd1f75a
Vectorize split & merge for NEON
2014-08-05 06:51:36 +00:00
Vadim Pisarevsky
a05ce00a65
Merge pull request #2879 from alalek:ocl_runtime_flags
2014-08-04 16:43:01 +00:00
Vadim Pisarevsky
432b7cb235
Merge pull request #3032 from vpisarev:refactor_ml2
2014-08-04 14:51:38 +00:00
Yury Gorbachev
61423a2d47
ARM NEON accelerated implementation of cv::addWeighted, cv::inRange and cv::compare functions.
...
NOT verified on target platform, compilation passes with and without NEON.
2014-08-04 16:44:51 +04:00
vbystricky
b48e487d53
Enable ocl version of minmaxloc with mask
2014-07-31 19:12:10 +04:00
vbystricky
11a0e3ff78
Fix error in OCL minmaxloc
2014-07-31 19:04:38 +04:00
Elena Gvozdeva
9db8592aa8
fixed ocl_flip for cn==3
2014-07-31 16:57:47 +04:00
Vadim Pisarevsky
b0cd822924
trying to make the tests pass
2014-07-31 14:24:21 +04:00
Vadim Pisarevsky
ef8647fe30
put IPP ReduceSum_32f back
2014-07-30 19:12:11 +04:00
Vadim Pisarevsky
5267ed46c7
1. disabled OpenCL acceleration for cv::multiply() (CV_32F), cv::divide (CV_32F), cv::convertScaleAbs (CV_32F) and cv::reduce (SUM, CV_32F), cv::reduce (MIN & MAX), cv::flip (3-channel case).
...
2. changed the number of test loops from 1 to 30 (except for cv::pow() test, which fails for yet unknown reason)
3. disabled IPP acceleration for 3-channel norms.
4. modified relativeNorm test function to handle very small values
2014-07-30 18:19:47 +04:00
Vadim Pisarevsky
ba3783d205
initial commit; ml has been refactored; it compiles and the tests run well; some other modules, apps and samples do not compile; to be fixed
2014-07-29 23:54:23 +04:00
Vadim Pisarevsky
41df63aacd
Merge pull request #3010 from vpisarev:ocl_disable_some_norms2
2014-07-28 17:24:42 +00:00
Vadim Pisarevsky
962b519708
Merge pull request #2996 from akarsakov:ocl_dft_new_concept
2014-07-28 15:59:59 +00:00
Vadim Pisarevsky
0c749fd7e5
Merge pull request #2998 from ernest-galbrun:cuda_concurrency
2014-07-28 15:59:43 +00:00
Vadim Pisarevsky
af83884dc7
disabled ocl_minmaxloc on 32-bit data completely
2014-07-28 19:35:06 +04:00
Vadim Pisarevsky
1925fab427
disabled some more stuff
2014-07-28 19:32:06 +04:00
Vadim Pisarevsky
97f6dd1820
trying to make OpenCL builder green
2014-07-28 17:16:02 +04:00
Vadim Pisarevsky
f42bb0bd65
disable OpenCL difference norms with mask, since some of the tests fail
2014-07-28 15:45:54 +04:00
Vadim Pisarevsky
ef0f010437
Merge pull request #3003 from ElenaGvozdeva:ocl_sum
2014-07-28 11:08:08 +00:00
Alexander Karsakov
fecfaf4092
Using std::map in PlanCache
2014-07-28 14:19:47 +04:00
Elena Gvozdeva
27d97aa231
fixed
2014-07-28 12:21:00 +04:00
Elena Gvozdeva
fe29af2e58
Fixed bug in reduce.cl
2014-07-25 14:51:30 +04:00
Alexander Karsakov
37d01e2d27
Added license header, using cv::Ptr, small fixes.
2014-07-25 13:27:00 +04:00
Adrian Stratulat
a98ce86ca3
Add NEON intrinsics to core/src/arithm.cpp
2014-07-24 19:45:32 +00:00
Alexander Karsakov
66ac46214d
Final refactoring, fixes
2014-07-24 13:23:02 +04:00
Ernest Galbrun
551ab83cf2
trailing whitespace
2014-07-23 14:34:22 +02:00
Ernest Galbrun
d7038423dd
Merge branch 'master' of https://github.com/Itseez/opencv
2014-07-23 14:21:45 +02:00
Ernest Galbrun
6a769c92b3
modified default stream initialization to allow concurrent calls modified cuda surf.cuda.cpp to allow concurrent call
2014-07-23 14:21:21 +02:00
Elena Gvozdeva
4fdd6de8e8
Fixed ippiDFT call
2014-07-23 13:39:31 +04:00
Alexander Karsakov
1d2cf0e20e
Added nonzero_rows support
2014-07-22 18:31:08 +04:00
Alexander Karsakov
52f76a3283
Added rest Elena's changes
2014-07-22 18:31:08 +04:00
Alexander Karsakov
7791264525
Added multi-block scheme
2014-07-22 18:31:03 +04:00
Alexander Karsakov
2b9e556055
Added Elena's changes with implemented DFT_INVERSE C2C mode.
2014-07-22 18:27:31 +04:00
Alexander Karsakov
b17bf031f6
Added DFT_SCALE for forward transforms
2014-07-22 18:27:31 +04:00
Alexander Karsakov
6c8b6bd0c7
Added packing to CCS format
2014-07-22 18:27:25 +04:00
Alexander Karsakov
ed07241f89
Completed all forward transforms.
2014-07-22 18:27:17 +04:00
Alexander Karsakov
e5a3ab3cb9
Added fftplan cache
2014-07-22 18:27:08 +04:00
Alexander Karsakov
0318d27720
Enabled precalculated wave
2014-07-22 18:26:58 +04:00
Alexander Karsakov
5dd9263848
Multi-radix with kernel generation
2014-07-22 18:25:59 +04:00
PhilLab
e82241cba5
Function for drawing arrows
...
Porting the [PR 2970](https://github.com/Itseez/opencv/pull/2970 ) To the new InputOutputArray type
2014-07-18 10:04:44 +02:00
Vadim Pisarevsky
b63a7e66db
Merge pull request #2953 from gongzg:master
2014-07-14 19:21:15 +00:00
Alexander Alekhin
e169083fed
Merge pull request #2918 from vbystricky:oclopt_reduce2
2014-07-12 05:00:57 +00:00
Alexander Alekhin
df74a266cf
Merge pull request #2963 from ilya-lavrenov:tapi_bufferpool
2014-07-11 14:55:13 +00:00
Alexander Alekhin
0876ec7a67
Merge pull request #2935 from vbystricky:oclopt_meanStdDev
2014-07-11 14:53:46 +00:00
Alexander Alekhin
cbf63b0709
Merge pull request #2851 from ilya-lavrenov:tapi_set_identity
2014-07-11 10:00:16 +00:00
Ilya Lavrenov
f2a2467de3
enabled BufferPool
2014-07-11 12:21:10 +04:00
Alexander Alekhin
a0816c6d15
Merge pull request #2927 from ilya-lavrenov:tapi_lut
2014-07-10 09:38:37 +00:00
Zhigang Gong
2e49ca4903
Avoid unmap an umat(ocl) which hasn't been mapped at all.
...
According to opencl 1.2 spec 5.4.2:
enqueues a command to unmap a previously mapped region of a memory object.
...
CL_INVALID_VALUE if mapped_ptr is not a valid pointer returned by
clEnqueueMapBuffer, or clEnqueueMapImage for memobj.
So if the u->data is not from a clEnqueueMapBuffer call, we should not
call clEnqueueUnmapMemObject() unmap it. With this patch, the cases
./opencv_test_video --gtest_filter=OCL_Video/FarnebackOpticalFlow.Mat/*
could work well with beignet 0.9.1, Otherwise, it will get a
CL_INVALID_VALUE at the clEnqueueUnmapMemObject().
Signed-off-by: Zhigang Gong <zhigang.gong@intel.com>
2014-07-09 16:21:20 +08:00
Alexander Alekhin
e79ceb4b45
Merge pull request #2936 from ilya-lavrenov:tapi_minmaxloc_opt
2014-07-08 07:30:49 +00:00
Vadim Pisarevsky
e1553d5da3
Merge pull request #2929 from ilya-lavrenov:sse_pow
2014-07-07 11:22:26 +00:00
Vadim Pisarevsky
b2ab1b758a
Merge pull request #2937 from ilya-lavrenov:sse_patch_nans
2014-07-07 11:20:30 +00:00
Ilya Lavrenov
e2b86974ae
SSE2 optimization of cv::patchNaNs
2014-07-04 10:27:44 +04:00
Ilya Lavrenov
e04c39b089
SSE2 optimization of cv::inRange
2014-07-04 10:11:18 +04:00
Alexander Alekhin
964b260937
Merge pull request #2932 from ilya-lavrenov:tapi_other_vendors
2014-07-03 13:02:41 +00:00
Ilya Lavrenov
55d84cf815
optimization of cv::minMaxLoc - used min/max
2014-07-03 16:56:22 +04:00
vbystricky
690c932d7f
Change groups count for Intel
2014-07-03 15:50:56 +04:00
Ilya Lavrenov
2fe07abf83
use vectors for devices of all vendors
2014-07-03 15:32:34 +04:00
Alexander Alekhin
c5d3c08256
ocl: add try-catch for OpenCL device getter
2014-07-03 12:18:19 +04:00
Ilya Lavrenov
e5b63d43f2
sse2 optimization of inv_sqrt_64f
2014-07-03 01:33:49 +04:00
Ilya Lavrenov
10c772fa7f
minor optimization of cv::LUT
2014-07-02 18:50:21 +04:00
Vadim Pisarevsky
0e534b5a25
Merge pull request #2923 from alalek:ocl_default_device_ignore_cpu
2014-07-02 09:56:59 +00:00
vbystricky
a3b9884674
Change global buffer to local
2014-07-02 11:36:25 +04:00
Vadim Pisarevsky
15197eb09b
Merge pull request #2911 from ilya-lavrenov:sse2_convert_scale_abs
2014-07-01 17:31:37 +00:00
Alexander Alekhin
157257b072
ocl: don't use OpenCL CPU device in default setup
2014-07-01 14:25:31 +04:00
Ilya Lavrenov
972119bff3
sse2 optimization of cv::convertScaleAbs
2014-07-01 02:21:00 +04:00
Ilya Lavrenov
54e4ef657c
optimized cv::transpose inplace
2014-07-01 00:36:13 +04:00
Ilya Lavrenov
9c8b9fc733
cv::transpose
2014-06-30 19:07:45 +04:00
Alexander Alekhin
b304868d91
Merge pull request #2886 from ilya-lavrenov:tapi_inrange
2014-06-30 10:04:04 +00:00
Alexander Alekhin
2d81595ed4
Merge pull request #2852 from ilya-lavrenov:tapi_abs
2014-06-27 08:21:06 +00:00
Roman Donchenko
e8850bf84a
Merge pull request #2626 from KayKwon:matlab_formatter
2014-06-26 17:40:26 +04:00
Alexander Alekhin
4f1d06d14a
Merge pull request #2874 from vbystricky:ocl_LUTAMDfix
2014-06-23 14:23:26 +00:00
Ilya Lavrenov
e89cee35e5
optimized cv::inRange
2014-06-23 13:12:08 +04:00
Ilya Lavrenov
0528d2e2b3
added 32s to 32u conversion
2014-06-23 13:09:54 +04:00
Alexander Alekhin
07446ec954
Merge pull request #2847 from ilya-lavrenov:tapi_pow
2014-06-20 08:47:18 +00:00
Alexander Alekhin
d9daa0cbaa
ocl: added runtime flags for debugging
2014-06-19 17:52:46 +04:00
vbystricky
ea6da6ee95
Change uchar2 to short
2014-06-18 15:44:56 +04:00
vbystricky
9391ccfe35
Move some variables into #ifdef
2014-06-18 15:03:09 +04:00
Hans Johnson
4c7ed03b5f
COMP: Fix problem with narrowing in c++11
...
modules/core/src/arithm.cpp:345:51:
error: constant expression evaluates to 4294967295 which cannot be narrowed to type 'int' [-Wc++11-narrowing]
static int CV_DECL_ALIGNED(16) v64f_absmask[] = { 0xffffffff, 0x7fffffff, 0xffffffff, 0x7fffffff };
^~~~~~~~~~
Converted to unsigned int. This variable is only used to initialize a bit pattern anywhy for a 128bit type.
2014-06-17 15:49:06 -05:00
Ilya Lavrenov
6dd658a0af
optimized cv::setIdentity
2014-06-16 13:41:43 +04:00
Ilya Lavrenov
316c044e06
used abs in reduction operations
2014-06-11 18:54:43 +04:00
Ilya Lavrenov
f1e24381d1
used abs
2014-06-11 18:33:46 +04:00
Ilya Lavrenov
b2c2aabd04
used built-in functions
2014-06-11 18:32:42 +04:00
Alexander Alekhin
88ceee05c6
Merge pull request #2837 from ilya-lavrenov:tapi_norm_relative
2014-06-10 22:21:09 +04:00
Ilya Lavrenov
03f665e870
some optimizaions of cv::pow
2014-06-10 18:37:35 +04:00
Ilya Lavrenov
634da9f3bf
added norm_inf support to minmaxloc kernel
2014-06-09 20:05:26 +04:00
Roman Donchenko
ed1c56e063
Merge pull request #2817 from bf:patch-1
2014-06-09 15:06:07 +04:00
Ilya Lavrenov
7f2662b310
fixes
2014-06-09 00:50:14 +04:00
Ilya Lavrenov
5403bdd228
optimized cv::norm with NORM_RELATIVE
2014-06-07 21:26:40 +04:00
Ilya Lavrenov
2040995801
optimized cv::norm with 2 args
2014-06-07 20:45:28 +04:00
Ilya Lavrenov
1a7a262f74
optimized cv::norm with NORM_INF
2014-06-06 19:34:26 +04:00
Alexander Alekhin
ea417ac04a
Merge pull request #2755 from vbystricky:oclopt_LUT
2014-06-06 18:52:40 +04:00
Ilya Lavrenov
fd5a8b3e97
minmaxloc
2014-06-06 18:39:08 +04:00
Alexander Alekhin
f30301d171
Merge pull request #2801 from ilya-lavrenov:tapi_reduction
2014-06-06 18:38:22 +04:00
VBystricky
5d924b7a75
If lut table has one channel and src aligned to 4, work with src as with one channel matrix
2014-06-05 19:31:31 +04:00
VBystricky
0c0ebca855
Read 4 pixel for aligned data with 1 channel
2014-06-04 23:50:23 +04:00
VBystricky
8a5f2781fc
Fix kernel by comments
2014-06-04 20:13:42 +04:00
Alexander Alekhin
3f485d58bb
Merge pull request #2808 from ilya-lavrenov:tapi_normalize_mask
2014-06-04 18:10:13 +04:00
Ilya Lavrenov
405f12fe80
optimized cv::flip (CV_8UC1)
2014-06-04 14:21:42 +04:00
Alexander Alekhin
6d3413bc6a
Merge pull request #2810 from ilya-lavrenov:tapi_copytomask
2014-06-04 12:23:36 +04:00
Alexander Alekhin
c2ca059baf
Merge pull request #2798 from ilya-lavrenov:tapi_copymakeborder
2014-06-04 12:22:52 +04:00
Alexander Alekhin
ee8adc10a1
Merge pull request #2821 from ilya-lavrenov:tapi_reduce
2014-06-04 12:18:30 +04:00
Alexander Alekhin
607cd37b48
Merge pull request #2750 from ilya-lavrenov:tapi_convertto
2014-06-03 16:36:48 +04:00
Alexander Alekhin
93af92c878
Merge pull request #2795 from ilya-lavrenov:tapi_setto
2014-06-03 13:21:05 +04:00
Alexander Alekhin
5e8a3cde94
Merge pull request #2757 from ElenaGvozdeva:ocl_flip
2014-06-03 13:19:42 +04:00
Ilya Lavrenov
26b73a7bbd
merged 2 kernels
2014-06-03 13:14:51 +04:00
Ilya Lavrenov
ab428c9dbd
optimized cv::copyMakeBorder
2014-06-02 15:46:44 +04:00
Ilya Lavrenov
67bb1c6a70
optimized UMat::setTo
2014-06-02 15:39:36 +04:00
Ilya Lavrenov
17956a5ae5
optimized cv::normalize in case of mask
2014-06-02 15:33:19 +04:00
Alexander Alekhin
fc2f79701d
Merge pull request #2778 from vbystricky:oclopt_reduce
2014-06-02 13:57:57 +04:00
Benjamin Flesch
62533d0d26
perspectiveTransform simplify assert() for better debuggin
...
When using perspectiveTransform in rather complicated settings, it would be easier for developers to have two separate assertions for each part of the boolean expression in order to pinpoint problems more efficiently. In my case I am struggling in Python2.7 with finding out whether scn+1 == m.cols or the depth == CV_32F || depth == CV_64F is making a problem, which is kind of hard.
2014-06-02 00:27:32 +02:00
Alexander Alekhin
78badcd651
Merge pull request #2781 from ilya-lavrenov:tapi_meanstddev
2014-05-31 22:50:44 +04:00
Alexander Alekhin
17d6083e01
Merge pull request #2787 from ilya-lavrenov:tapi_repeat
2014-05-31 22:46:21 +04:00
Ilya Lavrenov
7f818e9bc3
optimized UMat::copyTo with mask
2014-05-30 18:27:55 +04:00
Ilya Lavrenov
c52a77b90f
optimized cv::UMat::dot
2014-05-29 13:16:16 +04:00
Ilya Lavrenov
579499d900
optimized cv::sum (CV_8UC1)
2014-05-28 19:46:08 +04:00
Ilya Lavrenov
002a79bfc4
optimized cv::countNonZero
2014-05-28 18:42:42 +04:00
Ilya Lavrenov
7804d57f8b
optimized index calculation
2014-05-28 18:01:03 +04:00
Ilya Lavrenov
33173d900a
optimized cv::meanStdDev
2014-05-28 17:55:30 +04:00
Elena Gvozdeva
437927b7bb
optimized index access
2014-05-28 16:27:08 +04:00
vbystricky
b4498d1d48
Fix some errors
2014-05-26 16:55:31 +04:00
vbystricky
63584bffda
fix code in the kernel
2014-05-26 16:54:26 +04:00
vbystricky
04433b2d2b
Change buffer cols count from 64 to 32
2014-05-26 16:54:25 +04:00
vbystricky
169351b01d
Optimize openCL version of reduce function
2014-05-26 16:54:25 +04:00
vbystricky
cfabf32492
Fix some errors
2014-05-26 16:53:01 +04:00
vbystricky
48d82fd911
Fix some errors
2014-05-26 16:53:01 +04:00
vbystricky
a8bfab3cb7
Optimize OpenCL LUT function
2014-05-26 16:53:00 +04:00
vbystricky
6667cea0f4
Optimize OpenCL LUT function
2014-05-26 16:53:00 +04:00
vbystricky
72727111c7
Use 4 pixels for one unit. Some ocl code refactoring
2014-05-26 16:52:59 +04:00
vbystricky
47b092e527
Optimize OpenCL LUT function
2014-05-26 16:52:59 +04:00
Ilya Lavrenov
50fa809a96
fixed cv::mixChannels
2014-05-26 14:20:16 +04:00
Ilya Lavrenov
ec3c68c8ea
fix
2014-05-26 12:19:07 +03:00
Ilya Lavrenov
bd5e4c6c49
other kernels now use row scheme
2014-05-26 12:19:06 +03:00
Ilya Lavrenov
15738bf7ef
multiple rows in KF kernel
2014-05-26 12:19:06 +03:00
Ilya Lavrenov
5ee398bfd6
multiple rows per work-item
2014-05-26 12:19:05 +03:00
Ilya Lavrenov
c83455d8a4
optimized cv::repeat
2014-05-23 13:16:27 +03:00
Alexander Alekhin
c43e32854a
Merge pull request #2713 from alalek:icv_enable_fns
2014-05-23 13:35:45 +04:00
Alexander Alekhin
694d7726b5
Merge pull request #2762 from ilya-lavrenov:ipp_sort
2014-05-22 12:30:53 +04:00
Alexander Alekhin
d940093fb5
Merge pull request #2723 from ilya-lavrenov:tapi_intel_vector_width
2014-05-21 19:56:00 +04:00
Roman Donchenko
3d7a457c97
Merge remote-tracking branch 'origin/2.4' into merge-2.4
...
Conflicts:
doc/tutorials/introduction/linux_install/linux_install.rst
modules/core/doc/operations_on_arrays.rst
modules/core/include/opencv2/core/core.hpp
modules/core/src/system.cpp
modules/gpu/src/cuda/resize.cu
modules/imgproc/doc/miscellaneous_transformations.rst
modules/imgproc/doc/structural_analysis_and_shape_descriptors.rst
modules/video/src/bgfg_gaussmix2.cpp
2014-05-19 18:39:20 +04:00
Vadim Pisarevsky
c6113c6cb3
Merge pull request #2630 from isarandi:concat_corr
2014-05-19 17:36:20 +04:00
Vadim Pisarevsky
118b27f5b4
Merge pull request #2744 from jet47:kmeans-fix
2014-05-19 17:33:10 +04:00
Roman Donchenko
cba5235f08
Merge pull request #2698 from GregoryMorse:patch-2
2014-05-19 14:52:58 +04:00
Ilya Lavrenov
eb1dd1900b
fixed conditions
2014-05-16 12:18:35 +04:00
Elena Gvozdeva
c7dc884855
T-API: optimized ocl_flip
2014-05-16 11:01:10 +04:00
Vladislav Vinogradov
f16503743f
use more accurate reshape
2014-05-15 12:08:01 +04:00
Roman Donchenko
f13e05cdcb
Merge pull request #2740 from ilya-lavrenov:tapi_norm_fix
2014-05-14 12:40:19 +04:00
Alexander Alekhin
46d672dcc3
Merge pull request #2663 from ilya-lavrenov:ipp_slow
2014-05-14 12:30:05 +04:00
Vladislav Vinogradov
55a714c83b
fix cv::kmeans function
...
reshape input matrix, since the function works with data
as with [N x dims] matrix
2014-05-13 18:00:17 +04:00
Ilya Lavrenov
d16e0b377f
disabled IPP functions that slower than OpenCV
2014-05-13 15:06:38 +04:00
Ilya Lavrenov
f64dfeb222
fix for cv::norm (norm_inf)
2014-05-13 14:21:07 +04:00
Roman Donchenko
ff2d76ec0d
Merge pull request #2699 from GregoryMorse:patch-1
2014-05-13 11:58:25 +04:00
Roman Donchenko
313a0ad255
Merge remote-tracking branch 'origin/2.4' into merge-2.4
...
Conflicts:
modules/core/include/opencv2/core/internal.hpp
modules/core/src/arithm.cpp
modules/imgproc/src/imgwarp.cpp
modules/objdetect/src/hog.cpp
2014-05-12 17:38:07 +04:00
Ilya Lavrenov
1e5694e082
heuristic for Intel
2014-05-12 16:08:39 +04:00
Alexander Alekhin
1f638a3e5b
icv: enable functions
2014-05-12 15:38:38 +04:00
Alexander Alekhin
c74fed1fcf
Merge pull request #2706 from ilya-lavrenov:ipp_2.4_warnings
2014-05-08 17:04:18 +04:00
Ilya Lavrenov
19a2495067
fixed IPP related warnings
2014-05-08 13:31:27 +04:00
Vladislav Vinogradov
77275031ab
finally fix cv::subtract
2014-05-08 13:11:42 +04:00
Vladislav Vinogradov
629461c836
fix output matrix allocation in cv::subtract
2014-05-07 19:52:35 +04:00
Alexander Alekhin
07d5f56707
Merge pull request #2679 from alalek:ippicv_update
2014-05-07 18:24:41 +04:00
Alexander Alekhin
849228ac6a
Merge pull request #2684 from ElenaGvozdeva:ippiDFT
2014-05-07 17:57:17 +04:00
Alexander Alekhin
0e1bf58195
Merge pull request #2634 from ElenaGvozdeva:ippiDCT
2014-05-07 14:49:13 +04:00
Vladislav Vinogradov
4c66614e07
fix cv::subtract function:
...
call dst.create(...) before using it
2014-05-07 13:15:19 +04:00
GregoryMorse
38db7a78df
WinRT core compatibility fixes
...
Update system.cpp
Update system.cpp
Update system.cpp
Update matching.cpp
Update matching.cpp
2014-05-07 06:17:37 +08:00
GregoryMorse
b1443bc1bc
WinRT core compatibility fixes
...
Update system.cpp
Update system.cpp
Update ocl.cpp
Update matching.cpp
Update ocl.cpp
Update matching.cpp
2014-05-07 06:15:54 +08:00
Elena Gvozdeva
12279e2631
fixed
2014-05-06 14:49:03 +04:00
Alexander Alekhin
01f31dc007
Merge pull request #2687 from ilya-lavrenov:tapi_norm_inf_234cn
2014-05-06 13:27:03 +04:00
Elena Gvozdeva
e362c3fb38
fixed
2014-05-06 13:09:45 +04:00
Alexander Alekhin
03b1d133df
Merge pull request #2660 from arkunze:pullreq/140423-filter2D
2014-05-06 12:59:03 +04:00
Ilya Lavrenov
c80faff42f
added cn>1 support to cv::norm (NORM_INF)
2014-04-30 14:59:37 +04:00
Alexander Alekhin
b62e59aac0
icv: enable functions
2014-04-29 15:36:43 +04:00
Alexander Alekhin
5658ba0002
icv: update package
2014-04-29 15:35:27 +04:00
Elena Gvozdeva
043b917c7d
IPP: cv::dft fixed
2014-04-29 12:23:23 +04:00
Elena Gvozdeva
507ea95265
fixed
2014-04-28 16:46:11 +04:00
Elena Gvozdeva
16629bff4c
Added parallel version for DCT_ROWS
2014-04-28 16:05:19 +04:00
Elena Gvozdeva
6880dbfd95
IPP: cv::dct
2014-04-28 16:05:19 +04:00
Alexander Alekhin
649935596e
Merge pull request #2653 from alalek:ipp_lut
2014-04-28 15:15:11 +04:00
Alexander Alekhin
0e5b0677c9
Merge pull request #2654 from ElenaGvozdeva:ipp_DFT
2014-04-28 15:14:40 +04:00
Alexander Alekhin
7ee1d5f69b
ipp: added LUT optimization
2014-04-25 12:09:06 +04:00
Elena Gvozdeva
10d1b3a534
Added parallel version for DFT_ROWS
2014-04-25 11:39:19 +04:00