Vitaly Tuzov
d43597c199
transform() implementation updated to utilize wide universal intrinsics
2019-03-18 20:33:19 +03:00
Alexander Alekhin
5451b89aed
core: refactor EigenvalueDecomposition (hqr2)
...
- fix resource allocation management
- reduce variables scope
- fix complex_div
- fix comments, constants
- simplify add/sub operations
2019-03-18 19:07:34 +03:00
Alexander Alekhin
a7c4ee9ae1
core: add iterations limit check in eigenNonSymmetric()
2019-03-18 17:49:17 +03:00
Alexander Alekhin
8c8715c4dd
fix static analysis issues
2019-03-13 17:19:39 +03:00
Alexander Alekhin
b9d2e6664d
Merge pull request #13979 from alalek:issue_13772
2019-03-07 09:53:25 +00:00
Alexander Alekhin
7366eebebb
core: fix condition in OutputArray::create(allowTransposed=True)
2019-03-05 16:26:59 +03:00
Alexander Alekhin
35edad3e74
build: fix warnings
2019-03-05 14:47:04 +03:00
Alexander Alekhin
fd49ee5f39
core: dispatch merge.cpp
2019-02-23 15:42:26 +00:00
Alexander Alekhin
93a36b0df1
core: keep history of merge.cpp
2019-02-23 15:41:39 +00:00
Alexander Alekhin
4e12febe90
core: clone merge.simd.hpp
2019-02-23 15:41:33 +00:00
Alexander Alekhin
6eabe6bc14
core: clone merge.dispatch.cpp
2019-02-23 15:41:33 +00:00
Alexander Alekhin
91d152e2c2
core: dispatch split.cpp
2019-02-22 09:54:31 +00:00
Alexander Alekhin
1d8b30bf4f
core: keep history of split.cpp
2019-02-22 09:18:51 +00:00
Alexander Alekhin
0311770e8b
core: clone split.simd.hpp
2019-02-22 09:18:27 +00:00
Alexander Alekhin
82cd2f8c93
core: clone split.dispatch.cpp
2019-02-22 09:17:51 +00:00
Alexander Alekhin
dc84cf9914
core: dispatch mean.cpp
2019-02-19 16:58:32 +03:00
Alexander Alekhin
4b82c8a22b
core: keep history of mean.cpp
2019-02-19 16:46:46 +03:00
Alexander Alekhin
7af7bcae18
core: clone mean.dispatch.cpp
2019-02-19 16:46:28 +03:00
Alexander Alekhin
93cea6e46e
core: clone mean.simd.hpp
2019-02-19 16:45:42 +03:00
Alexander Alekhin
cd66f6e3db
core: dispatch matmul
...
- gemm: keep baseline only (lapack is 10x+ faster, lets reduce binary size)
- transform / distTransform
- scaleAdd (32f/64f only)
- Mahalanobis: keep baseline only (no perf tests)
- mulTransposed: keep baseline only (no perf tests)
- dot
2019-02-18 14:36:46 +03:00
Alexander Alekhin
fbde57dba8
core: keep history of matmul.cpp
2019-02-14 19:07:41 +03:00
Alexander Alekhin
dcee7b1605
core: clone matmul.dispatch.cpp
2019-02-14 19:07:37 +03:00
Alexander Alekhin
b769ad2c23
core: clone matmul.simd.hpp
2019-02-14 19:07:37 +03:00
Alexander Alekhin
e3633ec4a2
core: dispatch count_non_zero
2019-02-14 13:16:20 +03:00
Alexander Alekhin
0b49680339
core: keep history of count_non_zero.cpp
2019-02-14 13:15:43 +03:00
Alexander Alekhin
439e43a027
core: clone count_non_zero.dispatch.cpp
2019-02-14 13:15:39 +03:00
Alexander Alekhin
af8a3a0b66
core: clone count_non_zero.simd.hpp
2019-02-14 13:15:39 +03:00
Alexander Alekhin
b40a7ffbe4
core: dispatch sum
2019-02-13 18:17:38 +03:00
Alexander Alekhin
c88e6b344b
core: keep history of sum.cpp
2019-02-13 13:49:36 +03:00
Alexander Alekhin
6e88bff3e3
core: clone sum.dispatch.cpp
2019-02-13 13:49:29 +03:00
Alexander Alekhin
5aceac6b93
core: clone sum.simd.hpp
2019-02-13 13:49:29 +03:00
Alexander Alekhin
2e28ff78c1
Merge pull request #13780 from alalek:core_dispatch_convertTo
2019-02-12 12:08:30 +00:00
klemens
5d9c6723ee
spelling fixes
...
backport 997b7b18af
2019-02-11 15:35:10 +03:00
Alexander Alekhin
d32d576d6d
core: dispatch convert_scale
2019-02-08 18:32:10 +03:00
Alexander Alekhin
39b90ae9fb
core: dispatch convert
2019-02-08 18:32:10 +03:00
Alexander Alekhin
33d477e049
core: keep history of convert_scale.cpp
2019-02-08 16:39:13 +03:00
Alexander Alekhin
c4abdb8050
core: clone convert_scale.dispatch.cpp
2019-02-08 16:38:50 +03:00
Alexander Alekhin
0af9e19be7
core: clone convert_scale.simd.hpp
2019-02-08 16:35:52 +03:00
Alexander Alekhin
b6a0acb8f6
core: keep history of convert.cpp
2019-02-08 16:29:54 +03:00
Alexander Alekhin
5527c41468
core: clone convert.dispatch.cpp
2019-02-08 16:29:16 +03:00
Alexander Alekhin
1629e29b98
core: clone convert.simd.hpp
2019-02-08 16:27:59 +03:00
Alexander Alekhin
66d9a33b50
core(ocl): fix log messages
2019-02-07 16:35:14 +03:00
Alexander Alekhin
4501a2cdea
ocl: support empty "ptr only" UMat in Kernel::set()
...
add messages to avoid silent kernel destruction
2019-01-30 14:51:06 +03:00
Scott Ramsby
7d0d6cd5ee
Fix build error when building with Unicode on Windows
2019-01-07 10:18:58 -08:00
Lee Jaehwan
71aee662bd
Merge pull request #13544 from eightco:bugfix
...
Fix a bug in cv :: merge when array of 3-channel mat is input (#13544 )
* Mat merge function bug fix - Bug fix of merge function of 3-channel vector <Mat> of 3 or 4 matrices
* Add Core_merge test for opencv#13544
* fixups
2019-01-04 15:10:50 +03:00
Alexander Alekhin
d9d9b05912
core(ocl): add parameter to limit device max workgroup size
...
used by OpenCV
2018-12-17 18:33:05 +00:00
Vitaly Tuzov
6ad8a9c09d
Replaced core module calls to universal intrinsics with wide universal intrinsics
2018-12-04 16:24:20 +03:00
Alexander Alekhin
aee865fec9
Merge pull request #13352 from alalek:issue_13324
2018-12-03 15:34:23 +00:00
Alexander Alekhin
80c2adbda1
core: add getcwd() stub
2018-12-03 17:38:57 +03:00
Maksim Shabunin
c26c43c69c
Fixed compilation with VA-interop on 32-bit platforms
2018-12-03 17:16:09 +03:00
Vitaly Tuzov
00c9ab8c23
Merge pull request #13317 from terfendail:norm_wintr
...
* Added performance tests for hal::norm functions
* Added sum of absolute differences intrinsic
* norm implementation updated to use wide universal intrinsics
* improve and fix v_reduce_sad on VSX
2018-11-29 19:34:14 +03:00
Alexander Alekhin
82f4322d18
Merge pull request #13269 from alalek:ocl_fix_kernels_with_use_host_mem
2018-11-25 18:27:01 +00:00
Alexander Alekhin
9fd822f97e
ocl: fix kernels launching with USE_HOST_PTR UMat
...
created from RAW memory buffers (without proper lifetime management)
2018-11-24 15:37:16 +00:00
Alexander Alekhin
3c49b1dbbe
core: use dladdr() instead of parsing /proc/self/maps
2018-11-24 15:22:54 +00:00
Bram
724620b476
Fixed build on FreeBSD
2018-11-22 09:23:09 +01:00
Alexander Alekhin
45d2e18808
Merge pull request #13232 from huangqinjin:openmp
2018-11-21 08:32:14 +00:00
huangqinjin
e1ac8589f8
fix numThreadsMax for OpenMP
...
introduced by commit 4e62900009
2018-11-21 10:54:24 +08:00
Sayed Adel
474a0dac49
core: several improves and fixes on ppc64le infrastructure
...
- add infrastructure support for Power9/VSX3
- fix missing VSX flags on GCC4.9 and CLANG4(#13210 , #13222 )
- fix disable VSX optimzation on GCC by using flag ENABLE_VSX
- flag ENABLE_VSX is deprecated now, use CPU_BASELINE, CPU_DISPATCH instead
- add VSX3 to arithmetic dispatchable flags
2018-11-20 15:28:46 +00:00
Alexander Alekhin
780ae864a0
Merge pull request #13192 from alalek:fix_valgrind_3.4
2018-11-17 13:20:33 +03:00
berak
96c99c716a
Merge pull request #13193 from berak:core_copyMakeBorder
2018-11-17 13:19:42 +03:00
Alexander Alekhin
c26dd5d7aa
core: fix issues from valgrind builder
2018-11-17 07:32:06 +00:00
Alexander Alekhin
e580061b74
Merge pull request #12908 from alexevans:Issue11855
2018-11-16 20:42:54 +00:00
Alexander Alekhin
2fa9bd221d
core: add utils::findDataFile() / samples::findFile()
2018-11-16 00:25:06 +00:00
Alexander Alekhin
96c71dd3d2
dnn: reduce set of ignored warnings
2018-11-15 13:15:59 +03:00
Dmitry Kurtaev
6c76c8f881
Add a test for FileNode::keys()
2018-11-12 13:38:18 +03:00
Alexander Alekhin
858a7da5c0
core: rework getContinuousSize() for vector-col/row support
2018-11-10 11:08:28 +00:00
Alexander Alekhin
8675a8c743
Merge pull request #12878 from tompollok:3.4
2018-11-09 19:54:17 +00:00
Alexander Alekhin
596ada51f3
Merge pull request #13080 from alalek:issue_13078
2018-11-09 13:20:27 +00:00
Alexander Alekhin
5059523937
core: fix processing of vector-rows
2018-11-08 20:04:22 +03:00
tompollok
2da56d5af6
refactoring catching all exceptions as const ref
2018-11-08 19:59:47 +03:00
Alexander Alekhin
b74b05d1b3
Revert CV_TRY/CV_CATCH macros
...
This reverts commit 7349b8f5ce
(partially).
2018-11-08 19:56:52 +03:00
Alexander Alekhin
bb7cfcbcdb
Merge pull request #12064 from seiko2plus:coreUnvintrinArithm2
2018-11-08 14:02:40 +00:00
Sayed Adel
93ffebc273
core: reimplement SIMD arithmetic, logic and comparison operations into wide universal intrinsics
...
- initialize arithmetic dispatcher
- add new universal intrinsic v_absdiffs
- add new universal intrinsic v_pack_b
- add accumulate version of universal intrinsic v_round
- fix sse/avx2:uint8 multiplication overflow
- reimplement arithmetic, logic and comparison operations into wide universal intrinsics
with full support for all types
- reimplement IPP arithmetic, logic and comparison operations in a sperate file arithm_ipp.hpp
- avoid scalar multiplication if scaling factor eq 1 and use integer multiplication
- move C arithmetic operations to precomp.hpp and delete [arithm_simd|arithm_core].hpp
- add compatibility with new opencv4 divide policy
2018-10-30 12:48:31 +02:00
root
1196eb33fc
remove non-ideal pairs when using crosscheck in batchdistance
2018-10-26 20:25:24 +00:00
Alexander Alekhin
3e0c72ea84
core(ipp): disable SSE4.2 meanStdDev() optimization for CV_32F
2018-10-26 15:57:26 +03:00
maver1
e397434cb6
Merge pull request #12877 from maver1:3.4
...
* Updated ICV packages and IPP integration
* core(test): minMaxIdx IPP regression test
* core(ipp): workaround minMaxIdx problem
* core(ipp): workaround meanStdDev() CV_32FC3 buffer overrun
* Returned semicolon after CV_INSTRUMENT_REGION_IPP()
2018-10-24 15:02:53 +03:00
Michał Janiszewski
c8e6ce304f
Catch exceptions by const-reference
...
Exceptions caught by value incur needless cost in C++, most of them can
be caught by const-reference, especially as nearly none are actually
used. This could allow compiler generate a slightly more efficient code.
2018-10-16 22:43:54 +02:00
Alexander Alekhin
f185640eda
Merge pull request #12799 from alalek:update_build_js
...
* js: update build script
- support emscipten 1.38.12 (wasm is ON by default)
- verbose build messages
* js: use builtin Math functions
* js: disable tracing code completelly
2018-10-15 17:35:21 +03:00
Alexander Alekhin
72eccb7694
Merge pull request #12825 from alalek:issue_8413_3.4
2018-10-15 14:23:21 +00:00
Vitaly Tuzov
43d9256096
Replaced core module calls to universal intrinsics with wide universal intrinsics
2018-10-15 11:46:45 +03:00
Alexander Alekhin
c813ad5533
core(ocl): replace ambiguous 'depth' to 'DEPTH_dst'
...
- always pass DEPTH_dst value to core/arithm kernel
2018-10-14 02:18:04 +00:00
Alexander Alekhin
d2a66d3c99
Merge pull request #12815 from alalek:issue_12812
2018-10-13 13:12:55 +00:00
Alexander Alekhin
0f41daeba5
Merge pull request #12641 from dkurt:dnn_samples_args_autofill
2018-10-13 12:28:08 +00:00
Alexander Alekhin
8c4f886f5f
core: re-throw allocation exception if there is no fallback
2018-10-12 19:10:06 +00:00
Alexander Alekhin
91c6d36ff8
Merge pull request #12796 from alalek:fix_openmp_performance
2018-10-11 19:29:04 +00:00
Alexander Alekhin
4e62900009
core: call omp_set_dynamic() for better CPU usage
...
Similar to 'OMP_DYNAMIC=TRUE'.
2018-10-10 19:57:43 +00:00
Alexander Alekhin
11e2a216c5
ocl(win32): bypass deallocate() during process termination
2018-10-10 18:06:06 +00:00
Alexander Alekhin
70f2ee917e
cmake: add DllMain() into each OpenCV DLL
...
to detect process termination after ExitProcess() call
2018-10-10 11:00:59 +00:00
Alexander Alekhin
18bf91a08b
core: update allocSingleton implementation, valgrind suppression
2018-10-05 18:25:13 +03:00
Alexander Alekhin
83a0c12144
Merge pull request #12581 from terfendail:lapack_intr
2018-10-02 16:39:50 +00:00
Vitaly Tuzov
283348afc3
SSE2 code in invert() replaced with universal intrinsics
2018-10-02 12:47:07 +03:00
Alexander Alekhin
b41f38ecc7
Merge pull request #12669 from mshabunin:add-paths-config
2018-10-01 15:57:57 +00:00
Alexander Alekhin
94201b7cf9
ocl: OPENCV_OPENCL_BUILD_EXTRA_OPTIONS parameter
2018-10-01 17:56:17 +03:00
Maksim Shabunin
15632c6305
Added support for multi-path configuration parameter (env)
2018-10-01 17:50:47 +03:00
Rostislav Vasilikhin
be989b3b60
Merge pull request #12637 from savuor:fix/instr_ipp_ocl
...
Fixes for instrumentation of IPP and OCL (#12637 )
* fixed warning about re-declaring variable when both IPP and instrumentation are enabled
* fixed segfault when no funName provided
* compilation fixed when both OCL and instrumentation are enabled
2018-09-27 22:39:06 +03:00
Dmitry Kurtaev
24ab751547
Merge pull request #12565 from dkurt:dnn_non_intel_gpu
...
* Remove isIntel check from deep learning layers
* Remove fp16->fp32 fallbacks where it's not necessary
* Fix Kernel::run to prevent localsize > globalsize
2018-09-26 16:27:00 +03:00
Dmitry Kurtaev
ad5898224d
Add a file with preprocessing parameters for deep learning networks
2018-09-25 18:28:37 +03:00
Hamdi Sahloul
47bb09bc4b
OpenGL: avoid losing precision in double-to-float conversion
2018-09-24 19:06:48 +09:00
Hamdi Sahloul
ecc9bd0925
Support GpuMat in copyTo() functions
2018-09-17 23:43:14 +09:00
Alexander Alekhin
cecb878915
Merge pull request #12527 from terfendail:mean_wintr
2018-09-14 16:34:30 +00:00
Vitaly Tuzov
95502242c9
meanStdDev() implementation updated to use wide universal intrinsics
2018-09-14 17:52:08 +03:00
Hamdi Sahloul
5d54def264
Add semicolons after CV_INSTRUMENT
macros
2018-09-14 06:45:31 +09:00
Takuho NAKANO
451340fd3d
Merge pull request #12523 from takotakot:12455_rotatedrect_constructor
...
* Fix perpendicular decision of RotatedRect::RotatedRect
Error estimation is based on #12455 .
* Fix abs to std::fabs and atan to std::atan
2018-09-13 22:26:05 +03:00
Alexander Alekhin
87b1100027
Merge pull request #12410 from terfendail:sum_wintr
2018-09-13 15:49:56 +03:00
Hamdi Sahloul
03b3be0f51
MSVC: Slience external/meaningless warnings
2018-09-12 20:02:13 +09:00
Vitaly Tuzov
0a5bd0ac8b
sum() implementation updated to use wide universal intrinsics
2018-09-10 20:16:38 +03:00
Vitaly Tuzov
2f929376ec
Fixed meanStdDev() implementation for the case input matrix has more than 4 channels
2018-09-10 20:05:45 +03:00
Alexander Alekhin
95dd4b3f27
bindings: add debug helpers for args conversions
2018-09-08 12:23:08 +00:00
cyy
286c2c236b
Merge pull request #12458 from DEEPIR:3.4
...
* may be an typo fix
* remove identical branch,may be paste error
* add parentheses around macro parameter
* simplify if condition
* check malloc fail
* change the condition of branch removed by commit 3041502861
2018-09-07 18:43:47 +03:00
Hamdi Sahloul
a39e0daacf
Utilize CV_UNUSED macro
2018-09-07 20:33:52 +09:00
cyy
8b48c2a10c
Merge pull request #12443 from DEEPIR:master
...
* simplify condition
* dims must > 0 or latter sz[dims-1] will underflow
2018-09-06 23:09:39 +03:00
Alexander Alekhin
f1f15841d7
Merge pull request #11630 from alalek:c_api_eliminate_constructors
2018-09-06 20:07:16 +00:00
Vadim Pisarevsky
80b62a41c6
Merge pull request #12411 from vpisarev:wide_convert
...
* rewrote Mat::convertTo() and convertScaleAbs() to wide universal intrinsics; added always-available and SIMD-optimized FP16<=>FP32 conversion
* fixed compile warnings
* fix some more compile errors
* slightly relaxed accuracy threshold for int->float conversion (since we now do it using single-precision arithmetics, not double-precision)
* fixed compile errors on iOS, Android and in the baseline C++ version (intrin_cpp.hpp)
* trying to fix ARM-neon builds
* trying to fix ARM-neon builds
* trying to fix ARM-neon builds
* trying to fix ARM-neon builds
2018-09-06 19:36:59 +03:00
Alexander Alekhin
8a3c394d6a
don't use constructors for C API structures
2018-09-06 14:34:16 +03:00
Alexander Alekhin
ad146e5a6b
core: remove constructors from C API structures
...
POD structures can't have constructors.
2018-09-06 14:34:09 +03:00
woody.chow
07c6b0a8df
Replace naive normL2Sqr with hal::normL2Sqr_ for better performance
2018-09-06 17:43:07 +09:00
Zhenqing Hu
d621a99e73
To fix bug #12389 , which is a validation for the potential null pointer.
...
Signed-off-by: Zhenqing Hu <huzq85@gmail.com>
2018-09-05 19:14:34 +03:00
Alexander Alekhin
acce95f446
backport fixes for static analyzer warnings
...
Commits:
- 09837928d9
- 10fb88d027
Excluded changes with std::atomic (C++98 requirement)
2018-09-04 16:49:42 +03:00
Alexander Alekhin
4f9db2c03a
Merge pull request #12377 from alalek:inputarray_raw_wrapper_customtype
2018-09-04 11:29:23 +00:00
Alexander Alekhin
a0f86479e0
core: wrap custom types via _RawArray (raw() call)
...
- support passing of `std::vector<KeyPoint>` via InputArray
2018-09-03 18:41:48 +00:00
LaurentBerger
d71812425a
Solves issue 12392
2018-09-03 19:30:49 +02:00
Alexander Alekhin
70a27c7dd6
core: add solveLP type checks for output
...
to forbid Mat1f
Checks are not reliable: empty uninitialized `cv::Mat` has `CV_8UC1` type
2018-09-01 14:51:44 +00:00
Vitaly Tuzov
c894fc5bae
countNonZero function reworked to use wide universal intrinsics instead of SSE2 intrinsics
2018-08-30 20:43:39 +03:00
Alexander Alekhin
4e0d2a3e6c
Merge pull request #12193 from alalek:fix_vaapi_sample
2018-08-27 20:56:20 +00:00
Alexander Alekhin
5ac9a2a7d0
Merge pull request #12219 from alalek:fix_assert_messages
2018-08-21 12:46:35 +00:00
Rostislav Vasilikhin
378cf2ab63
fixed filename slash processing
2018-08-20 18:02:49 +03:00
Alexander Alekhin
d2e08a524e
core: repair CV_Assert() messages
...
Multi-argument CV_Assert() is accessible via CV_Assert_N() (with malformed messages).
2018-08-15 17:43:10 +03:00
Alexander Alekhin
4910f16f16
core(libva): support YV12 too
...
Added to CPU path only.
OpenCL code path still expects NV12 only (according to Intel OpenCL extension)
2018-08-10 16:02:05 +03:00
Alexander Alekhin
5b3ac112fe
core: move const tables outside of dispatched code
...
To avoid duplicates in binaries
2018-08-08 17:54:54 +03:00
Alexander Alekhin
7453a6938a
core(test): extra tests/fixes for merge/split ( #12171 )
...
* core(test): merge hang test
* core(merge/split): fix intrin optimization
2018-08-07 18:11:05 +03:00
Maksim Shabunin
f0f652f9e9
Merge pull request #12161 from alalek:cache_dump_neighbor_directories_for_cleanup
2018-08-07 12:09:39 +00:00
Alexander Alekhin
9eaa583bfb
core: dump neighbour cache directories (from old OpenCV versions)
...
- use '3.4.x' cache name for current maintenance series (there are no serious changes between releases)
- message is shown only once during creation of new cache directory
- use OPENCV_CACHE_SHOW_CLEANUP_MESSAGE=0 to hide this warning
2018-08-06 19:05:35 +03:00
Vadim Pisarevsky
23022f3ffb
Merge pull request #12121 from maver1:amatyuko/sse2_convert_with_saturation_fix
2018-08-06 14:26:37 +00:00
Alexander Alekhin
3082ea82f9
core(merge): fix SIMD loop head processing
2018-08-02 17:23:19 +03:00
amatyuko
3ea2586a5a
Fix for SSE2 intrinsics problem in the part of saturation arithmetic processing during 32s->16u packed conversion -
...
for some big negative values less than -INT_MAX+32767 the sign of the numbers is lost due to overflow that leads to
incorrect saturation to MAX value, instead of zero.
The issue is not reproduced with CV_ENABLED_INTRINSICS=OFF
2018-08-01 16:04:08 +03:00
luz.paz
2003eb1b9b
Misc. typos
...
Found via `codespell -q 3 -I ../opencv-whitelist.txt --skip="./3rdparty"`
2018-07-31 18:44:23 +03:00
Alexander Alekhin
7e71b1079e
Merge pull request #12103 from alalek:ocl_fix_crash
2018-07-31 14:55:43 +00:00
Alexander Alekhin
28d0e97c09
Merge pull request #12004 from mshabunin:more-asserts
2018-07-30 16:42:09 +00:00
Alexander Alekhin
5bde800ee3
Merge pull request #12088 from alalek:ocl_callback_catch_exceptions
2018-07-30 16:33:41 +00:00
Alexander Alekhin
e90e398e7a
core(ocl): do not split refcount operations / compare
...
- check result from CV_XADD() directly
- decrease urefcount after unmap() call only
2018-07-30 15:42:09 +03:00
miaow1988
2988260107
Fixed the int size overflow bug of cv::Mat.push_back().
...
Changed the type of variable *r* from int to size_t.
This change makes sure that a valid result of std::max(r + delta,
(r*3+1)/2) can be passed into the reserve function.
2018-07-30 18:36:19 +08:00
Alexander Alekhin
89528d7c3a
core(ocl): don't expose exceptions from OpenCL callback
...
to avoid silent crashes of OpenCL worker threads.
2018-07-28 10:29:26 +00:00
Maksim Shabunin
1165fdd0f5
Added more strict checks for empty inputs to compare, meanStdDev and RNG::fill
2018-07-26 18:06:38 +03:00
Maksim Shabunin
597db69151
ts: test case list is printed after cmd line parsing, refactored
2018-07-26 16:43:43 +03:00
Vadim Pisarevsky
43820d89b4
further improvements in split & merge; started using non-temporary store instructions ( #12063 )
...
* 1. changed static const __m128/256 to const __m128/256 to avoid wierd instructions and calls inserted by compiler.
2. added universal intrinsics that wrap MOVNTPS and other such (non-temporary or "no cache" store) instructions. v_store_interleave() and v_store() got respective flags/overloaded variants
3. rewrote split & merge to use the "no cache" store instructions. It resulted in dramatic performance improvement when processing big arrays
* hopefully, fixed some test failures where 4-channel v_store_interleave() is used
* added missing implementation of the new universal intrinsics (v_store_aligned_nocache() etc.)
* fixed silly typo in the new intrinsics in intrin_vsx.hpp
* still trying to fix VSX compiler errors
* still trying to fix VSX compiler errors
* still trying to fix VSX compiler errors
* still trying to fix VSX compiler errors
2018-07-26 12:04:28 +03:00
Alexander Alekhin
5336b9ad19
Merge pull request #12048 from mshabunin:fix-static-2
2018-07-24 19:45:27 +00:00
Vadim Pisarevsky
9c7040802c
converted split() & merge() to wide univ intrinsics ( #12044 )
...
* fixed/updated v_load_deinterleave and v_store_interleave intrinsics; modified split() and merge() functions to use those intrinsics
* fixed a few compile errors and bug in v_load_deinterleave(ptr, v_uint32x4& a, v_uint32x4& b)
* fixed few more compile errors
2018-07-24 17:27:56 +03:00
Maksim Shabunin
cbb1e867e5
More issues found by static analysis
2018-07-24 16:04:42 +03:00
Maksim Shabunin
e0603bb45f
Fixed several issues found by static analysis tools
2018-07-23 17:22:47 +03:00
Alexander Alekhin
f3ee07ca11
Merge pull request #11986 from alalek:build_eliminate_gcc8_warnings
2018-07-17 15:41:36 +00:00