Alexander Alekhin
|
aa2777ed61
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2020-02-10 19:40:29 +03:00 |
|
Alexander Alekhin
|
9d0f16e8fe
|
Merge pull request #16505 from alalek:issue_16326
|
2020-02-10 15:51:49 +00:00 |
|
Ilya Lavrenov
|
41f444a230
|
CMAKE: more robust custom IE libraries support
backporting of commit 98d96f2cc8
|
2020-02-10 17:55:51 +03:00 |
|
Ilya Lavrenov
|
98d96f2cc8
|
CMAKE: more robust custom IE libraries support
|
2020-02-10 15:17:54 +03:00 |
|
Alexander Alekhin
|
f791cf825f
|
Merge pull request #16486 from alalek:cmake_use_target_compile_features
|
2020-02-08 20:01:12 +00:00 |
|
Alexander Alekhin
|
225566da7b
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2020-02-04 19:49:24 +03:00 |
|
Alexander Alekhin
|
85a59ef292
|
cmake: hide IPPICV symbols using --exclude-libs
|
2020-02-04 18:11:07 +03:00 |
|
Alexander Alekhin
|
01c1b341a8
|
cmake: enable C++11 implicitly
|
2020-02-03 17:45:37 +03:00 |
|
Rajkiran Natarajan
|
579dcf12bd
|
GCC 10 warnings not caught by CMake ignore regex
|
2020-02-03 06:39:14 +00:00 |
|
Alexander Alekhin
|
bf2f7b0f8b
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2020-02-01 17:26:00 +00:00 |
|
Sayed Adel
|
bd531bd828
|
core:vsx fix inline asm constraints
generalize constraints to 'wa' for VSX registers
|
2020-01-28 15:48:00 +02:00 |
|
Alexander Alekhin
|
f856c964e2
|
Merge pull request #16338 from alalek:abi_checker_exclude_modules
|
2020-01-28 12:58:45 +03:00 |
|
Alexander Alekhin
|
02f8a9470a
|
Merge pull request #16441 from alalek:abi_checker_exclude_modules_3.4
|
2020-01-27 13:26:23 +00:00 |
|
Alexander Alekhin
|
4af55e64e9
|
cmake: support OPENCV_ABI_SKIP_MODULES_LIST
backporting of commit f163778da7
|
2020-01-27 14:40:48 +03:00 |
|
Alexander Alekhin
|
3d14dd4e39
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2020-01-22 16:58:30 +03:00 |
|
Alexander Alekhin
|
8af87d5792
|
cmake: apply '-Wl,--as-needed' linker option
|
2020-01-21 19:09:38 +03:00 |
|
h6197627
|
fb3a334bbe
|
cmake: fix Linux system OpenEXR detection
|
2020-01-16 23:12:41 +02:00 |
|
Alexander Alekhin
|
f163778da7
|
cmake: support OPENCV_ABI_SKIP_MODULES_LIST
|
2020-01-13 17:17:47 +03:00 |
|
Alexander Alekhin
|
1996ae4a42
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-12-31 10:11:39 +00:00 |
|
Alexander Alekhin
|
d064079a4e
|
build: eliminate build warnings
|
2019-12-25 23:28:24 +00:00 |
|
Alexander Alekhin
|
c6c8783c60
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-12-16 21:30:30 +00:00 |
|
Tatsuro Shibamura
|
971ae00942
|
Merge pull request #16027 from shibayan:arm64-windows10
* Support ARM64 Windows 10 platform
* Fixed detection issue for ARM64 Windows 10
* Try enabling ARM NEON intrin
* build: disable NEON with MSVC compiler
* samples(directx): gdi32 dependency
|
2019-12-17 00:23:30 +03:00 |
|
Alexander Alekhin
|
ba7b0f4c54
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-12-15 11:23:46 +00:00 |
|
Alexander Alekhin
|
60ba6ef916
|
Merge pull request #16163 from alalek:fix_16122
|
2019-12-14 18:51:22 +00:00 |
|
Alexander Alekhin
|
424e1150ae
|
cmake: fix OpenCV_ARCH
|
2019-12-14 15:02:43 +00:00 |
|
Alexander Alekhin
|
a45928045a
|
Merge pull request #16150 from alalek:cmake_avoid_deprecated_link_private
* cmake: avoid deprecated LINK_PRIVATE/LINK_PUBLIC
see CMP0023 (CMake 2.8.12+)
* cmake: fix 3rdparty list
- don't include OpenCV modules
|
2019-12-13 17:52:40 +03:00 |
|
Alexander Alekhin
|
92b9888837
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-12-12 13:02:19 +03:00 |
|
Alexander Alekhin
|
6ea29a7696
|
cmake: prefer using CMAKE_SYSTEM_PROCESSOR / CMAKE_SIZEOF_VOID_P
Drop:
- discouraged CMAKE_CL_64
- MSVC64
- MINGW64
|
2019-12-11 00:21:10 +00:00 |
|
Pádraig Brady
|
7b298b0995
|
Fix pch generation when linker flags used with GCC
-c is required to avoid linking (and the associated missing "main" message)
when linker flags like "-Wl,-z,relro" are passed to GCC
|
2019-12-10 14:55:56 +00:00 |
|
Alexander Alekhin
|
1ddcfc5c68
|
gapi: update CMakeLists.txt, fix TBB dependency handling
|
2019-12-06 16:36:42 +03:00 |
|
Alexander Alekhin
|
4b0132ed7a
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-12-02 16:26:52 +03:00 |
|
Lubov Batanina
|
7523c777c5
|
Merge pull request #15537 from l-bat:ngraph
* Support nGraph
* Fix resize
|
2019-12-02 16:16:06 +03:00 |
|
Alexander Alekhin
|
01a28db949
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-11-29 18:57:33 +03:00 |
|
Alexander Alekhin
|
70146700aa
|
Merge pull request #15839 from alalek:core_simd_v_setall_template
|
2019-11-27 19:19:35 +00:00 |
|
Alexander Alekhin
|
fc41c18c6f
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-11-18 13:56:24 +03:00 |
|
Alexander Alekhin
|
6773b938b3
|
Merge pull request #15896 from alalek:build_gcc_9
|
2019-11-14 14:22:02 +00:00 |
|
Alexander Alekhin
|
7ecdcf6ca6
|
build: GCC9 compilation
|
2019-11-12 18:49:34 +03:00 |
|
Alexander Alekhin
|
b6a58818bb
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-11-11 20:25:42 +00:00 |
|
Alexander Alekhin
|
af92a517a7
|
cmake: set _WIN32_WINNT for Windows 7 API
|
2019-11-11 20:19:40 +00:00 |
|
Alexander Alekhin
|
d32d31577c
|
Merge pull request #15835 from alalek:cmake_cpu_optimizations_fix_15802
|
2019-11-10 09:22:45 +00:00 |
|
Alexander Smorkalov
|
377fcc062d
|
Merge pull request #15159 from SSE4:fix_find_openexr
|
2019-11-06 13:37:37 +00:00 |
|
SSE4
|
2e20f06f8e
|
- fix FindOpenEXR to respect OPENEXR_ROOT
Signed-off-by: SSE4 <tomskside@gmail.com>
|
2019-11-06 17:26:52 +07:00 |
|
Alexander Alekhin
|
a893969ec9
|
core(simd): v_setall template
|
2019-11-03 12:49:25 +00:00 |
|
Alexander Alekhin
|
21c38bbdaf
|
cmake(cpu optmizations): fix cleanup of OPENCV_DEPENDANT_TARGETS_* vars
|
2019-11-02 10:34:54 +00:00 |
|
Alexander Alekhin
|
055ffc0425
|
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
|
2019-10-24 18:21:19 +00:00 |
|
Yashas Samaga B L
|
613c12e590
|
Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low
CUDA backend for the DNN module
* stub cuda4dnn design
* minor fixes for tests and doxygen
* add csl public api directory to module headers
* add low-level CSL components
* add high-level CSL components
* integrate csl::Tensor into backbone code
* switch to CPU iff unsupported; otherwise, fail on error
* add fully connected layer
* add softmax layer
* add activation layers
* support arbitary rank TensorDescriptor
* pass input wrappers to `initCUDA()`
* add 1d/2d/3d-convolution
* add pooling layer
* reorganize and refactor code
* fixes for gcc, clang and doxygen; remove cxx14/17 code
* add blank_layer
* add LRN layer
* add rounding modes for pooling layer
* split tensor.hpp into tensor.hpp and tensor_ops.hpp
* add concat layer
* add scale layer
* add batch normalization layer
* split math.cu into activations.cu and math.hpp
* add eltwise layer
* add flatten layer
* add tensor transform api
* add asymmetric padding support for convolution layer
* add reshape layer
* fix rebase issues
* add permute layer
* add padding support for concat layer
* refactor and reorganize code
* add normalize layer
* optimize bias addition in scale layer
* add prior box layer
* fix and optimize normalize layer
* add asymmetric padding support for pooling layer
* add event API
* improve pooling performance for some padding scenarios
* avoid over-allocation of compute resources to kernels
* improve prior box performance
* enable layer fusion
* add const layer
* add resize layer
* add slice layer
* add padding layer
* add deconvolution layer
* fix channelwise ReLU initialization
* add vector traits
* add vectorized versions of relu, clipped_relu, power
* add vectorized concat kernels
* improve concat_with_offsets performance
* vectorize scale and bias kernels
* add support for multi-billion element tensors
* vectorize prior box kernels
* fix address alignment check
* improve bias addition performance of conv/deconv/fc layers
* restructure code for supporting multiple targets
* add DNN_TARGET_CUDA_FP64
* add DNN_TARGET_FP16
* improve vectorization
* add region layer
* improve tensor API, add dynamic ranks
1. use ManagedPtr instead of a Tensor in backend wrapper
2. add new methods to tensor classes
- size_range: computes the combined size of for a given axis range
- tensor span/view can be constructed from a raw pointer and shape
3. the tensor classes can change their rank at runtime (previously rank was fixed at compile-time)
4. remove device code from tensor classes (as they are unused)
5. enforce strict conditions on tensor class APIs to improve debugging ability
* fix parametric relu activation
* add squeeze/unsqueeze tensor API
* add reorg layer
* optimize permute and enable 2d permute
* enable 1d and 2d slice
* add split layer
* add shuffle channel layer
* allow tensors of different ranks in reshape primitive
* patch SliceOp to allow Crop Layer
* allow extra shape inputs in reshape layer
* use `std::move_backward` instead of `std::move` for insert in resizable_static_array
* improve workspace management
* add spatial LRN
* add nms (cpu) to region layer
* add max pooling with argmax ( and a fix to limits.hpp)
* add max unpooling layer
* rename DNN_TARGET_CUDA_FP32 to DNN_TARGET_CUDA
* update supportBackend to be more rigorous
* remove stray include from preventing non-cuda build
* include op_cuda.hpp outside condition #if
* refactoring, fixes and many optimizations
* drop DNN_TARGET_CUDA_FP64
* fix gcc errors
* increase max. tensor rank limit to six
* add Interp layer
* drop custom layers; use BackendNode
* vectorize activation kernels
* fixes for gcc
* remove wrong assertion
* fix broken assertion in unpooling primitive
* fix build errors in non-CUDA build
* completely remove workspace from public API
* fix permute layer
* enable accuracy and perf. tests for DNN_TARGET_CUDA
* add asynchronous forward
* vectorize eltwise ops
* vectorize fill kernel
* fixes for gcc
* remove CSL headers from public API
* remove csl header source group from cmake
* update min. cudnn version in cmake
* add numerically stable FP32 log1pexp
* refactor code
* add FP16 specialization to cudnn based tensor addition
* vectorize scale1 and bias1 + minor refactoring
* fix doxygen build
* fix invalid alignment assertion
* clear backend wrappers before allocateLayers
* ignore memory lock failures
* do not allocate internal blobs
* integrate NVTX
* add numerically stable half precision log1pexp
* fix indentation, following coding style, improve docs
* remove accidental modification of IE code
* Revert "add asynchronous forward"
This reverts commit 1154b9da9da07e9b52f8a81bdcea48cf31c56f70.
* [cmake] throw error for unsupported CC versions
* fix rebase issues
* add more docs, refactor code, fix bugs
* minor refactoring and fixes
* resolve warnings/errors from clang
* remove haveCUDA() checks from supportBackend()
* remove NVTX integration
* changes based on review comments
* avoid exception when no CUDA device is present
* add color code for CUDA in Net::dump
|
2019-10-21 14:28:00 +03:00 |
|
Alexander Alekhin
|
6e85e852d4
|
Merge pull request #15740 from xerus:fix_typo
|
2019-10-18 18:41:35 +00:00 |
|
Fei Wu
|
90af2835a2
|
Fix issue 15730.
|
2019-10-19 00:36:18 +08:00 |
|
Pavel Grunt
|
83e2e124a3
|
OpenCVFindMKL: Fix a typo
|
2019-10-18 18:33:06 +02:00 |
|
Alexander Alekhin
|
c4e2e17b0c
|
cmake: fix UWP scripts path
|
2019-10-16 21:42:31 +00:00 |
|