Tomoaki Teshima
0db4b7cebb
fix build error on CUDA + Aarch64
...
* __fp16 doesn't exist on nvcc, but it slips through ifdef guard
2016-09-16 08:06:14 +09:00
Alexander Alekhin
c8d77fd961
Merge pull request #7233 from tomoaki0705:featureUniversalIntrinsicFp16
2016-09-07 14:42:46 +00:00
Tomoaki Teshima
903789f7af
use universal intrinsic for FP16
...
* use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
* define v_load_f16/v_store_f16 since v_load can't be distinguished when short pointer passed
* brush up implementation on old compiler (guard correctly)
* add test for v_load_f16 and round trip conversion of v_float16x4
* fix conversion error
2016-09-05 08:13:52 +09:00
Alexander Alekhin
d102ea96c0
Merge pull request #7210 from mshabunin:fix-warnings
2016-09-02 12:50:45 +00:00
Maksim Shabunin
28db4a2207
Merge pull request #7175 from tomoaki0705:featureIntrinsic64
2016-09-02 10:16:44 +00:00
Maksim Shabunin
dc704d77ac
Fixed several GCC 5.x warnings
2016-09-01 15:44:01 +03:00
Tomoaki Teshima
7fef96be1e
add 64F intrinsic in HAL NEON
...
* use universal intrinsic for accumulate series using float/double
* accumulate, accumulateSquare, accumulateProduct and accumulateWeighted
* add v_cvt_f64_high in both SSE/NEON
* add test for conversion v_cvt_f64_high in test_intrin.cpp
* improve some existing universal intrinsic by using new instructions in Aarch64
* add workaround for Android build in intrin_neon.hpp
2016-08-30 17:21:02 +09:00
Matthew Self
9678d48e1a
2-channel interleaved load/store for universal intrinsics (float only)
...
* Added 2-channel ops to match existing 3-channel and 4-channel ops
* v_load_deinterleave() and v_store_interleave()
* Implements float32x4 only on SSE (but all types on NEON and CPP)
* Includes tests
* Will be used to vectorize 2D functions, such as estimateAffine2D()
2016-08-26 18:17:08 -07:00
Vadim Pisarevsky
4f0f5a24ef
Merge pull request #7158 from sturkmen72:documentation_fix
2016-08-26 11:40:19 +00:00
Alexander Alekhin
da5ead2c23
Merge pull request #7166 from tomoaki0705:brushUpFp16
2016-08-25 11:49:23 +00:00
Tomoaki Teshima
c5d7791b67
brush up fp16 implementation
...
* DRY
* switch to Cv32suf and remove fp32Int32
* add Cv16suf
2016-08-25 05:31:25 +09:00
Vadim Pisarevsky
b03e3abd62
Merge pull request #7088 from wiryls:FileStorageJSON
2016-08-24 16:49:06 +00:00
Suleyman TURKMEN
fcc28fecb6
Make Up
2016-08-23 22:17:30 +03:00
Pavel Vlasov
30a6cee2fe
Instrumentation for OpenCV API regions and IPP functions;
2016-08-19 18:10:03 +03:00
Pavel Vlasov
680ca88ce0
Outdated ICV restrictions were removed;
2016-08-19 15:08:39 +03:00
Pavel Vlasov
a462d76a95
Implementations instrumentation framework for OpenCV performance system;
2016-08-17 16:26:03 +03:00
Pavel Vlasov
3860b8db02
IPP was enabled in mathfuncs_core;
...
Exp and Log IPP implementations are changed to hal interface;
2016-08-12 18:16:04 +03:00
Ilya Lavrenov
1b3f634843
fixed compilation warning
2016-08-12 14:23:15 +03:00
daB0bby
5cc1ef9022
fix code-style consistency
2016-08-11 10:15:48 +02:00
MYLS
8596e82d98
Add JSON support.
...
a JSON emitter, a parser, tests and some basic doc.
2016-08-11 00:53:15 +08:00
Vadim Pisarevsky
5eee757658
Merge pull request #6949 from wiryls:FileStorageBase64DocsTests
2016-08-10 09:26:13 +00:00
Vadim Pisarevsky
28d28a2607
Merge pull request #7024 from paroj:semanticidx
2016-08-08 15:19:36 +00:00
Vadim Pisarevsky
b316aa3710
Merge pull request #7043 from alalek:add_nonfree_flag
2016-08-08 15:18:49 +00:00
Tomoaki Teshima
2db2d137ce
follow the naming rule as other API
...
* stop using 'CUDA' prefix
* add explanation
2016-08-05 22:14:55 +09:00
Alexander Alekhin
18e7afdfd9
cmake: add OPENCV_ENABLE_NONFREE option and macro
2016-08-04 19:23:49 +03:00
Tomoaki Teshima
87ca607fd4
brush up convertFp16
...
* raise an error when wrong bit depth passed
* raise an build error when wrong depth is specified for cvtScaleHalf_
* remove unnecessary safe check in cvtScaleHalf_
* use intrinsic instead of direct pointer access
* update the explanation
2016-08-03 17:27:45 +09:00
Pavel Rojtberg
15dabc11b5
Mat: use row/ col instead of i0/ i1 for parameter names in 2D case
...
this improves documentation when using an IDE and reflects that Mats are
row major.
2016-08-02 13:57:51 +02:00
Tomoaki Teshima
2974b049e7
cudev: add feature to convert FP32(float) from/to FP16(half) on GPU
...
* add feature of Fp16 on GPU (cudev)
* add test
* leave template function as unimplemented to raise error
2016-08-01 00:55:16 +09:00
MYLS
3935855365
Merge remote-tracking branch 'refs/remotes/opencv/master' into FileStorageBase64DocsTests
...
# Conflicts:
# modules/core/test/test_io.cpp
2016-07-30 01:08:27 +08:00
MYLS
08911cbfae
change code
to snippet
and do some cleaning.
...
- use `@snippet` instead of `@code` in docs.
- remove some functions that were not used.
2016-07-30 00:35:41 +08:00
Tomoaki Teshima
c57f8780e9
show CPU feature correctly when FP16 is available
...
* make sure that CV_FP16 has the correct meaning
* check FP16 feature correctly
2016-07-29 14:10:33 +09:00
Alexander Alekhin
2ec63e4dd1
fix android pack build
2016-07-20 16:49:57 +03:00
MYLS
78ca5ddd45
solve errors and warnings
2016-07-19 19:56:57 +08:00
MYLS
617df09143
Modify Base64 functions and add test and documentation
...
Major changes:
- modify the Base64 functions to compatible with `cvWriteRawData` and so
on.
- add a Base64 flag for FileStorage and outputs raw data in Base64
automatically.
- complete all testing and documentation.
2016-07-19 15:54:38 +08:00
Vadim Pisarevsky
48b747903b
Merge pull request #6830 from tomoaki0705:featureSupportFp16
2016-07-18 15:56:00 +00:00
Vadim Pisarevsky
d62b0bd363
Merge pull request #6849 from alcinos:optflow_interface
2016-07-18 15:05:13 +00:00
Alexander Alekhin
55d0945149
Merge pull request #6932 from philippefoubert:bugfix_issue_6931
2016-07-18 13:23:56 +00:00
catree
7f64f31f66
Fix NAryMatIterator code example.
2016-07-17 23:21:15 +02:00
Philippe FOUBERT
d9a56f6590
Resolves issue #6931
2016-07-17 12:22:37 +02:00
Maksim Shabunin
b0fa477614
Merge pull request #6908 from ilya-lavrenov:reduce
2016-07-15 09:07:56 +00:00
Maksim Shabunin
154d61edf9
Merge pull request #6907 from ilya-lavrenov:typo
2016-07-15 09:07:19 +00:00
Maksim Shabunin
f695d4f04b
Merge pull request #6903 from jet47:cuda-stream-thread-safety-notes
2016-07-14 14:29:08 +00:00
Ilya Lavrenov
0af3947468
added note about cv::reduce output type for MIN/MAX
2016-07-14 17:22:08 +03:00
Ilya Lavrenov
c634e39299
removed extra semicolon
2016-07-14 15:50:35 +03:00
Maksim Shabunin
a00f0c44ae
Merge pull request #6821 from sturkmen72:TickMeter_class_addition
2016-07-14 11:40:46 +00:00
Vladislav Vinogradov
bfd6ae77f5
Add note that cv::cuda::Stream class is not thread safe
2016-07-14 14:20:37 +03:00
Vladislav Vinogradov
112903c2bd
increase minimal supported CUDA toolkit to 6.5
2016-07-13 13:02:13 +03:00
Alexander Alekhin
c8693f01a3
Merge pull request #6792 from paroj:jaccard_distance
2016-07-11 09:57:20 +00:00
Vladislav Sovrasov
a2ec23c112
Update cv::log documentation
2016-07-05 17:28:53 +03:00
Alexander Alekhin
7dc5332a92
Merge pull request #6745 from alalek:viz_samples
2016-07-01 15:27:01 +00:00