opencv

mirror of https://github.com/opencv/opencv.git synced 2025-01-11 23:18:11 +08:00

Author	SHA1	Message	Date
Rostislav Vasilikhin	12e2cc9502	Merge pull request #25491 from savuor:rv/hal_norm_hamming HAL for Hamming norm added #25491 fixes #25474 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2024-04-27 14:38:44 +03:00
Yuantao Feng	3afe8ddaf8	core: Rename `cv::float16_t` to `cv::hfloat` (#25217 ) * rename cv::float16_t to cv::fp16_t * add typedef fp16_t float16_t * remove zero(), bits() from fp16_t class * fp16_t -> hfloat * remove cv::float16_t::fromBits; add hfloatFromBits * undo changes in conv_winograd_f63.simd.hpp and conv_block.simd.hpp * undo some changes in dnn	2024-03-21 23:44:19 +03:00
Alexander Smorkalov	daa8f7dfc6	Partially back-port #25075 to 4.x	2024-03-05 12:15:39 +03:00
HAN Liutong	0dd7769bb1	Merge pull request #23980 from hanliutong:rewrite-core Rewrite Universal Intrinsic code by using new API: Core module. #23980 The goal of this PR is to match and modify all SIMD code blocks guarded by `CV_SIMD` macro in the `opencv/modules/core` folder and rewrite them by using the new Universal Intrinsic API. The patch is almost auto-generated by using the [rewriter](https://github.com/hanliutong/rewriter), related PR #23885. Most of the files have been rewritten, but I marked this PR as draft because, the `CV_SIMD` macro also exists in the following files, and the reasons why they are not rewrited are: 1. ~~code design for fixed-size SIMD (v_int16x8, v_float32x4, etc.), need to manually rewrite.~~ Rewrited - ./modules/core/src/stat.simd.hpp - ./modules/core/src/matrix_transform.cpp - ./modules/core/src/matmul.simd.hpp 2. Vector types are wrapped in other class/struct, that are not supported by the compiler in variable-length backends. Can not be rewrited directly. - ./modules/core/src/mathfuncs_core.simd.hpp ```cpp struct v_atan_f32 { explicit v_atan_f32(const float& scale) { ... } v_float32 compute(const v_float32& y, const v_float32& x) { ... } ... v_float32 val90; // sizeless type can not used in a class v_float32 val180; v_float32 val360; v_float32 s; }; ``` 3. The API interface does not support/does not match - ./modules/core/src/norm.cpp Use `v_popcount`, ~~waiting for #23966~~ Fixed - ./modules/core/src/has_non_zero.simd.hpp Use illegal Universal Intrinsic API: For float type, there is no logical operation `\|`. Further discussion needed ```cpp /** @brief Bitwise OR Only for integer types. / template<typename _Tp, int n> CV_INLINE v_reg<_Tp, n> operator\|(const v_reg<_Tp, n>& a, const v_reg<_Tp, n>& b); template<typename _Tp, int n> CV_INLINE v_reg<_Tp, n>& operator\|=(v_reg<_Tp, n>& a, const v_reg<_Tp, n>& b); ``` ```cpp #if CV_SIMD typedef v_float32 v_type; const v_type v_zero = vx_setzero_f32(); constexpr const int unrollCount = 8; int step = v_type::nlanes unrollCount; int len0 = len & -step; const float* srcSimdEnd = src+len0; int countSIMD = static_cast<int>((srcSimdEnd-src)/step); while(!res && countSIMD--) { v_type v0 = vx_load(src); src += v_type::nlanes; v_type v1 = vx_load(src); src += v_type::nlanes; .... src += v_type::nlanes; v0 \|= v1; //Illegal ? .... //res = v_check_any(((v0 \| v4) != v_zero));//beware : (NaN != 0) returns "false" since != is mapped to _CMP_NEQ_OQ and not _CMP_NEQ_UQ res = !v_check_all(((v0 \| v4) == v_zero)); } v_cleanup(); #endif ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [ ] I agree to contribute to the project under Apache 2 License. - [ ] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake	2023-08-11 08:33:33 +03:00
Dmitry Kurtaev	380caa1a87	Merge pull request #23691 from dkurt:pycv_float16_fixes Import and export np.float16 in Python #23691 ### Pull Request Readiness Checklist * Also, fixes `cv::norm` with `NORM_INF` and `CV_16F` resolves https://github.com/opencv/opencv/issues/23687 See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake	2023-05-26 18:56:21 +03:00
Sean McBride	58e4a880a2	Deprecated convertTypeStr and made new variant that also takes the buffer size This allows removing the unsafe sprintf.	2023-04-26 09:48:15 -04:00
Alexander Smorkalov	e4a29d93fe	Merge remote-tracking branch 'origin/3.4' into merge-3.4	2023-04-21 10:55:04 +03:00
Sean McBride	47bea69322	Merge pull request #23055 from seanm:sprintf2 * Replaced most remaining sprintf with snprintf * Deprecated encodeFormat and introduced new method that takes the buffer length * Also increased buffer size at call sites to be a little bigger, in case int is 64 bit	2023-04-18 09:22:59 +03:00
eplankin	fd8b346c3e	Merge pull request #23443 from eplankin:3.4 * Update IPPICV binaries (20230330) * Revert "core(IPP): disable some ippsMagnitude_32f calls" This reverts commit `8069a6b4f8`. * Reverted changes in norm() and count_non_zero()	2023-04-07 09:14:42 +00:00
Alexander Alekhin	735a79ae83	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-06-19 18:44:16 +00:00
Vincent Rabaud	c8268e65fd	Fix potential NaN in cv::norm. There can be an int overflow. cv::norm( InputArray _src, int normType, InputArray _mask ) is fine, not cv::norm( InputArray _src1, InputArray _src2, int normType, InputArray _mask ).	2021-06-15 14:58:11 +02:00
Alexander Alekhin	cbfd38bd41	core: rework code locality - to reduce binaries size of FFmpeg Windows wrapper - MinGW linker doesn't support -ffunction-sections (used for FFmpeg Windows wrapper) - move code to improve locality with its used dependencies - move UMat::dot() to matmul.dispatch.cpp (Mat::dot() is already there) - move UMat::inv() to lapack.cpp - move UMat::mul() to arithm.cpp - move UMat:eye() to matrix_operations.cpp (near setIdentity() implementation) - move normalize(): convert_scale.cpp => norm.cpp - move convertAndUnrollScalar(): arithm.cpp => copy.cpp - move scalarToRawData(): array.cpp => copy.cpp - move transpose(): matrix_operations.cpp => matrix_transform.cpp - move flip(), rotate(): copy.cpp => matrix_transform.cpp (rotate90 uses flip and transpose) - add 'OPENCV_CORE_EXCLUDE_C_API' CMake variable to exclude compilation of C-API functions from the core module - matrix_wrap.cpp: add compile-time checks for CUDA/OpenGL calls - the steps above allow to reduce FFmpeg wrapper size for ~1.5Mb (initial size of OpenCV part is about 3Mb) backport is done to improve merge experience (less conflicts) backport of commit: `65eb946756`	2021-03-02 23:24:28 +00:00
Alexander Alekhin	65eb946756	core: rework code locality - to reduce binaries size of FFmpeg Windows wrapper - MinGW linker doesn't support -ffunction-sections (used for FFmpeg Windows wrapper) - move code to improve locality with its used dependencies - move UMat::dot() to matmul.dispatch.cpp (Mat::dot() is already there) - move UMat::inv() to lapack.cpp - move UMat::mul() to arithm.cpp - move UMat:eye() to matrix_operations.cpp (near setIdentity() implementation) - move normalize(): convert_scale.cpp => norm.cpp - move convertAndUnrollScalar(): arithm.cpp => copy.cpp - move scalarToRawData(): array.cpp => copy.cpp - move transpose(): matrix_operations.cpp => matrix_transform.cpp - move flip(), rotate(): copy.cpp => matrix_transform.cpp (rotate90 uses flip and transpose) - add 'OPENCV_CORE_EXCLUDE_C_API' CMake variable to exclude compilation of C-API functions from the core module - matrix_wrap.cpp: add compile-time checks for CUDA/OpenGL calls - the steps above allow to reduce FFmpeg wrapper size for ~1.5Mb (initial size of OpenCV part is about 3Mb)	2021-03-02 11:27:58 +00:00
Alexander Alekhin	6fdb7aee84	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-12-04 18:26:58 +00:00
Jojo R	12b8d542b7	norm.cpp(normL2Sqr_): improve performance of pipeline The most of target machine use one type cpu unit resource to execute some one type of instruction, e.g. all vx_load API use load/store cpu unit, and v_muladd API use mul/mula cpu unit, we interleave vx_load and v_muladd to improve performance on most targets like RISCV or ARM.	2020-11-19 09:49:49 +08:00
Alexander Alekhin	198b5096aa	Merge pull request #16754 from alalek:issue_16752 * core(test): FP16 norm test * core: norm()-FP16 disable OpenCL * core(norm): fix 16f32f local buffer size	2020-03-07 19:06:47 +00:00
Alexander Alekhin	619180dffd	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2020-03-06 20:41:30 +00:00
Alexander Alekhin	34530da66e	core: fix coverity issues	2020-03-06 18:12:45 +00:00
Alexander Alekhin	8108fb0575	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-12-05 18:27:45 +03:00
Alexander Alekhin	72f35e0626	Merge pull request #16052 from alalek:issue_16040 * calib3d: use normalized input in solvePnPGeneric() * calib3d: java regression test for solvePnPGeneric * calib3d: python regression test for solvePnPGeneric	2019-12-05 15:36:39 +03:00
Alexander Alekhin	bea2c75452	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2019-09-05 14:29:22 +03:00
ChipKerchner	288e6f9c07	Improve vectorization in the 'norm' functions	2019-08-27 12:15:19 -05:00
Alexander Alekhin	2e0150e601	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2018-12-03 18:38:27 +03:00
Vitaly Tuzov	00c9ab8c23	Merge pull request #13317 from terfendail:norm_wintr * Added performance tests for hal::norm functions * Added sum of absolute differences intrinsic * norm implementation updated to use wide universal intrinsics * improve and fix v_reduce_sad on VSX	2018-11-29 19:34:14 +03:00
Alexander Alekhin	808ba552c5	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2018-09-14 23:44:35 +00:00
Hamdi Sahloul	5d54def264	Add semicolons after `CV_INSTRUMENT` macros	2018-09-14 06:45:31 +09:00
Vadim Pisarevsky	6d7f5871db	added basic support for CV_16F (the new datatype etc.) (#12463 ) * added basic support for CV_16F (the new datatype etc.). CV_USRTYPE1 is now equal to CV_16F, which may break some [rarely used] functionality. We'll see * fixed just introduced bug in norm; reverted errorneous changes in Torch importer (need to find a better solution) * addressed some issues found during the PR review * restored the patch to fix some perf test failures	2018-09-10 16:56:29 +03:00
Alexander Alekhin	d74b98c3d9	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2018-09-04 18:39:03 +00:00
Alexander Alekhin	acce95f446	backport fixes for static analyzer warnings Commits: - `09837928d9` - `10fb88d027` Excluded changes with std::atomic (C++98 requirement)	2018-09-04 16:49:42 +03:00
cyy	09837928d9	Merge pull request #12357 from DEEPIR:master * fix some static analyzer warnings * fix some static analyzer warnings * fix race condition of workthread control	2018-09-02 16:34:43 +03:00
Vadim Pisarevsky	051b40f956	a part of PR #11364 (extended findNonZero & PSNR) (#11837 ) * a part of https://github.com/opencv/opencv/pull/11364 by Tetragramm. Rewritten and extended findNonZero & PSNR to support more types, not just 8u. * fixed compile & doxygen warnings * fixed small bug in findNonZero test	2018-06-26 17:10:00 +03:00
Alexander Alekhin	856a07711b	core: disabled IPP AVX512 normL1(a, b, mask) for cv::Mat with type=16UC3 and width < 16	2018-04-27 12:57:53 +03:00
Alexander Alekhin	57dad685d1	core: disabled IPP AVX2 normL1(a, b, mask) for cv::Mat with width < 16	2018-04-26 13:35:25 +03:00
Maksim Shabunin	4437e0c3b9	Split stat.cpp into smaller pieces	2018-02-12 14:14:08 +03:00

34 Commits