opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-14 08:59:11 +08:00

Author	SHA1	Message	Date
Paul E. Murphy	b465c82696	core: workaround old gcc vec_mul{e,o} (Issue #15506 ) ISA 2.07 (aka POWER8) effectively extended the expanding multiply operation to word types. The altivec intrinsics prior to gcc 8 did not get the update. Workaround this deficiency similar to other fixes. This was exposed by commit `33fb253a66` which leverages the int -> dword expanding multiply. This fixes Issue #15506	2019-09-12 09:54:02 -05:00
Alexander Alekhin	0a13633411	Merge pull request #15444 from alalek:ocl_fix_fft_kernel	2019-09-04 16:25:34 +00:00
Alexander Alekhin	8bd2720c28	core(ocl): fix fft kernel compilation - error: variables in the local address space can only be declared in the outermost scope of a kernel function	2019-09-03 15:46:53 +03:00
Alexander Alekhin	7e46766c8d	Merge pull request #15437 from devnexen:fbsd_opencl_build_fix	2019-09-03 12:21:02 +00:00
Alexander Alekhin	9ef5373776	Merge pull request #15435 from alalek:update_version_3.4.8-pre	2019-09-03 12:04:23 +00:00
Alexander Alekhin	abd7d63b74	Merge pull request #15424 from mshabunin:add-cmake-docs	2019-09-03 10:50:45 +00:00
David Carlier	6769ee3748	OpenCL: FreeBSD build fix	2019-09-02 18:30:53 +01:00
Alexander Alekhin	0fda243a05	pre: OpenCV 3.4.8 (version++)	2019-09-02 14:20:49 +03:00
Alexander Alekhin	048ddbf9ee	Merge pull request #15339 from pmur:dotprod-32s-vsx	2019-08-31 11:16:04 +00:00
Alexander Alekhin	2a6527e751	Merge pull request #15402 from ChipKerchner:normUnroll	2019-08-31 11:10:05 +00:00
Maksim Shabunin	f3aab47f94	Assorted documentation fixes * removed private flann documentation * common tutorial images moved to doc/images * grouping issues	2019-08-31 01:50:11 +03:00
Alexander Alekhin	f224d740a3	Merge pull request #15414 from kuzi117:instr	2019-08-30 12:03:19 +00:00
Braedy Kuzma	9bf8b496d6	Use commonly supported instruction mnemonic.	2019-08-29 10:00:40 -06:00
Braedy Kuzma	d4120dd2fe	Disambiguate vecpopcnt for (u)dword2.	2019-08-29 09:54:56 -06:00
Vitaly Tuzov	d134ec54c5	Extend tests for v_check_any and v_check_all intrinsics	2019-08-28 14:53:31 +03:00
Alexander Alekhin	ca7640e10f	Merge pull request #15401 from ChipKerchner:vectorReduceInt8Bug	2019-08-27 19:59:39 +00:00
ChipKerchner	288e6f9c07	Improve vectorization in the 'norm' functions	2019-08-27 12:15:19 -05:00
ChipKerchner	70b883cfeb	Fix macro bug with v_reduce_min and v_reduce_max for chars in VSX	2019-08-27 11:38:53 -05:00
Vitaly Tuzov	1b40528e1a	Fix for AVX2 implementation of v_check_any(), v_check_all() intrinsics	2019-08-27 14:31:23 +03:00
Alexander Alekhin	d7409604b5	core: handle empty Mat in Mat_ assignment operators	2019-08-23 16:54:24 +03:00
Alexander Alekhin	56e832ee43	Merge pull request #15372 from alalek:core_stat_fix_intrin	2019-08-22 20:52:54 +00:00
Alexander Alekhin	8a0b93bc4d	core: update fastmath.hpp	2019-08-22 16:43:07 +03:00
Alexander Alekhin	8b1fe8f6e0	core: fix stat SIMD code	2019-08-22 16:37:26 +03:00
Zyrin	869ea22f34	Use std::move in Mat_<T> move constructors	2019-08-21 11:12:00 +02:00
Zyrin	8ef8088686	Fix stack overflow on gcc with c++17 (#15343 )	2019-08-21 10:57:03 +02:00
Paul E. Murphy	33fb253a66	core: vectorize dotProd_32s Use 4x FMA chains to sum on SIMD 128 FP64 targets. On x86 this showed about 1.4x improvement. For PPC, do a full multiply (32x32->64b), convert to DP then accumulate. This may be slightly less precise for some inputs. But is 1.5x faster than the above which is about 1.5x than the FMA above for ~2.5x speedup.	2019-08-20 15:28:36 -05:00
luz.paz	fcc7d8dd4e	Fix modules/ typos Found using `codespell -q 3 -S ./3rdparty -L activ,amin,ang,atleast,childs,dof,endwhile,halfs,hist,iff,nd,od,uint` backporting of commit: `ec43292e1e`	2019-08-16 17:34:29 +03:00
Alexander Alekhin	13ecd5bb25	Merge pull request #15122 from pmur:fast-math-improvements	2019-08-14 19:28:05 +00:00
Alexander Alekhin	a703b9ed84	Merge pull request #15101 from alalek:cmake_initialization	2019-08-14 19:17:07 +00:00
Alexander Alekhin	32772a5436	3.4: backported changes from 'master' branch	2019-08-14 16:36:08 +03:00
Alexander Alekhin	7c96857c02	Merge pull request #15292 from alalek:build_warnings_xcode_10_3	2019-08-13 14:18:48 +00:00
Alexander Alekhin	15b8a8d935	build: eliminate warnings with Xcode 10.3	2019-08-13 15:06:13 +03:00
Hugo Lindström	935067ee05	Merge pull request #15265 from hugolm84:wince-armv7-supports-neon * WINCE 8.0 requires ARMv7 Thumb2 and thus have NEON instructions * Only add NEON if on _ARM_	2019-08-09 18:01:37 +03:00
Alexander Alekhin	5ef548a985	cmake: update initialization	2019-08-08 15:23:16 +03:00
Paul E. Murphy	f38a61c66d	fast_math: implement optimized PPC routines Implement cvRound using inline asm. No compiler support exists today to properly optimize this. This results in about a 4x speedup over the default rounding. Likewise, simplify the growing number of rounding function overloads. For P9 enabled targets, utilize the classification testing instruction to test for Inf/Nan values. Operation speedup is about 1.2x for FP32, and 1.5x for FP64 operands. For P8 targets, fallback to the GCC nan inline. It provides a 1.1/1.4x improvement for FP32/FP64 arguments.	2019-08-07 15:01:18 -05:00
Paul E. Murphy	3f92bcc11a	fast_math: selectively use GCC rounding builtins when available Add a new macro definition OPENCV_USE_FASTMATH_GCC_BUILTINS to enable usage of GCC inline math functions, if available and requested by the user. Likewise, enable it for POWER. This is nearly always a substantial improvement over using integer manipulation as most operations can be done in several instructions with no branching. The result is a 1.5-1.8x speedup in the ceil/floor operations. 1. As tested with AT 12.0-1 (GCC 8.3.1) compiler on P9 LE.	2019-08-07 15:01:18 -05:00
Paul E. Murphy	b2135be594	fast_math: add extra perf/unit tests Add a basic sanity test to verify the rounding functions work as expected. Likewise, extend the rounding performance test to cover the additional float -> int fast math functions.	2019-08-07 14:59:46 -05:00
Alexander Alekhin	821f17d666	Merge pull request #15235 from pmur:vsx-v_signmask-vbpermq	2019-08-06 20:09:22 +00:00
Victor Romero	987bb2ca61	Fix build for UWP backport of commit: `f18cbd036a`	2019-08-05 17:19:36 +03:00
Paul E. Murphy	1031b7f4bc	hal: vsx: further optimize v_signmask Use the quadword bit permutation instruction to creatively move the sign bits to create the mask. Note that values above 127 will result in 0.	2019-08-05 09:00:22 -05:00
Alexander Alekhin	ba934ff1ce	Merge pull request #15202 from hugolm84:support_build_shared_for_wince	2019-08-02 15:34:02 +00:00
Hugo Lindström	03fe1cb7fc	Support building shared libraries on WINCE.	2019-08-01 15:28:04 +02:00
Maksim Shabunin	6d5ac67681	Restored IPP call reduction	2019-07-31 15:41:22 +03:00
Maksim Shabunin	eec9fa9d5e	Merge pull request #15181 from berak:java_print_blob	2019-07-30 14:13:02 +00:00
berak	4d3989817c	java: fix Mat.toString() for higher dimensions	2019-07-29 19:39:09 +02:00
Alexander Alekhin	2693ed9b22	Merge tag '3.4.7'	2019-07-25 19:19:49 +00:00
Alexander Alekhin	4a7ca5a291	OpenCV version++ (3.4.7) OpenCV 3.4.7	2019-07-25 19:01:19 +00:00
Chip Kerchner	0db4fb1835	Merge pull request #15136 from ChipKerchner:dotProd_unroll * Unroll multiply and add instructions in dotProd_32f - 35% faster. * Eliminate unnecessary v_reduce_sum instructions.	2019-07-25 21:21:32 +03:00
Alexander Alekhin	6158bd2afa	Merge pull request #15103 from alalek:simd_intrinsics_in_user_code	2019-07-25 11:36:36 +00:00
Hugo Lindström	2ee00e7f7d	Merge pull request #15059 from hugolm84:improved-support-for-wince * Improve support for Windows Embedded Compact * Remove redundant set(WINCE true) and format CMake	2019-07-24 23:12:09 +03:00

1 2 3 4 5 ...

4177 Commits