opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-15 18:09:11 +08:00

Author	SHA1	Message	Date
Alexander Alekhin	762481411d	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-10-15 16:44:47 +00:00
zoom	d816442e4d	Make Unsqueeze layer support negative axes.	2022-10-14 18:00:19 +08:00
Zihao Mu	0fa43e3aac	Optimize the winograd futher more.	2022-10-14 10:15:45 +08:00
zoom	9119692bb8	let StridedSlice layer support const input	2022-10-12 11:50:44 +08:00
Alexander Smorkalov	ec26541771	Merge pull request #22577 from zihaomu:Disable_winograd_branch_in_tryquantize DNN: add enableWinograd API for Net	2022-10-11 09:44:00 +03:00
Zihao Mu	d9eff7daeb	parse quantized nodes does not rely on name.	2022-10-10 17:08:46 +08:00
Alexander Smorkalov	3419e64dcf	Merge pull request #22611 from zihaomu:greaterOrEqual DNN: support GreaterOrEqual and LessOrEqual op in ONNX	2022-10-10 11:43:44 +03:00
Zihao Mu	1e2ceca4df	add enableWinograd API for Net.	2022-10-09 09:33:07 +08:00
Alexander Alekhin	347246901e	Merge pull request #21745 from alalek:dnn_plugin_openvino	2022-10-08 22:32:25 +00:00
Zihao Mu	9821fae59d	add greater_or_equal and less_or_equal ONNX support	2022-10-08 15:51:40 +08:00
Alexander Alekhin	43b2bb2c25	dnn: plugin support for OpenVINO	2022-10-07 16:57:31 +00:00
Alexander Smorkalov	96844b0ca5	Merge pull request #22554 from WanliZhong:slice_axes_no_seq DNN: Let Slice layer support non-sequential and negative axes	2022-10-03 10:15:55 +03:00
zoom	4557971481	enhance slice layer refactor the code for parsing Slice layer add test for Slice layer let 'begin' and 'end' resize to dims add opset message comment	2022-10-01 17:12:07 +08:00
Zihao Mu	15cfafb360	DNN: Remove unused code in onnx_importer.cpp	2022-09-29 10:53:43 +08:00
Alexander Smorkalov	a6274647a4	Merge pull request #21738 from rogday:gather add Gather implementation	2022-09-19 16:21:14 +03:00
Egor Smirnov	65f71ce2eb	add Gather implementation	2022-09-19 15:06:44 +03:00
Alexander Smorkalov	6aefb8e86f	Merge pull request #22290 from fengyuentau:naive_yolov7 Support for YOLOv7 ONNX (not simplified)	2022-09-19 14:43:18 +03:00
fengyuentau	4aef9b1c93	dnn: support yolov7 (not simplified)	2022-09-19 18:38:03 +08:00
Alexander Smorkalov	e1e9261450	Merge pull request #22479 from scottchou007:master Fix issues in opencv_test_dnn from conv48 kernels without bias	2022-09-16 09:05:55 +03:00
scottchou007	a3cb2020bc	Fix issues in opencv_test_dnn from conv48 kernels using uninitialized tensors when there is no bias.	2022-09-15 13:41:27 -07:00
Alexander Alekhin	65bdb3a544	dnn: eliminate GCC12 warning in total() call	2022-09-14 11:37:00 +00:00
Alexander Smorkalov	c2c8da2517	Merge pull request #22448 from Ichini24:reshape-permutations-fix changed names of permutations if Reshpe is in NHWC	2022-09-13 09:24:56 +03:00
wxsheng	4154bd0667	Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add resize.lasx.cpp for Loongson SIMD acceleration * Add imgwarp.lasx.cpp for Loongson SIMD acceleration * Add LASX acceleration support for dnn/conv * Add CV_PAUSE(v) for Loongarch * Set LASX by default on Loongarch64 * LoongArch: tune test threshold for Core/HAL.mat_decomp/15 Co-authored-by: shengwenxue <shengwenxue@loongson.cn>	2022-09-10 09:39:43 +03:00
Alexander Alekhin	ca7f964104	dnn: use inheritance for OpenVINO net impl	2022-09-06 18:05:00 +00:00
anton	337452b4c0	changed names of permutations if Reshpe is in NHWC	2022-09-03 19:02:41 +02:00
Zihao Mu	b69b1eae8f	fix bug 22450	2022-09-02 16:30:06 +08:00
Alexander Smorkalov	70fb1cd603	Merge pull request #22440 from zihaomu:fix_conv_bug	2022-08-30 07:01:05 +00:00
Alexander Smorkalov	d2c48b898c	Merge pull request #22306 from zihaomu:qgemm_and_squeeze_opset13_onnximporter	2022-08-30 06:33:57 +00:00
Zihao Mu	2d837efba7	add qgemm and squeeze op13 supported on ONNXImporter	2022-08-30 09:50:29 +08:00
Alexander Smorkalov	1fd45a1b85	Merge pull request #22362 from fengyuentau:conv_asym_pad_fuse Remove asymmetric padding in Conv layer since it is supported in CPU backend	2022-08-29 17:56:17 +03:00
Zihao Mu	2cd7e17b65	replace v_add with +	2022-08-29 17:15:35 +08:00
Alexander Smorkalov	2619099fe5	Merge pull request #22337 from zihaomu:load_ONNX_fp16_as_fp32 DNN: load fp16 ONNX model as fp32	2022-08-29 09:32:25 +03:00
fengyuentau	2959286eb5	tengine: supports conv with asymmetric padding	2022-08-29 02:51:26 +00:00
Zihao Mu	9638e34ab0	reuse WORDS_BIGENDIAN.	2022-08-27 07:42:38 +08:00
Zihao Mu	bb64db98d8	Further optimization of Conv2D, fused Conv_Add_Activation, bring latest code from ficus OpConv.fx. (#22401 )	2022-08-26 12:57:25 +03:00
Zihao Mu	7eaec9dd22	load fp16 as fp32 and align fp16 and double in onnx_graph_simplifie	2022-08-26 10:04:44 +08:00
Zihao Mu	5e92bf8e41	support silu activation in darknet	2022-08-22 10:51:29 +08:00
Alexander Alekhin	2ebdc04787	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-08-14 15:50:42 +00:00
fengyuentau	0cdff46725	tune for opencl	2022-08-14 17:47:48 +08:00
Alexander Smorkalov	bb71cb200e	Merge pull request #22199 from zihaomu:bug_fix_22195 DNN: Reduce Layer (add dynamic batch and ReduceSum support)	2022-08-11 12:59:51 +03:00
fengyuentau	e7e814fa8c	remove asymmetric padding checks	2022-08-10 19:52:44 +08:00
Zihao Mu	d4640f4647	support ReduceLayer without reshape layer.	2022-08-02 10:32:31 +08:00
Zihao Mu	57545653b1	replace new mish impl with softplus	2022-07-28 13:19:06 +08:00
Zihao Mu	3c5377ca1b	add another Mish graph simplifier.	2022-07-28 11:21:29 +08:00
HAN Liutong	e2bfe0ce76	Use "#if" instead of "#ifdef" for CV_SIMD128.	2022-07-21 03:23:57 +00:00
Zihao Mu	98c33c605d	batchsize dynamic is set to index 0.	2022-07-20 19:02:16 +08:00
rogday	ed69bcae2d	Merge pull request #21865 from rogday:nary_eltwise_layers Reimplementation of Element-wise layers with broadcasting support * init * semi-working initial version * add small_vector * wip * remove smallvec * add nary function * replace auto with Mat in lambda expr used in transform * uncomment asserts * autobuffer shape_buf & step_buf * fix a missing bracket * fixed a missing addLayer in parseElementWise * solve one-dimensional broadcast * remove pre_broadcast_transform for the case of two constants; fix missing constBlobsExtraInfo when addConstant is called * one autobuffer for step & shape * temporal fix for the missing original dimension information * fix parseUnsqueeze when it gets a 1d tensor constant * support sum/mean/min/max with only one input * reuse old code to handle cases of two non-constant inputs * add condition to handle div & mul of two non-constant inputs * use \|\| instead of or * remove trainling spaces * enlarge buf in binary_forward to contain other buffer * use autobuffer in nary_forward * generate data randomly and add more cases for perf * add op and, or & xor * update perf_dnn * remove some comments * remove legacy; add two ONNX conformance tests in filter * move from cpu_denylist to all_denylist * adjust parsing for inputs>=2 Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>	2022-07-19 06:14:05 +03:00
fengyuentau	1c7b71bf9e	define data_layout as unknown for pack	2022-07-14 19:27:20 +08:00
Zihao Mu	1b8fba8e26	support ReduceSum with two input and dynamic shape batch size in ReduceLayer.	2022-07-13 13:46:16 +08:00
Zihao Mu	45fbb67aba	fix scale layer can not handle 1x1 weight correctly.	2022-07-13 11:25:27 +08:00
Zihao Mu	139c443770	Merge pull request #22183 from zihaomu:fastConv_ARMv7_compatible DNN: ARMv7 compatible fastConv * support armv7 on fastConv * remove whitespace.	2022-07-07 13:23:08 +03:00
Zihao Mu	a80fcacd90	Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor Add per_tensor_quantize to int8 quantize * add per_tensor_quantize to dnn int8 module. * change api flag from perTensor to perChannel, and recognize quantize type and onnx importer. * change the default to hpp	2022-07-05 19:14:42 +03:00
Zihao Mu	59b870a87a	Merge pull request #21910 from zihaomu:fast_conv_ARM DNN: Accelerating convolution * Fast Conv of ARM, X86 and universal intrinsics. * improve code style. * error fixed. * improve the License * optimize memory allocated and Adjust the threshold. * change FasterRCNN_vgg16 to 2GB memory.	2022-07-01 13:03:15 +03:00
Zihao Mu	ef94275eb6	bug fixed of GEMM node in ONNX_importer	2022-06-22 21:08:48 +08:00
Wanli	a6ca48a1c2	Merge pull request #22100 from WanliZhong:issue_22015 Fix issue 22015, let Clip layer support 1-3 inputs * Fix issue 22015. Let layer Clip support 1-3 inputs. * Resolve other problems caused by modifications * Update onnx_importer.cpp added extra checks to min/max handling in Clip * Add assertions to check the size of the input * Add test for clip with min and max initializers * Separate test for "clip_init_min_max". Change the check method for input_size to provide a clearer message in case of problem. * Add tests for clip with min or max initializers * Change the implementation of getting input Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>	2022-06-22 14:21:16 +03:00
Zihao Mu	2411b825b4	bug fixed of GEMM node in ONNX_importer	2022-06-22 15:00:17 +08:00
Alexander Alekhin	583bd1a6e2	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-06-04 19:10:35 +00:00
Namgoo Lee	24547f40ff	remove const from functions returning by value	2022-05-26 21:30:41 +09:00
Alexander Alekhin	978dc76653	Merge pull request #22006 from rogday:21947_fix	2022-05-24 19:26:02 +00:00
rogday	a2ad997e97	fix vector access in TF::sortByExecutionOrder	2022-05-24 00:05:13 +03:00
berak	50d7c61c01	Update darknet_importer.cpp make it more obvious, that this is a '404', not a 'parsing' problem	2022-05-23 19:18:31 +02:00
Alexander Alekhin	d9bf522b27	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-05-23 16:06:14 +00:00
rogday	93dc0679ec	Merge pull request #21818 from rogday:revert_renaming * add prefixes to layer names and layer output names * dnn: OPENCV_DNN_ONNX_USE_LEGACY_NAMES runtime parameter Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2022-05-23 14:50:42 +00:00
Alexander Alekhin	bb5462e327	Merge pull request #21991 from fengyuentau:qconv_asympad	2022-05-19 17:20:04 +00:00
fengyuentau	ff88132620	support asymmetric paddings for qconv	2022-05-16 19:01:37 +08:00
OpenCV Developers	d9a444ca1a	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-05-14 11:23:21 +00:00
Yulv-git	15ac54d5d6	Fix some typos in modules/.	2022-04-30 13:40:07 +08:00
Zihao Mu	64ded50bbf	parsing depth2space and space2depth of ONNX importer	2022-04-29 10:17:02 +08:00
rogday	9cd5a0a1e6	Merge pull request #21884 from rogday:cuda_cleanup Fix CUDA compilation issues and adjust thresholds. * Fix CUDA compilation issues and adjust thresholds. * add conformance tests to denylist	2022-04-19 16:40:25 +00:00
OpenCV Developers	2985739b8c	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-04-16 14:41:15 +00:00
rogday	a2b84e9897	add assert to tf graph simplifier to address security concerns	2022-04-13 22:50:27 +03:00
zihaomu	e36948cfbc	add ONNX OP sign, shrink and reciprocal	2022-04-07 15:32:12 +08:00
Alexander Alekhin	a233982931	Merge pull request #20938 from JulieBar:lstm_cuda2	2022-04-01 22:10:08 +00:00
Zihao Mu	7b582b71ba	Merge pull request #21036 from fengyuentau:timvx_backend_support dnn: TIM-VX NPU backend support * Add TimVX NPU backend for DNN module. * use official branch from tim-vx repo; fix detecting viv sdk Co-authored-by: fytao <yuantao.feng@outlook.com>	2022-03-31 21:42:11 +00:00
Smirnov Egor	abebbf04b1	Add CUDA support for LSTM. Co-authored-by: Julia Bareeva <jbareeva@gmail.com>	2022-03-31 16:38:22 +03:00
Alexander Alekhin	5e434073d4	Merge pull request #21796 from alalek:dnn_reduce_fixup_21601	2022-03-30 22:26:28 +00:00
Alexander Alekhin	6f5cf8c15f	dnn: fix ReduceLayer implementation, update OpenVINO tests	2022-03-30 20:03:41 +00:00
Alexander Alekhin	1339ebaa84	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-03-26 16:00:28 +00:00
Alexander Alekhin	c9b90884da	Merge pull request #21601 from zihaomu:add_reduceLayer	2022-03-26 10:20:10 +00:00
luz paz	8e8e4bbabc	dnn: fix various dnn related typos Fixes source comments and documentation related to dnn code.	2022-03-23 18:12:12 -04:00
Alexander Alekhin	4c79318694	dnn: fix index access	2022-03-19 06:54:07 +00:00
Zihao Mu	b6b5c27cec	Support for some reduce layers for onnx	2022-03-18 10:19:13 +08:00
Alexander Alekhin	685797f403	Merge pull request #21662 from alalek:dnn_split	2022-03-17 16:09:17 +00:00
rogday	93353aea70	Merge pull request #21522 from rogday:lstm Fix LSTM support in ONNX * fix LSTM and add peephole support * disable old tests * turn lambdas into functions * more hacks for c++98 * add assertions * slice fixes * backport of cuda-related fixes * address review comments	2022-03-15 09:14:05 +03:00
Alexander Alekhin	5bf3c1df24	Merge pull request #21715 from ilyachur:change_type_info_creation	2022-03-14 09:18:58 +00:00
Ilya Churaev	419918076e	Changed call of NodeTypeInfo constructor	2022-03-14 10:55:33 +03:00
Alexander Alekhin	a120adde63	dnn: add dnn.cpp file with information about git commits history	2022-03-08 19:22:47 +00:00
Alexander Alekhin	a80af177b6	dnn: split dnn.cpp code base commit: `19926e2979` original dnn.cpp content: `19926e2979/modules/dnn/src/dnn.cpp`	2022-03-08 19:22:46 +00:00
Tsukasa Sugiura	8db7d435b9	Merge pull request #21692 from UnaNancyOwen:add_softmax * add apply softmax option to ClassificationModel * remove default arguments of ClassificationModel::setSoftMax() * fix build for python * fix docs warning for setSoftMax() * add impl for ClassficationModel() * fix failed build for docs by trailing whitespace * move to implement classify() to ClassificationModel_Impl * move to implement softmax() to ClassificationModel_Impl * remove softmax from public method in ClassificationModel	2022-03-07 20:26:15 +00:00
Alexander Alekhin	901e0ddfe4	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-03-05 19:46:28 +00:00
Alexander Alekhin	5cc27fd3b5	Merge pull request #21542 from rogday:split_expand	2022-02-28 22:38:24 +00:00
Egor Smirnov	375fe81311	fix slice and expand	2022-02-28 17:18:07 +03:00
Yuantao Feng	f77c3574af	Merge pull request #21607 from fengyuentau:fix_FaceDetectorYN_dynamic_shape Use YuNet of fixed input shape to fix not-supported-dynamic-zero-shape for FaceDetectorYN * use yunet with input of fixed shape * update yunet used in face recognition regression	2022-02-21 13:49:07 +00:00
Alexander Alekhin	19926e2979	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-11 17:32:37 +00:00
Alexander Alekhin	effce0573b	dnn: drop legacy Inference Engine NN builder API	2022-02-10 11:55:24 +00:00
Alexander Alekhin	57d3002ee1	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-06 16:10:43 +00:00
Alexander Alekhin	1da48beeec	dnn(ngraph): fix output names	2022-02-06 13:08:53 +00:00
Alexander Alekhin	b57ff73086	dnn(ngraph): fix outputs handling, drop 'unconnected' logic	2022-02-06 13:08:53 +00:00
Alexander Alekhin	67978b5746	dnn(ngraph): add debuging messages	2022-02-06 13:08:53 +00:00
Alexander Alekhin	062f305d1a	dnn: don't fuse 'outputs' with OpenVINO backend	2022-02-06 13:08:53 +00:00
Alexander Alekhin	aa5bc20c83	dnn(ngraph): fixup get_output_as_single_output_node() replacement patch	2022-02-06 10:35:59 +00:00
Alexander Alekhin	d573472a86	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-31 12:53:45 +00:00
Alexander Alekhin	85719a0a5d	dnn: support outputs registration under new names - fixed ONNX importer	2022-01-29 23:29:51 +00:00
Alexander Alekhin	dc35633aa4	Merge pull request #21521 from alalek:dnn_ignore_denormals	2022-01-28 15:31:44 +00:00
Zihao Mu	9e3ba487fa	Merge pull request #21518 from zihaomu:resize_onnx_opset13 Add resize layer compatible with ONNX opset13 version	2022-01-28 17:55:01 +03:00
Alexander Alekhin	9188ce68aa	Merge pull request #21490 from rogday:optional_outputs	2022-01-26 15:18:07 +00:00
Alexander Alekhin	70b0274c8e	dnn: apply hint to ignore denormals processing	2022-01-26 11:28:35 +00:00
Alexander Alekhin	b796ededae	Merge pull request #21437 from alalek:dnn_api_explicit_const_4.x	2022-01-21 20:19:50 +00:00
Alexander Alekhin	eb7b45d26b	dnn: fix API - explicit ctors, const methods	2022-01-21 12:38:51 +00:00
Smirnov Egor	17b2d92a3d	add optional outputs support and fix graph links	2022-01-21 12:31:46 +03:00
Alexander Alekhin	6ffa2b01e1	Merge pull request #21357 from rogday:model_diag	2022-01-18 15:50:11 +00:00
rogday	0fe7420638	fix model diagnostic tool	2022-01-18 01:22:22 +03:00
Alexander Alekhin	b304730225	dnn: fix API - explicit ctors, const methods	2022-01-17 21:45:29 +00:00
Maksim Shabunin	d5f73f89d8	Fixed issues found by static analysis	2022-01-13 14:51:25 +03:00
Alexander Alekhin	aebb65e983	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-12 13:26:10 +00:00
Alexander Alekhin	80d9f624d0	dnn: don't use aligned load without alignment checks - weights are unaligned in dasiamprn sample (comes from numpy)	2022-01-12 05:11:18 +00:00
Alexander Alekhin	76fb3652fc	dnn(ocl): fix fp16 kernel compilation	2021-12-29 19:58:25 +00:00
Alexander Alekhin	9699e2b483	dnn(onnx): handle non-default ONNX domains - re-enable quantized models tests	2021-12-25 01:38:52 +00:00
Alexander Alekhin	217fea9667	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-24 16:48:07 +00:00
Alexander Alekhin	cdd4354256	Merge pull request #21336 from alalek:dnn_pooling_check_array_indexes	2021-12-24 08:35:11 +00:00
Alexander Alekhin	6385511e88	dnn: add checks in pooling layer implementation - to avoid out of buffer access	2021-12-24 00:15:30 +00:00
Alexander Alekhin	ed4becf007	dnn(onnx): debug dump of inputs/outputs/initializers in importer	2021-12-23 21:11:40 +00:00
Alexander Alekhin	f5589445b9	Merge pull request #21322 from alalek:dnn_catch_errors	2021-12-23 20:09:22 +00:00
Alexander Alekhin	88a18c8b6a	dnn(onnx): emit error in Shape for dynamic input	2021-12-23 15:42:59 +00:00
Alexander Alekhin	51e65db715	dnn(onnx): fix Resize inputs handling	2021-12-23 15:42:59 +00:00
Alexander Alekhin	cc02fcd889	dnn: improve debug messages, add ONNX opset version	2021-12-23 15:42:59 +00:00
Alexander Alekhin	c408157a4d	dnn: do not try to rebuilt network during setInput() - this doesn't make sense in case of multiple inputs	2021-12-23 02:40:33 +00:00
Alexander Alekhin	9777fbacf6	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-22 15:57:02 +00:00
rogday	0a178a687a	fix const/x in Div	2021-12-20 19:53:37 +03:00
Alexander Alekhin	80492d663e	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-18 16:19:06 +00:00
Smirnov Egor	71a22e45b0	add celu, hardsigmoid, selu, thresholdedrelu layers	2021-12-18 03:19:54 +03:00
Smirnov Egor	1bd382c1d0	Add acos, acosh, asin, asinh, atan, atanh, cos, cosh, erf, hardswish, sin, sinh, softplus, softsign, tan layers	2021-12-17 18:19:40 +03:00
Smirnov Egor	fec2c7e715	fix Flatten layer	2021-12-17 16:29:56 +03:00
Alexander Alekhin	622b9d9276	Merge pull request #21267 from mshabunin:fix-kw-2021-12	2021-12-16 18:51:47 +00:00
Gruhuang	b4bb98ea60	Merge pull request #21268 from pccvlab:tf_Arg add argmax and argmin parsing for tensorflow * add argmax and argmin for tf * remove whitespace * remove whitespace * remove static_cast Signed-off-by: Crayon-new <1349159541@qq.com>	2021-12-16 17:06:02 +00:00
Maksim Shabunin	792b7e0629	(3.4) Fixed several issues found by static analysis original commit: `a079c2eb7c`	2021-12-16 17:02:58 +00:00
Maksim Shabunin	a079c2eb7c	Fixed several issues found by static analysis	2021-12-16 19:21:25 +03:00
Alexander Alekhin	6d677bbd63	dnn(test): update ONNX conformance filters (4.x)	2021-12-16 12:09:31 +00:00
Alexander Alekhin	299f9837b7	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-15 16:38:56 +00:00
Smirnov Egor	e97c7e042b	fix max_unpool missing attributes, add default value of keepdims in reducemean/max/sum, add support for keepdims=true in full reduction branch, add new padding type to Pad	2021-12-14 22:09:27 +03:00
rogday	4827fe86bb	Merge pull request #21088 from rogday:onnx_tests Onnx conformance tests * Add ONNX conformance tests * dnn(test): add filters for ONNX conformance tests * add filter lists for OCV backend * address review comments * move test_clip_inbounds to all_denylist * address clip issue * avoid empty lists Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2021-12-14 16:58:06 +00:00
cqn2219076254	252ce0b581	add square layer	2021-12-13 21:43:13 +08:00
Alexander Alekhin	6e50e4b9ee	Merge pull request #21161 from rogday:elu_alpha_4x	2021-12-10 16:04:01 +00:00
HAN Liutong	1599f9f0c0	Merge pull request #21086 from hanliutong:rvv-dnn Further optimize DNN for RISC-V Vector. * Optimize DNN on RVV by using vsetvl. * Rename vl. * Update fastConv by using setvl instead of mask. * Fix fastDepthwiseConv	2021-12-10 16:03:22 +00:00
Gruhuang	17bc8565f6	Merge pull request #21154 from pccvlab:MatMul_with_two_inputs Add BatchMatMul layer support for tf_importer * two inputs * support batch_matmul * refactor: remove useless code * refactor: decrease nesting	2021-12-10 14:44:27 +03:00
Smirnov Egor	e608adea60	add ArgMax and ArgMin layers	2021-12-06 20:49:54 +03:00
HAN Liutong	4935b14539	Merge pull request #21012 from hanliutong:rvv_clang Update RVV backend for using Clang. * Update cmake file of clang. * Modify the RVV optimization on DNN to adapt to clang. * Modify intrin_rvv: Disable some existing types. * Modify intrin_rvv: Reinterpret instead of load&cast. * Modify intrin_rvv: Update load&store without cast. * Modify intrin_rvv: Rename vfredsum to fredosum. * Modify intrin_rvv: Rewrite Check all/any by using vpopc. * Modify intrin_rvv: Use reinterpret instead of c-style casting. * Remove all macros which is not used in v_reinterpret * Rename vpopc to vcpop according to spec.	2021-12-03 15:13:24 +00:00
Alexander Alekhin	8b4fa2605e	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-03 12:32:49 +00:00
Alexander Alekhin	35ff9af6ce	Merge pull request #21162 from rogday:softmax_simplification	2021-12-02 17:14:48 +00:00
Alexander Alekhin	dad2b9aac8	Merge pull request #21160 from rogday:elu_alpha	2021-12-02 17:13:57 +00:00
rogday	1613d30544	Merge pull request #21159 from rogday:ceil_mode fix ceil_mode for Average/MaxPooling * fix ceil_mode * add a comment	2021-12-02 20:11:11 +03:00
Alexander Alekhin	5da69c0b9a	Merge pull request #21164 from rogday:sum_identity	2021-12-01 22:49:02 +00:00
Alexander Alekhin	a806e8cc58	Merge pull request #21163 from rogday:transpose_default	2021-12-01 22:47:57 +00:00
Smirnov Egor	33e97e994d	add sum of 1 input	2021-11-30 15:42:20 +03:00
Smirnov Egor	11e6848bb9	add default order to transpose	2021-11-30 15:34:34 +03:00
Smirnov Egor	829410729c	add new (Log)SoftMax simplification passes	2021-11-30 15:20:52 +03:00
Smirnov Egor	4995aecd62	add alpha parameter to ELU	2021-11-30 14:43:18 +03:00
Smirnov Egor	0e2a3686c0	add alpha parameter to ELU layer	2021-11-30 12:20:35 +03:00
Alexander Alekhin	0d2857a242	Merge pull request #21152 from rogday:fix_defaults	2021-11-29 22:39:27 +00:00
Alexander Alekhin	17d99e6266	Merge pull request #21142 from alalek:dnn_two_inputs_ocl_fp16_3.4	2021-11-29 21:44:59 +00:00
Andrew Ryrie	ea7d4be3f8	Merge pull request #20658 from smbz:lstm_optimisation * dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load	2021-11-29 21:43:00 +00:00
Smirnov Egor	05db8784ae	fix Clip, LeakyReLU, LRN, Split defaults	2021-11-29 20:20:34 +03:00
Supernovae	b594ed99b8	Merge pull request #20933 from shubham-shahh:master Improved overall readability of the code * grid_nms.cu: minor fix-ups * Update grid_stride_range.hpp * Update tf_importer.cpp	2021-11-28 12:54:29 +00:00
Alexander Alekhin	58b06222ff	dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling	2021-11-28 07:44:05 +00:00
yuki takehara	a6277370ca	Merge pull request #21107 from take1014:remove_assert_21038 resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp	2021-11-27 18:34:52 +00:00
Hanxi Guo	1fcf7ba5bc	Merge pull request #20406 from MarkGHX:gsoc_2021_webnn [GSoC] OpenCV.js: Accelerate OpenCV.js DNN via WebNN * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Fix the build issue * Update concat_layer.cpp Still have some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Delete bib19450.aux * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Update dnn.cpp * Fix Error in dnn.cpp * Resolve duplication in conditions in convolution_layer.cpp * Fixed the issues in the comments * Fix building issue * Update tutorial * Fixed comments * Address the comments * Update CMakeLists.txt * Offer more accurate perf test on native * Add better perf tests for both native and web * Modify per tests for better results * Use more latest version of Electron * Support latest WebNN Clamp op * Add definition of HAVE_WEBNN macro * Support group convolution * Implement Scale_layer using WebNN * Add Softmax option for native classification example * Fix comments * Fix comments	2021-11-23 21:15:31 +00:00
Alexander Alekhin	394e640909	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-11-13 15:11:30 +00:00
Alexander Alekhin	8041ab8a61	Merge pull request #21025 from alalek:issue_21004 * dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16	2021-11-12 01:54:07 +03:00
ZaKiiiiiiiii	98b6ce353c	Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names	2021-11-09 19:24:04 +03:00
Alexander Alekhin	7842181b47	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-11-05 09:27:46 +00:00
Alexander Alekhin	d484939c02	Merge pull request #20999 from alalek:dnn_replace_deprecated_calls dnn(protobuf): replace deprecated calls * dnn: replace deprecated ByteSize() => ByteSizeLong() * dnn: replace deprecated calls, use GetRepeatedFieldRef	2021-11-03 15:59:36 +00:00
Alexander Alekhin	ec10f2e72b	Merge pull request #20877 from rogday:simple_layers	2021-10-20 17:00:38 +00:00
rogday	b3f966e2ca	Merge pull request #20883 from rogday:eltwise_refactoring * backport elementwise_layers refactor * keep NULL	2021-10-19 13:29:22 +00:00
Alexander Alekhin	1926e919be	dnn(int8): fix using of incorrect UMat constructor	2021-10-18 04:46:00 +00:00
Alexander Alekhin	31c40fa4cc	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-15 13:35:03 +00:00
Smirnov Egor	1feb3838b5	add Ceil, Floor, Log, Round, Sqrt, Not, Equal, Less, Greater	2021-10-15 16:02:46 +03:00
Alexander Alekhin	53d6c9b9c0	Merge pull request #20860 from rogday:sum_fix	2021-10-12 15:36:32 +00:00
Smirnov Egor	238dbffb48	change asserts for Sum	2021-10-11 20:59:44 +03:00
Smirnov Egor	a9d7b6eab7	fix const - input and remove unimplemented function	2021-10-11 18:58:10 +03:00
Alexander Alekhin	4672dbda2a	Merge pull request #20818 from rogday:yolov4x_mish_cuda	2021-10-08 19:12:43 +00:00
Smirnov Egor	9c84749e2c	backport YOLOv4x-mish new_coords CUDA implementation	2021-10-08 14:14:49 +03:00
Alexander Alekhin	cca4c47781	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-08 11:05:45 +00:00
Alexander Alekhin	81e7988eb9	Merge pull request #20840 from alalek:dnn_ocl_cleanup_code	2021-10-08 05:07:51 +00:00
Alexander Alekhin	8c2dd5fb9a	dnn(ocl4dnn): cleanup dead code, improve logging	2021-10-08 00:39:40 +00:00
Alexander Alekhin	724e04e979	dnn(ocl4dnn): add extra checks to convolution layer - prevent running code over unsupported/non-tested configurations - prevent integer div by zero	2021-10-07 23:18:32 +00:00
Alexander Alekhin	03a08435e2	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-07 04:27:22 +00:00
Alexander Alekhin	822d468232	Merge pull request #20813 from rogday:soft_nms	2021-10-06 20:20:34 +00:00
Smirnov Egor	2221dcc9f2	add SoftNMS implementation	2021-10-06 21:31:45 +03:00
Oliver Kuckertz	a3d7811f24	Merge pull request #20725 from mologie:fix-dnn-tf-on-arm * dnn: fix unaligned memory access crash on armv7 The getTensorContent function would return a Mat pointing to some member of a Protobuf-encoded message. Protobuf does not make any alignment guarantees, which results in a crash on armv7 when loading models while bit 2 is set in /proc/cpu/alignment (or the relevant kernel feature for alignment compatibility is disabled). Any read attempt from the previously unaligned data member would send SIGBUS. As workaround, this commit makes an aligned copy via existing clone functionality in getTensorContent. The unsafe copy=false option is removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact writes(!) to the Protobuf message. We limit ourselves to fixing the alignment issues in this commit, and add getTensorContentRefUnaligned to cover the write case with a safe memcpy. A FIXME marks the issue. * dnn: reduce amount of .clone() calls * dnn: update FIXME comment Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2021-10-06 16:41:05 +00:00
HAN Liutong	e5fb50476c	Merge pull request #20521 from hanliutong:dev-rvv-multiVLEN Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics. * Update fastGEMM for multi VLEN. * Update fastGEMM1T for multi VLEN. * Update fastDepthwiseConv for multi VLEN. * Update fastConv for multi VLEN. * Replace malloc with cv::AutoBuffer.	2021-10-05 15:35:00 +00:00
Alexander Alekhin	1b70f94282	Merge pull request #20782 from YashasSamaga:cuda4dnn-eltwise-broadcast	2021-10-04 22:35:00 +00:00
Jebastin Nadar	cce78cc5e2	Merge pull request #20535 from SamFC10:onnx-q dnn : int8 quantized layers support in onnx importer * added quantized layers support in onnx importer * added more cases in eltwise node, some more checks * added tests for quantized nodes * relax thresholds for failed tests, address review comments * refactoring based on review comments * added support for unsupported cases and pre-quantized resnet50 test * relax thresholds due to int8 resize layer	2021-10-04 18:07:38 +00:00
Zihao Mu	9085b933d8	Merge pull request #20702 from zihaomu:tf_expand_dim_layer Add ExpandDims layer of tf_importer.cpp * Add ExpandDims to tf_importer. * add -1 expand test case. * Support different dimensions of input. * Compatible with 5-dimensional NDHWC data * Code align * support 3-dim input. * 3-dim bug fixed. * fixing error of code format.	2021-10-04 16:37:38 +00:00
YashasSamaga	505dde09de	support broadcasting in eltwise ops	2021-10-04 12:38:45 +05:30
SamFC10	87ebf2e50b	fix illegal memory access in int8 convolution	2021-10-03 15:16:01 +05:30
Alexander Alekhin	37c3f0d8a0	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-02 17:57:18 +00:00
Alexander Alekhin	f977d10a19	dnn(ocl): fix conv DWCONV workgroup	2021-10-01 18:52:07 +00:00
Alexander Alekhin	846317ef37	dnn(ocl): fix conv BASIC workgroup	2021-09-29 14:55:46 +00:00
Alexander Alekhin	24fcb7f813	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-09-25 17:50:00 +00:00
rogday	38b9ec7a18	Merge pull request #20682 from rogday:min * Add Min layer to CPU, OpenCL, Halide, Inference Engine, NGraph and CUDA * fix indentation * add min to fusion and halide tests; fix doc	2021-09-22 15:17:37 +03:00

... 2 3 4 5 6 ...

1718 Commits