opencv

mirror of https://github.com/opencv/opencv.git synced 2024-12-14 17:29:17 +08:00

Author	SHA1	Message	Date
zoom	5044af69d1	let MatMul can work when both two inputs are const	2022-11-27 17:32:41 +08:00
Alexander Smorkalov	6ca205a029	Merge pull request #22478 from WanliZhong:nary_eltwise_cuda DNN: Let part of the operators in nary_eltwise support CUDA	2022-11-22 16:15:50 +03:00
zihaomu	5bf64e7dfe	fix the infinite loop in tf importer of 3.4 branch	2022-11-15 11:42:10 +08:00
zoom	ef2677b0a6	Make MatMul layer support 3d or 4d operation with const input	2022-11-10 11:41:44 +08:00
zoom	11d492b0b9	Let part of the operators in nary_eltwise support cuda	2022-11-02 14:08:21 +08:00
Zihao Mu	17f2b56291	remove never used code in onnximporter	2022-11-02 10:45:16 +08:00
Alexander Alekhin	ee9137f176	Merge pull request #22725 from zihaomu:fix_infinit_loop_in_tf	2022-10-31 17:03:03 +00:00
Zihao Mu	903bf0147e	Merge pull request #22666 from zihaomu:support_onnx_qdq_model DNN: let Quant and Dequant of ONNX_importer support the Constant input. * let Quant and Dequant support the Constant input. * fix negative value of axis.	2022-10-31 16:06:31 +00:00
Zihao Mu	18fbb72f7d	fix the infinite loop in tf importer.	2022-10-31 20:10:25 +08:00
Alexander Smorkalov	23edec83fb	Merge pull request #22667 from zihaomu:bug_fix_in_winograd DNN: bug fixed in Winograd	2022-10-21 17:54:13 +03:00
Alexander Smorkalov	e4cd430710	Merge pull request #22653 from WanliZhong:issue22597 DNN-TF: let StridedSlice layer support const input	2022-10-21 17:51:00 +03:00
Dmitry Kurtaev	35b2cff295	Merge pull request #22656 from dkurt:halide_fixes * Fixes for Halide * Enable some Halide tests	2022-10-21 17:49:49 +03:00
Zihao Mu	cee8c86b6e	fixed bug at winograd of SIMD128 and more robust code.	2022-10-21 19:14:54 +08:00
Alexander Smorkalov	5d292826b2	Merge pull request #22593 from zihaomu:optimize_wino optimize winograd futher more	2022-10-19 13:08:32 +03:00
Alexander Smorkalov	f378f02954	Merge pull request #22652 from rogday:cuda_test_fixes Address CUDA-related errors	2022-10-19 09:37:12 +03:00
Zhi-Qiang Zhou	c8561eae2d	Update region_layer.cpp Fix objectness (dstData[index + 4]) is not assigned if new_coords == 1.	2022-10-19 11:17:23 +08:00
Smirnov Egor	dd14cf6a9c	address CUDA-related errors and enable cuda in elementwise ops	2022-10-18 16:54:42 +03:00
Alexander Smorkalov	ec7fc5adca	Merge pull request #22529 from fengyuentau:scatter_scatternd DNN: supports Scatter and ScatterND from ONNX	2022-10-17 14:57:46 +03:00
Alexander Smorkalov	02143cd0e2	Merge pull request #22531 from zihaomu:stop_rely_name Parsing quantized nodes does not rely on names	2022-10-17 11:20:24 +03:00
Alexander Smorkalov	1c5dcbcac8	Merge pull request #22639 from WanliZhong:issue#22625 DNN: Make Unsqueeze layer support negative axes	2022-10-17 09:27:49 +03:00
fengyuentau	d24d8f2abe	implementation of scatter and scatternd with conformance tests enabled	2022-10-17 11:30:32 +08:00
Alexander Alekhin	762481411d	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-10-15 16:44:47 +00:00
zoom	d816442e4d	Make Unsqueeze layer support negative axes.	2022-10-14 18:00:19 +08:00
Zihao Mu	0fa43e3aac	Optimize the winograd futher more.	2022-10-14 10:15:45 +08:00
zoom	9119692bb8	let StridedSlice layer support const input	2022-10-12 11:50:44 +08:00
Alexander Smorkalov	ec26541771	Merge pull request #22577 from zihaomu:Disable_winograd_branch_in_tryquantize DNN: add enableWinograd API for Net	2022-10-11 09:44:00 +03:00
Zihao Mu	d9eff7daeb	parse quantized nodes does not rely on name.	2022-10-10 17:08:46 +08:00
Alexander Smorkalov	3419e64dcf	Merge pull request #22611 from zihaomu:greaterOrEqual DNN: support GreaterOrEqual and LessOrEqual op in ONNX	2022-10-10 11:43:44 +03:00
Zihao Mu	1e2ceca4df	add enableWinograd API for Net.	2022-10-09 09:33:07 +08:00
Alexander Alekhin	347246901e	Merge pull request #21745 from alalek:dnn_plugin_openvino	2022-10-08 22:32:25 +00:00
Zihao Mu	9821fae59d	add greater_or_equal and less_or_equal ONNX support	2022-10-08 15:51:40 +08:00
Alexander Alekhin	43b2bb2c25	dnn: plugin support for OpenVINO	2022-10-07 16:57:31 +00:00
Alexander Smorkalov	96844b0ca5	Merge pull request #22554 from WanliZhong:slice_axes_no_seq DNN: Let Slice layer support non-sequential and negative axes	2022-10-03 10:15:55 +03:00
zoom	4557971481	enhance slice layer refactor the code for parsing Slice layer add test for Slice layer let 'begin' and 'end' resize to dims add opset message comment	2022-10-01 17:12:07 +08:00
Zihao Mu	15cfafb360	DNN: Remove unused code in onnx_importer.cpp	2022-09-29 10:53:43 +08:00
Alexander Smorkalov	a6274647a4	Merge pull request #21738 from rogday:gather add Gather implementation	2022-09-19 16:21:14 +03:00
Egor Smirnov	65f71ce2eb	add Gather implementation	2022-09-19 15:06:44 +03:00
Alexander Smorkalov	6aefb8e86f	Merge pull request #22290 from fengyuentau:naive_yolov7 Support for YOLOv7 ONNX (not simplified)	2022-09-19 14:43:18 +03:00
fengyuentau	4aef9b1c93	dnn: support yolov7 (not simplified)	2022-09-19 18:38:03 +08:00
Alexander Smorkalov	e1e9261450	Merge pull request #22479 from scottchou007:master Fix issues in opencv_test_dnn from conv48 kernels without bias	2022-09-16 09:05:55 +03:00
scottchou007	a3cb2020bc	Fix issues in opencv_test_dnn from conv48 kernels using uninitialized tensors when there is no bias.	2022-09-15 13:41:27 -07:00
Alexander Alekhin	65bdb3a544	dnn: eliminate GCC12 warning in total() call	2022-09-14 11:37:00 +00:00
Alexander Smorkalov	c2c8da2517	Merge pull request #22448 from Ichini24:reshape-permutations-fix changed names of permutations if Reshpe is in NHWC	2022-09-13 09:24:56 +03:00
wxsheng	4154bd0667	Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add Loongson Advanced SIMD Extension support: -DCPU_BASELINE=LASX * Add resize.lasx.cpp for Loongson SIMD acceleration * Add imgwarp.lasx.cpp for Loongson SIMD acceleration * Add LASX acceleration support for dnn/conv * Add CV_PAUSE(v) for Loongarch * Set LASX by default on Loongarch64 * LoongArch: tune test threshold for Core/HAL.mat_decomp/15 Co-authored-by: shengwenxue <shengwenxue@loongson.cn>	2022-09-10 09:39:43 +03:00
Alexander Alekhin	ca7f964104	dnn: use inheritance for OpenVINO net impl	2022-09-06 18:05:00 +00:00
anton	337452b4c0	changed names of permutations if Reshpe is in NHWC	2022-09-03 19:02:41 +02:00
Zihao Mu	b69b1eae8f	fix bug 22450	2022-09-02 16:30:06 +08:00
Alexander Smorkalov	70fb1cd603	Merge pull request #22440 from zihaomu:fix_conv_bug	2022-08-30 07:01:05 +00:00
Alexander Smorkalov	d2c48b898c	Merge pull request #22306 from zihaomu:qgemm_and_squeeze_opset13_onnximporter	2022-08-30 06:33:57 +00:00
Zihao Mu	2d837efba7	add qgemm and squeeze op13 supported on ONNXImporter	2022-08-30 09:50:29 +08:00
Alexander Smorkalov	1fd45a1b85	Merge pull request #22362 from fengyuentau:conv_asym_pad_fuse Remove asymmetric padding in Conv layer since it is supported in CPU backend	2022-08-29 17:56:17 +03:00
Zihao Mu	2cd7e17b65	replace v_add with +	2022-08-29 17:15:35 +08:00
Alexander Smorkalov	2619099fe5	Merge pull request #22337 from zihaomu:load_ONNX_fp16_as_fp32 DNN: load fp16 ONNX model as fp32	2022-08-29 09:32:25 +03:00
fengyuentau	2959286eb5	tengine: supports conv with asymmetric padding	2022-08-29 02:51:26 +00:00
Zihao Mu	9638e34ab0	reuse WORDS_BIGENDIAN.	2022-08-27 07:42:38 +08:00
Zihao Mu	bb64db98d8	Further optimization of Conv2D, fused Conv_Add_Activation, bring latest code from ficus OpConv.fx. (#22401 )	2022-08-26 12:57:25 +03:00
Zihao Mu	7eaec9dd22	load fp16 as fp32 and align fp16 and double in onnx_graph_simplifie	2022-08-26 10:04:44 +08:00
Zihao Mu	5e92bf8e41	support silu activation in darknet	2022-08-22 10:51:29 +08:00
Alexander Alekhin	2ebdc04787	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-08-14 15:50:42 +00:00
fengyuentau	0cdff46725	tune for opencl	2022-08-14 17:47:48 +08:00
Alexander Smorkalov	bb71cb200e	Merge pull request #22199 from zihaomu:bug_fix_22195 DNN: Reduce Layer (add dynamic batch and ReduceSum support)	2022-08-11 12:59:51 +03:00
fengyuentau	e7e814fa8c	remove asymmetric padding checks	2022-08-10 19:52:44 +08:00
Zihao Mu	d4640f4647	support ReduceLayer without reshape layer.	2022-08-02 10:32:31 +08:00
Zihao Mu	57545653b1	replace new mish impl with softplus	2022-07-28 13:19:06 +08:00
Zihao Mu	3c5377ca1b	add another Mish graph simplifier.	2022-07-28 11:21:29 +08:00
HAN Liutong	e2bfe0ce76	Use "#if" instead of "#ifdef" for CV_SIMD128.	2022-07-21 03:23:57 +00:00
Zihao Mu	98c33c605d	batchsize dynamic is set to index 0.	2022-07-20 19:02:16 +08:00
rogday	ed69bcae2d	Merge pull request #21865 from rogday:nary_eltwise_layers Reimplementation of Element-wise layers with broadcasting support * init * semi-working initial version * add small_vector * wip * remove smallvec * add nary function * replace auto with Mat in lambda expr used in transform * uncomment asserts * autobuffer shape_buf & step_buf * fix a missing bracket * fixed a missing addLayer in parseElementWise * solve one-dimensional broadcast * remove pre_broadcast_transform for the case of two constants; fix missing constBlobsExtraInfo when addConstant is called * one autobuffer for step & shape * temporal fix for the missing original dimension information * fix parseUnsqueeze when it gets a 1d tensor constant * support sum/mean/min/max with only one input * reuse old code to handle cases of two non-constant inputs * add condition to handle div & mul of two non-constant inputs * use \|\| instead of or * remove trainling spaces * enlarge buf in binary_forward to contain other buffer * use autobuffer in nary_forward * generate data randomly and add more cases for perf * add op and, or & xor * update perf_dnn * remove some comments * remove legacy; add two ONNX conformance tests in filter * move from cpu_denylist to all_denylist * adjust parsing for inputs>=2 Co-authored-by: fengyuentau <yuantao.feng@opencv.org.cn>	2022-07-19 06:14:05 +03:00
fengyuentau	1c7b71bf9e	define data_layout as unknown for pack	2022-07-14 19:27:20 +08:00
Zihao Mu	1b8fba8e26	support ReduceSum with two input and dynamic shape batch size in ReduceLayer.	2022-07-13 13:46:16 +08:00
Zihao Mu	45fbb67aba	fix scale layer can not handle 1x1 weight correctly.	2022-07-13 11:25:27 +08:00
Zihao Mu	139c443770	Merge pull request #22183 from zihaomu:fastConv_ARMv7_compatible DNN: ARMv7 compatible fastConv * support armv7 on fastConv * remove whitespace.	2022-07-07 13:23:08 +03:00
Zihao Mu	a80fcacd90	Merge pull request #21372 from zihaomu:dnn_quantize_per_tensor Add per_tensor_quantize to int8 quantize * add per_tensor_quantize to dnn int8 module. * change api flag from perTensor to perChannel, and recognize quantize type and onnx importer. * change the default to hpp	2022-07-05 19:14:42 +03:00
Zihao Mu	59b870a87a	Merge pull request #21910 from zihaomu:fast_conv_ARM DNN: Accelerating convolution * Fast Conv of ARM, X86 and universal intrinsics. * improve code style. * error fixed. * improve the License * optimize memory allocated and Adjust the threshold. * change FasterRCNN_vgg16 to 2GB memory.	2022-07-01 13:03:15 +03:00
Zihao Mu	ef94275eb6	bug fixed of GEMM node in ONNX_importer	2022-06-22 21:08:48 +08:00
Wanli	a6ca48a1c2	Merge pull request #22100 from WanliZhong:issue_22015 Fix issue 22015, let Clip layer support 1-3 inputs * Fix issue 22015. Let layer Clip support 1-3 inputs. * Resolve other problems caused by modifications * Update onnx_importer.cpp added extra checks to min/max handling in Clip * Add assertions to check the size of the input * Add test for clip with min and max initializers * Separate test for "clip_init_min_max". Change the check method for input_size to provide a clearer message in case of problem. * Add tests for clip with min or max initializers * Change the implementation of getting input Co-authored-by: Vadim Pisarevsky <vadim.pisarevsky@gmail.com>	2022-06-22 14:21:16 +03:00
Zihao Mu	2411b825b4	bug fixed of GEMM node in ONNX_importer	2022-06-22 15:00:17 +08:00
Alexander Alekhin	583bd1a6e2	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-06-04 19:10:35 +00:00
Namgoo Lee	24547f40ff	remove const from functions returning by value	2022-05-26 21:30:41 +09:00
Alexander Alekhin	978dc76653	Merge pull request #22006 from rogday:21947_fix	2022-05-24 19:26:02 +00:00
rogday	a2ad997e97	fix vector access in TF::sortByExecutionOrder	2022-05-24 00:05:13 +03:00
berak	50d7c61c01	Update darknet_importer.cpp make it more obvious, that this is a '404', not a 'parsing' problem	2022-05-23 19:18:31 +02:00
Alexander Alekhin	d9bf522b27	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-05-23 16:06:14 +00:00
rogday	93dc0679ec	Merge pull request #21818 from rogday:revert_renaming * add prefixes to layer names and layer output names * dnn: OPENCV_DNN_ONNX_USE_LEGACY_NAMES runtime parameter Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2022-05-23 14:50:42 +00:00
Alexander Alekhin	bb5462e327	Merge pull request #21991 from fengyuentau:qconv_asympad	2022-05-19 17:20:04 +00:00
fengyuentau	ff88132620	support asymmetric paddings for qconv	2022-05-16 19:01:37 +08:00
OpenCV Developers	d9a444ca1a	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-05-14 11:23:21 +00:00
Yulv-git	15ac54d5d6	Fix some typos in modules/.	2022-04-30 13:40:07 +08:00
Zihao Mu	64ded50bbf	parsing depth2space and space2depth of ONNX importer	2022-04-29 10:17:02 +08:00
rogday	9cd5a0a1e6	Merge pull request #21884 from rogday:cuda_cleanup Fix CUDA compilation issues and adjust thresholds. * Fix CUDA compilation issues and adjust thresholds. * add conformance tests to denylist	2022-04-19 16:40:25 +00:00
OpenCV Developers	2985739b8c	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-04-16 14:41:15 +00:00
rogday	a2b84e9897	add assert to tf graph simplifier to address security concerns	2022-04-13 22:50:27 +03:00
zihaomu	e36948cfbc	add ONNX OP sign, shrink and reciprocal	2022-04-07 15:32:12 +08:00
Alexander Alekhin	a233982931	Merge pull request #20938 from JulieBar:lstm_cuda2	2022-04-01 22:10:08 +00:00
Zihao Mu	7b582b71ba	Merge pull request #21036 from fengyuentau:timvx_backend_support dnn: TIM-VX NPU backend support * Add TimVX NPU backend for DNN module. * use official branch from tim-vx repo; fix detecting viv sdk Co-authored-by: fytao <yuantao.feng@outlook.com>	2022-03-31 21:42:11 +00:00
Smirnov Egor	abebbf04b1	Add CUDA support for LSTM. Co-authored-by: Julia Bareeva <jbareeva@gmail.com>	2022-03-31 16:38:22 +03:00
Alexander Alekhin	5e434073d4	Merge pull request #21796 from alalek:dnn_reduce_fixup_21601	2022-03-30 22:26:28 +00:00
Alexander Alekhin	6f5cf8c15f	dnn: fix ReduceLayer implementation, update OpenVINO tests	2022-03-30 20:03:41 +00:00
Alexander Alekhin	1339ebaa84	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-03-26 16:00:28 +00:00
Alexander Alekhin	c9b90884da	Merge pull request #21601 from zihaomu:add_reduceLayer	2022-03-26 10:20:10 +00:00
luz paz	8e8e4bbabc	dnn: fix various dnn related typos Fixes source comments and documentation related to dnn code.	2022-03-23 18:12:12 -04:00
Alexander Alekhin	4c79318694	dnn: fix index access	2022-03-19 06:54:07 +00:00
Zihao Mu	b6b5c27cec	Support for some reduce layers for onnx	2022-03-18 10:19:13 +08:00
Alexander Alekhin	685797f403	Merge pull request #21662 from alalek:dnn_split	2022-03-17 16:09:17 +00:00
rogday	93353aea70	Merge pull request #21522 from rogday:lstm Fix LSTM support in ONNX * fix LSTM and add peephole support * disable old tests * turn lambdas into functions * more hacks for c++98 * add assertions * slice fixes * backport of cuda-related fixes * address review comments	2022-03-15 09:14:05 +03:00
Alexander Alekhin	5bf3c1df24	Merge pull request #21715 from ilyachur:change_type_info_creation	2022-03-14 09:18:58 +00:00
Ilya Churaev	419918076e	Changed call of NodeTypeInfo constructor	2022-03-14 10:55:33 +03:00
Alexander Alekhin	a120adde63	dnn: add dnn.cpp file with information about git commits history	2022-03-08 19:22:47 +00:00
Alexander Alekhin	a80af177b6	dnn: split dnn.cpp code base commit: `19926e2979` original dnn.cpp content: `19926e2979/modules/dnn/src/dnn.cpp`	2022-03-08 19:22:46 +00:00
Tsukasa Sugiura	8db7d435b9	Merge pull request #21692 from UnaNancyOwen:add_softmax * add apply softmax option to ClassificationModel * remove default arguments of ClassificationModel::setSoftMax() * fix build for python * fix docs warning for setSoftMax() * add impl for ClassficationModel() * fix failed build for docs by trailing whitespace * move to implement classify() to ClassificationModel_Impl * move to implement softmax() to ClassificationModel_Impl * remove softmax from public method in ClassificationModel	2022-03-07 20:26:15 +00:00
Alexander Alekhin	901e0ddfe4	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-03-05 19:46:28 +00:00
Alexander Alekhin	5cc27fd3b5	Merge pull request #21542 from rogday:split_expand	2022-02-28 22:38:24 +00:00
Egor Smirnov	375fe81311	fix slice and expand	2022-02-28 17:18:07 +03:00
Yuantao Feng	f77c3574af	Merge pull request #21607 from fengyuentau:fix_FaceDetectorYN_dynamic_shape Use YuNet of fixed input shape to fix not-supported-dynamic-zero-shape for FaceDetectorYN * use yunet with input of fixed shape * update yunet used in face recognition regression	2022-02-21 13:49:07 +00:00
Alexander Alekhin	19926e2979	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-11 17:32:37 +00:00
Alexander Alekhin	effce0573b	dnn: drop legacy Inference Engine NN builder API	2022-02-10 11:55:24 +00:00
Alexander Alekhin	57d3002ee1	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-02-06 16:10:43 +00:00
Alexander Alekhin	1da48beeec	dnn(ngraph): fix output names	2022-02-06 13:08:53 +00:00
Alexander Alekhin	b57ff73086	dnn(ngraph): fix outputs handling, drop 'unconnected' logic	2022-02-06 13:08:53 +00:00
Alexander Alekhin	67978b5746	dnn(ngraph): add debuging messages	2022-02-06 13:08:53 +00:00
Alexander Alekhin	062f305d1a	dnn: don't fuse 'outputs' with OpenVINO backend	2022-02-06 13:08:53 +00:00
Alexander Alekhin	aa5bc20c83	dnn(ngraph): fixup get_output_as_single_output_node() replacement patch	2022-02-06 10:35:59 +00:00
Alexander Alekhin	d573472a86	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-31 12:53:45 +00:00
Alexander Alekhin	85719a0a5d	dnn: support outputs registration under new names - fixed ONNX importer	2022-01-29 23:29:51 +00:00
Alexander Alekhin	dc35633aa4	Merge pull request #21521 from alalek:dnn_ignore_denormals	2022-01-28 15:31:44 +00:00
Zihao Mu	9e3ba487fa	Merge pull request #21518 from zihaomu:resize_onnx_opset13 Add resize layer compatible with ONNX opset13 version	2022-01-28 17:55:01 +03:00
Alexander Alekhin	9188ce68aa	Merge pull request #21490 from rogday:optional_outputs	2022-01-26 15:18:07 +00:00
Alexander Alekhin	70b0274c8e	dnn: apply hint to ignore denormals processing	2022-01-26 11:28:35 +00:00
Alexander Alekhin	b796ededae	Merge pull request #21437 from alalek:dnn_api_explicit_const_4.x	2022-01-21 20:19:50 +00:00
Alexander Alekhin	eb7b45d26b	dnn: fix API - explicit ctors, const methods	2022-01-21 12:38:51 +00:00
Smirnov Egor	17b2d92a3d	add optional outputs support and fix graph links	2022-01-21 12:31:46 +03:00
Alexander Alekhin	6ffa2b01e1	Merge pull request #21357 from rogday:model_diag	2022-01-18 15:50:11 +00:00
rogday	0fe7420638	fix model diagnostic tool	2022-01-18 01:22:22 +03:00
Alexander Alekhin	b304730225	dnn: fix API - explicit ctors, const methods	2022-01-17 21:45:29 +00:00
Maksim Shabunin	d5f73f89d8	Fixed issues found by static analysis	2022-01-13 14:51:25 +03:00
Alexander Alekhin	aebb65e983	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2022-01-12 13:26:10 +00:00
Alexander Alekhin	80d9f624d0	dnn: don't use aligned load without alignment checks - weights are unaligned in dasiamprn sample (comes from numpy)	2022-01-12 05:11:18 +00:00
Alexander Alekhin	76fb3652fc	dnn(ocl): fix fp16 kernel compilation	2021-12-29 19:58:25 +00:00
Alexander Alekhin	9699e2b483	dnn(onnx): handle non-default ONNX domains - re-enable quantized models tests	2021-12-25 01:38:52 +00:00
Alexander Alekhin	217fea9667	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-24 16:48:07 +00:00
Alexander Alekhin	cdd4354256	Merge pull request #21336 from alalek:dnn_pooling_check_array_indexes	2021-12-24 08:35:11 +00:00
Alexander Alekhin	6385511e88	dnn: add checks in pooling layer implementation - to avoid out of buffer access	2021-12-24 00:15:30 +00:00
Alexander Alekhin	ed4becf007	dnn(onnx): debug dump of inputs/outputs/initializers in importer	2021-12-23 21:11:40 +00:00
Alexander Alekhin	f5589445b9	Merge pull request #21322 from alalek:dnn_catch_errors	2021-12-23 20:09:22 +00:00
Alexander Alekhin	88a18c8b6a	dnn(onnx): emit error in Shape for dynamic input	2021-12-23 15:42:59 +00:00
Alexander Alekhin	51e65db715	dnn(onnx): fix Resize inputs handling	2021-12-23 15:42:59 +00:00
Alexander Alekhin	cc02fcd889	dnn: improve debug messages, add ONNX opset version	2021-12-23 15:42:59 +00:00
Alexander Alekhin	c408157a4d	dnn: do not try to rebuilt network during setInput() - this doesn't make sense in case of multiple inputs	2021-12-23 02:40:33 +00:00
Alexander Alekhin	9777fbacf6	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-22 15:57:02 +00:00
rogday	0a178a687a	fix const/x in Div	2021-12-20 19:53:37 +03:00
Alexander Alekhin	80492d663e	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-18 16:19:06 +00:00
Smirnov Egor	71a22e45b0	add celu, hardsigmoid, selu, thresholdedrelu layers	2021-12-18 03:19:54 +03:00
Smirnov Egor	1bd382c1d0	Add acos, acosh, asin, asinh, atan, atanh, cos, cosh, erf, hardswish, sin, sinh, softplus, softsign, tan layers	2021-12-17 18:19:40 +03:00
Smirnov Egor	fec2c7e715	fix Flatten layer	2021-12-17 16:29:56 +03:00
Alexander Alekhin	622b9d9276	Merge pull request #21267 from mshabunin:fix-kw-2021-12	2021-12-16 18:51:47 +00:00
Gruhuang	b4bb98ea60	Merge pull request #21268 from pccvlab:tf_Arg add argmax and argmin parsing for tensorflow * add argmax and argmin for tf * remove whitespace * remove whitespace * remove static_cast Signed-off-by: Crayon-new <1349159541@qq.com>	2021-12-16 17:06:02 +00:00
Maksim Shabunin	792b7e0629	(3.4) Fixed several issues found by static analysis original commit: `a079c2eb7c`	2021-12-16 17:02:58 +00:00
Maksim Shabunin	a079c2eb7c	Fixed several issues found by static analysis	2021-12-16 19:21:25 +03:00
Alexander Alekhin	6d677bbd63	dnn(test): update ONNX conformance filters (4.x)	2021-12-16 12:09:31 +00:00
Alexander Alekhin	299f9837b7	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-15 16:38:56 +00:00
Smirnov Egor	e97c7e042b	fix max_unpool missing attributes, add default value of keepdims in reducemean/max/sum, add support for keepdims=true in full reduction branch, add new padding type to Pad	2021-12-14 22:09:27 +03:00
rogday	4827fe86bb	Merge pull request #21088 from rogday:onnx_tests Onnx conformance tests * Add ONNX conformance tests * dnn(test): add filters for ONNX conformance tests * add filter lists for OCV backend * address review comments * move test_clip_inbounds to all_denylist * address clip issue * avoid empty lists Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	2021-12-14 16:58:06 +00:00
cqn2219076254	252ce0b581	add square layer	2021-12-13 21:43:13 +08:00
Alexander Alekhin	6e50e4b9ee	Merge pull request #21161 from rogday:elu_alpha_4x	2021-12-10 16:04:01 +00:00
HAN Liutong	1599f9f0c0	Merge pull request #21086 from hanliutong:rvv-dnn Further optimize DNN for RISC-V Vector. * Optimize DNN on RVV by using vsetvl. * Rename vl. * Update fastConv by using setvl instead of mask. * Fix fastDepthwiseConv	2021-12-10 16:03:22 +00:00
Gruhuang	17bc8565f6	Merge pull request #21154 from pccvlab:MatMul_with_two_inputs Add BatchMatMul layer support for tf_importer * two inputs * support batch_matmul * refactor: remove useless code * refactor: decrease nesting	2021-12-10 14:44:27 +03:00
Smirnov Egor	e608adea60	add ArgMax and ArgMin layers	2021-12-06 20:49:54 +03:00
HAN Liutong	4935b14539	Merge pull request #21012 from hanliutong:rvv_clang Update RVV backend for using Clang. * Update cmake file of clang. * Modify the RVV optimization on DNN to adapt to clang. * Modify intrin_rvv: Disable some existing types. * Modify intrin_rvv: Reinterpret instead of load&cast. * Modify intrin_rvv: Update load&store without cast. * Modify intrin_rvv: Rename vfredsum to fredosum. * Modify intrin_rvv: Rewrite Check all/any by using vpopc. * Modify intrin_rvv: Use reinterpret instead of c-style casting. * Remove all macros which is not used in v_reinterpret * Rename vpopc to vcpop according to spec.	2021-12-03 15:13:24 +00:00
Alexander Alekhin	8b4fa2605e	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-12-03 12:32:49 +00:00
Alexander Alekhin	35ff9af6ce	Merge pull request #21162 from rogday:softmax_simplification	2021-12-02 17:14:48 +00:00
Alexander Alekhin	dad2b9aac8	Merge pull request #21160 from rogday:elu_alpha	2021-12-02 17:13:57 +00:00
rogday	1613d30544	Merge pull request #21159 from rogday:ceil_mode fix ceil_mode for Average/MaxPooling * fix ceil_mode * add a comment	2021-12-02 20:11:11 +03:00
Alexander Alekhin	5da69c0b9a	Merge pull request #21164 from rogday:sum_identity	2021-12-01 22:49:02 +00:00
Alexander Alekhin	a806e8cc58	Merge pull request #21163 from rogday:transpose_default	2021-12-01 22:47:57 +00:00
Smirnov Egor	33e97e994d	add sum of 1 input	2021-11-30 15:42:20 +03:00
Smirnov Egor	11e6848bb9	add default order to transpose	2021-11-30 15:34:34 +03:00
Smirnov Egor	829410729c	add new (Log)SoftMax simplification passes	2021-11-30 15:20:52 +03:00
Smirnov Egor	4995aecd62	add alpha parameter to ELU	2021-11-30 14:43:18 +03:00
Smirnov Egor	0e2a3686c0	add alpha parameter to ELU layer	2021-11-30 12:20:35 +03:00
Alexander Alekhin	0d2857a242	Merge pull request #21152 from rogday:fix_defaults	2021-11-29 22:39:27 +00:00
Alexander Alekhin	17d99e6266	Merge pull request #21142 from alalek:dnn_two_inputs_ocl_fp16_3.4	2021-11-29 21:44:59 +00:00
Andrew Ryrie	ea7d4be3f8	Merge pull request #20658 from smbz:lstm_optimisation * dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load	2021-11-29 21:43:00 +00:00
Smirnov Egor	05db8784ae	fix Clip, LeakyReLU, LRN, Split defaults	2021-11-29 20:20:34 +03:00
Supernovae	b594ed99b8	Merge pull request #20933 from shubham-shahh:master Improved overall readability of the code * grid_nms.cu: minor fix-ups * Update grid_stride_range.hpp * Update tf_importer.cpp	2021-11-28 12:54:29 +00:00
Alexander Alekhin	58b06222ff	dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling	2021-11-28 07:44:05 +00:00
yuki takehara	a6277370ca	Merge pull request #21107 from take1014:remove_assert_21038 resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp	2021-11-27 18:34:52 +00:00
Hanxi Guo	1fcf7ba5bc	Merge pull request #20406 from MarkGHX:gsoc_2021_webnn [GSoC] OpenCV.js: Accelerate OpenCV.js DNN via WebNN * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Fix the build issue * Update concat_layer.cpp Still have some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Delete bib19450.aux * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Update dnn.cpp * Fix Error in dnn.cpp * Resolve duplication in conditions in convolution_layer.cpp * Fixed the issues in the comments * Fix building issue * Update tutorial * Fixed comments * Address the comments * Update CMakeLists.txt * Offer more accurate perf test on native * Add better perf tests for both native and web * Modify per tests for better results * Use more latest version of Electron * Support latest WebNN Clamp op * Add definition of HAVE_WEBNN macro * Support group convolution * Implement Scale_layer using WebNN * Add Softmax option for native classification example * Fix comments * Fix comments	2021-11-23 21:15:31 +00:00
Alexander Alekhin	394e640909	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-11-13 15:11:30 +00:00
Alexander Alekhin	8041ab8a61	Merge pull request #21025 from alalek:issue_21004 * dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16	2021-11-12 01:54:07 +03:00
ZaKiiiiiiiii	98b6ce353c	Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names	2021-11-09 19:24:04 +03:00
Alexander Alekhin	7842181b47	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-11-05 09:27:46 +00:00
Alexander Alekhin	d484939c02	Merge pull request #20999 from alalek:dnn_replace_deprecated_calls dnn(protobuf): replace deprecated calls * dnn: replace deprecated ByteSize() => ByteSizeLong() * dnn: replace deprecated calls, use GetRepeatedFieldRef	2021-11-03 15:59:36 +00:00
Alexander Alekhin	ec10f2e72b	Merge pull request #20877 from rogday:simple_layers	2021-10-20 17:00:38 +00:00
rogday	b3f966e2ca	Merge pull request #20883 from rogday:eltwise_refactoring * backport elementwise_layers refactor * keep NULL	2021-10-19 13:29:22 +00:00
Alexander Alekhin	1926e919be	dnn(int8): fix using of incorrect UMat constructor	2021-10-18 04:46:00 +00:00
Alexander Alekhin	31c40fa4cc	Merge remote-tracking branch 'upstream/3.4' into merge-3.4	2021-10-15 13:35:03 +00:00
Smirnov Egor	1feb3838b5	add Ceil, Floor, Log, Round, Sqrt, Not, Equal, Less, Greater	2021-10-15 16:02:46 +03:00
Alexander Alekhin	53d6c9b9c0	Merge pull request #20860 from rogday:sum_fix	2021-10-12 15:36:32 +00:00
Smirnov Egor	238dbffb48	change asserts for Sum	2021-10-11 20:59:44 +03:00
Smirnov Egor	a9d7b6eab7	fix const - input and remove unimplemented function	2021-10-11 18:58:10 +03:00

... 2 3 4 5 6 ...

1739 Commits