opencv/doc/tutorials/dnn/dnn_googlenet/dnn_googlenet.markdown
Gursimar Singh 35eba9ca90
Merge pull request #25519 from gursimarsingh:improved_classification_sample
Improved classification sample #25519

#25006 #25314

This pull requests replaces the caffe model for classification with onnx versions. It also adds resnet in model.yml. 

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [x] There is a reference to the original bug report and related work
- [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
      Patch to opencv_extra has the same branch name.
- [x] The feature is well documented and sample code can be built with the project CMake
2024-08-06 09:16:11 +03:00

2.7 KiB

Load ONNX framework models

@tableofcontents

@next_tutorial{tutorial_dnn_openvino}

Original author Vitaliy Lyudvichenko
Compatibility OpenCV >= 4.5.4

Introduction

In this tutorial you will learn how to use opencv_dnn module for image classification by using GoogLeNet trained network from ONNX model zoo.

We will demonstrate results of this example on the following picture. Buran space shuttle

Source Code

We will be using snippets from the example application, that can be downloaded here.

@include dnn/classification.cpp

Explanation

-# Firstly, download GoogLeNet model files: @code python download_models.py googlenet @endcode

Also you need file with names of ILSVRC2012 classes: classification_classes_ILSVRC2012.txt.

Put these files into working dir of this program example.

-# Read and initialize network using path to .onnx file @snippet dnn/classification.cpp Read and initialize network

-# Read input image and convert to the blob, acceptable by GoogleNet @snippet dnn/classification.cpp Open a video file or an image file or a camera stream

cv::VideoCapture can load both images and videos.

@snippet dnn/classification.cpp Create a 4D blob from a frame We convert the image to a 4-dimensional blob (so-called batch) with 1x3x224x224 shape after applying necessary pre-processing like resizing and mean subtraction for each blue, green and red channels correspondingly using cv::dnn::blobFromImage function.

-# Pass the blob to the network @snippet dnn/classification.cpp Set input blob

-# Make forward pass @snippet dnn/classification.cpp Make forward pass During the forward pass output of each network layer is computed, but in this example we need output from the last layer only.

-# Determine the best class @snippet dnn/classification.cpp Get a class with a highest score We put the output of network, which contain probabilities for each of 1000 ILSVRC2012 image classes, to the prob blob. And find the index of element with maximal value in this one. This index corresponds to the class of the image.

-# Run an example from command line @code ./example_dnn_classification googlenet @endcode For our image we get prediction of class space shuttle with more than 99% sureness.