opencv/doc/tutorials/dnn/dnn_googlenet/dnn_googlenet.markdown

Load Caffe framework models  {#tutorial_dnn_googlenet}
===========================

Introduction
------------

In this tutorial you will learn how to use opencv_dnn module for image classification by using
GoogLeNet trained network from [Caffe model zoo](http://caffe.berkeleyvision.org/model_zoo.html).

We will demonstrate results of this example on the following picture.
![Buran space shuttle](images/space_shuttle.jpg)

Source Code
-----------

We will be using snippets from the example application, that can be downloaded [here](https://github.com/opencv/opencv/blob/master/samples/dnn/classification.cpp).

@include dnn/classification.cpp

Explanation
-----------

-# Firstly, download GoogLeNet model files:
   [bvlc_googlenet.prototxt  ](https://github.com/opencv/opencv_extra/blob/master/testdata/dnn/bvlc_googlenet.prototxt) and
   [bvlc_googlenet.caffemodel](http://dl.caffe.berkeleyvision.org/bvlc_googlenet.caffemodel)

   Also you need file with names of [ILSVRC2012](http://image-net.org/challenges/LSVRC/2012/browse-synsets) classes:
   [classification_classes_ILSVRC2012.txt](https://github.com/opencv/opencv/blob/master/samples/data/dnn/classification_classes_ILSVRC2012.txt).

   Put these files into working dir of this program example.

-# Read and initialize network using path to .prototxt and .caffemodel files
   @snippet dnn/classification.cpp Read and initialize network

   You can skip an argument `framework` if one of the files `model` or `config` has an
   extension `.caffemodel` or `.prototxt`.
   This way function cv::dnn::readNet can automatically detects a model's format.

-# Read input image and convert to the blob, acceptable by GoogleNet
   @snippet dnn/classification.cpp Open a video file or an image file or a camera stream

   cv::VideoCapture can load both images and videos.

   @snippet dnn/classification.cpp Create a 4D blob from a frame
   We convert the image to a 4-dimensional blob (so-called batch) with `1x3x224x224` shape
   after applying necessary pre-processing like resizing and mean subtraction
   `(-104, -117, -123)` for each blue, green and red channels correspondingly using cv::dnn::blobFromImage function.

-# Pass the blob to the network
   @snippet dnn/classification.cpp Set input blob

-# Make forward pass
   @snippet dnn/classification.cpp Make forward pass
   During the forward pass output of each network layer is computed, but in this example we need output from the last layer only.

-# Determine the best class
   @snippet dnn/classification.cpp Get a class with a highest score
   We put the output of network, which contain probabilities for each of 1000 ILSVRC2012 image classes, to the `prob` blob.
   And find the index of element with maximal value in this one. This index corresponds to the class of the image.

-# Run an example from command line
   @code
   ./example_dnn_classification --model=bvlc_googlenet.caffemodel --config=bvlc_googlenet.prototxt --width=224 --height=224 --classes=classification_classes_ILSVRC2012.txt --input=space_shuttle.jpg --mean="104 117 123"
   @endcode
   For our image we get prediction of class `space shuttle` with more than 99% sureness.
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00			`Load Caffe framework models {#tutorial_dnn_googlenet}`
			`===========================`

			`Introduction`
			`------------`

			`In this tutorial you will learn how to use opencv_dnn module for image classification by using`
			`GoogLeNet trained network from [Caffe model zoo](http://caffe.berkeleyvision.org/model_zoo.html).`

			`We will demonstrate results of this example on the following picture.`
			`![Buran space shuttle](images/space_shuttle.jpg)`

			`Source Code`
			`-----------`

Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`We will be using snippets from the example application, that can be downloaded [here](https://github.com/opencv/opencv/blob/master/samples/dnn/classification.cpp).`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`@include dnn/classification.cpp`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
			`Explanation`
			`-----------`

			`-# Firstly, download GoogLeNet model files:`
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`[bvlc_googlenet.prototxt ](https://github.com/opencv/opencv_extra/blob/master/testdata/dnn/bvlc_googlenet.prototxt) and`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00			`[bvlc_googlenet.caffemodel](http://dl.caffe.berkeleyvision.org/bvlc_googlenet.caffemodel)`

			`Also you need file with names of [ILSVRC2012](http://image-net.org/challenges/LSVRC/2012/browse-synsets) classes:`
Merge remote-tracking branch 'upstream/3.4' into merge-3.4 2019-03-30 03:21:47 +08:00			`[classification_classes_ILSVRC2012.txt](https://github.com/opencv/opencv/blob/master/samples/data/dnn/classification_classes_ILSVRC2012.txt).`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
			`Put these files into working dir of this program example.`

			`-# Read and initialize network using path to .prototxt and .caffemodel files`
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`@snippet dnn/classification.cpp Read and initialize network`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			You can skip an argument `framework` if one of the files `model` or `config` has an
			extension `.caffemodel` or `.prototxt`.
			`This way function cv::dnn::readNet can automatically detects a model's format.`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
			`-# Read input image and convert to the blob, acceptable by GoogleNet`
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`@snippet dnn/classification.cpp Open a video file or an image file or a camera stream`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`cv::VideoCapture can load both images and videos.`

			`@snippet dnn/classification.cpp Create a 4D blob from a frame`
			We convert the image to a 4-dimensional blob (so-called batch) with `1x3x224x224` shape
			`after applying necessary pre-processing like resizing and mean subtraction`
			`(-104, -117, -123)` for each blue, green and red channels correspondingly using cv::dnn::blobFromImage function.
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`-# Pass the blob to the network`
			`@snippet dnn/classification.cpp Set input blob`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
			`-# Make forward pass`
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`@snippet dnn/classification.cpp Make forward pass`
			`During the forward pass output of each network layer is computed, but in this example we need output from the last layer only.`
dnn: move module from opencv_contrib https://github.com/opencv/opencv_contrib/tree/e6f63c7a38ca40c5dc33e38736e3027e3528d6cb/modules/dnn 2017-06-26 18:35:51 +08:00
			`-# Determine the best class`
Update tutorials. A new cv::dnn::readNet function 2018-03-04 00:29:37 +08:00			`@snippet dnn/classification.cpp Get a class with a highest score`
			We put the output of network, which contain probabilities for each of 1000 ILSVRC2012 image classes, to the `prob` blob.
			`And find the index of element with maximal value in this one. This index corresponds to the class of the image.`

			`-# Run an example from command line`
			`@code`
			`./example_dnn_classification --model=bvlc_googlenet.caffemodel --config=bvlc_googlenet.prototxt --width=224 --height=224 --classes=classification_classes_ILSVRC2012.txt --input=space_shuttle.jpg --mean="104 117 123"`
			`@endcode`
			For our image we get prediction of class `space shuttle` with more than 99% sureness.