2011-09-30 17:54:31 +08:00
Latent SVM
===============================================================
Discriminatively Trained Part Based Models for Object Detection
---------------------------------------------------------------
The object detector described below has been initially proposed by
P.F. Felzenszwalb in [Felzenszwalb2010]_ . It is based on a
Dalal-Triggs detector that uses a single filter on histogram of
oriented gradients (HOG) features to represent an object category.
This detector uses a sliding window approach, where a filter is
applied at all positions and scales of an image. The first
innovation is enriching the Dalal-Triggs model using a
star-structured part-based model defined by a "root" filter
(analogous to the Dalal-Triggs filter) plus a set of parts filters
and associated deformation models. The score of one of star models
at a particular position and scale within an image is the score of
the root filter at the given location plus the sum over parts of the
maximum, over placements of that part, of the part filter score on
its location minus a deformation cost easuring the deviation of the
part from its ideal location relative to the root. Both root and
part filter scores are defined by the dot product between a filter
(a set of weights) and a subwindow of a feature pyramid computed
from the input image. Another improvement is a representation of the
class of models by a mixture of star models. The score of a mixture
model at a particular position and scale is the maximum over
components, of the score of that component model at the given
location.
2012-05-28 15:36:14 +08:00
In OpenCV there are C implementation of Latent SVM and C++ wrapper of it.
C version is the structure :ocv:struct: `CvObjectDetection` and a set of functions
working with this structure (see :ocv:func: `cvLoadLatentSvmDetector` ,
2011-10-04 20:10:57 +08:00
:ocv:func: `cvReleaseLatentSvmDetector` , :ocv:func: `cvLatentSvmDetectObjects` ).
2012-05-28 15:36:14 +08:00
C++ version is the class :ocv:class: `LatentSvmDetector` and has slightly different
functionality in contrast with C version - it supports loading and detection
of several models.
2011-10-04 20:10:57 +08:00
There are two examples of Latent SVM usage: `` samples/c/latentsvmdetect.cpp ``
and `` samples/cpp/latentsvm_multidetect.cpp `` .
.. highlight :: c
2011-09-30 17:54:31 +08:00
CvLSVMFilterPosition
--------------------
.. ocv:struct :: CvLSVMFilterPosition
2012-05-30 00:55:46 +08:00
Structure describes the position of the filter in the feature pyramid.
2011-09-30 17:54:31 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: unsigned int l
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
level in the feature pyramid
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: unsigned int x
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
x-coordinate in level l
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: unsigned int y
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
y-coordinate in level l
2012-05-28 15:36:14 +08:00
2011-09-30 17:54:31 +08:00
CvLSVMFilterObject
------------------
.. ocv:struct :: CvLSVMFilterObject
2012-05-30 00:55:46 +08:00
Description of the filter, which corresponds to the part of the object.
2011-09-30 17:54:31 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: CvLSVMFilterPosition V
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
ideal (penalty = 0) position of the partial filter
from the root filter position (V_i in the paper)
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float fineFunction[4]
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
vector describes penalty function (d_i in the paper)
pf[0] * x + pf[1] * y + pf[2] * x^2 + pf[3] * y^2
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int sizeX
.. ocv:member :: int sizeY
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
Rectangular map (sizeX x sizeY),
every cell stores feature vector (dimension = p)
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int numFeatures
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
number of features
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float *H
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
matrix of feature vectors to set and get
feature vectors (i,j) used formula H[(j * sizeX + i) * p + k],
where k - component of feature vector in cell (i, j)
2012-05-28 15:36:14 +08:00
2011-09-30 17:54:31 +08:00
CvLatentSvmDetector
-------------------
.. ocv:struct :: CvLatentSvmDetector
2012-05-30 00:55:46 +08:00
Structure contains internal representation of trained Latent SVM detector.
2011-09-30 17:54:31 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int num_filters
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
total number of filters (root plus part) in model
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int num_components
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
number of components in model
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int* num_part_filters
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
array containing number of part filters for each component
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: CvLSVMFilterObject** filters
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
root and part filters for all model components
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float* b
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
biases for all model components
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float score_threshold
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
confidence level threshold
2012-05-28 15:36:14 +08:00
2011-09-30 17:54:31 +08:00
CvObjectDetection
-----------------
.. ocv:struct :: CvObjectDetection
2012-05-30 00:55:46 +08:00
Structure contains the bounding box and confidence level for detected object.
2011-09-30 17:54:31 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: CvRect rect
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
bounding box for a detected object
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float score
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
confidence level
2011-09-30 17:54:31 +08:00
cvLoadLatentSvmDetector
-----------------------
Loads trained detector from a file.
.. ocv:function :: CvLatentSvmDetector* cvLoadLatentSvmDetector(const char* filename)
:param filename: Name of the file containing the description of a trained detector
2012-05-28 15:36:14 +08:00
2011-09-30 17:54:31 +08:00
cvReleaseLatentSvmDetector
--------------------------
Release memory allocated for CvLatentSvmDetector structure.
.. ocv:function :: void cvReleaseLatentSvmDetector(CvLatentSvmDetector** detector)
:param detector: CvLatentSvmDetector structure to be released
cvLatentSvmDetectObjects
------------------------
2012-05-28 15:36:14 +08:00
Find rectangular regions in the given image that are likely to contain objects
2011-09-30 17:54:31 +08:00
and corresponding confidence levels.
2012-05-28 15:36:14 +08:00
.. ocv:function :: CvSeq* cvLatentSvmDetectObjects( IplImage* image, CvLatentSvmDetector* detector, CvMemStorage* storage, float overlap_threshold=0.5f, int numThreads=-1 )
:param image: image
2011-09-30 17:54:31 +08:00
:param detector: LatentSVM detector in internal representation
:param storage: Memory storage to store the resultant sequence of the object candidate rectangles
:param overlap_threshold: Threshold for the non-maximum suppression algorithm
:param numThreads: Number of threads used in parallel version of the algorithm
2012-05-28 15:36:14 +08:00
2011-10-04 20:10:57 +08:00
.. highlight :: cpp
LatentSvmDetector
-----------------
.. ocv:class :: LatentSvmDetector
2012-05-28 15:36:14 +08:00
This is a C++ wrapping class of Latent SVM. It contains internal representation of several
trained Latent SVM detectors (models) and a set of methods to load the detectors and detect objects
2011-10-04 20:10:57 +08:00
using them.
LatentSvmDetector::ObjectDetection
----------------------------------
2012-05-28 15:36:14 +08:00
.. ocv:struct :: LatentSvmDetector::ObjectDetection
2011-10-04 20:10:57 +08:00
2012-05-30 00:55:46 +08:00
Structure contains the detection information.
2011-10-04 20:10:57 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: Rect rect
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
bounding box for a detected object
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: float score
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
confidence level
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
.. ocv:member :: int classID
2012-05-28 15:36:14 +08:00
2012-05-30 00:55:46 +08:00
class (model or detector) ID that detect an object
2012-05-28 15:36:14 +08:00
2011-10-04 20:10:57 +08:00
LatentSvmDetector::LatentSvmDetector
------------------------------------
Two types of constructors.
.. ocv:function :: LatentSvmDetector::LatentSvmDetector()
2013-03-22 23:31:36 +08:00
.. ocv:function :: LatentSvmDetector::LatentSvmDetector(const vector<String>& filenames, const vector<String>& classNames=vector<String>())
2011-10-04 20:10:57 +08:00
2012-05-28 15:36:14 +08:00
:param filenames: A set of filenames storing the trained detectors (models). Each file contains one model. See examples of such files here /opencv_extra/testdata/cv/latentsvmdetector/models_VOC2007/.
2011-10-04 20:10:57 +08:00
:param classNames: A set of trained models names. If it's empty then the name of each model will be constructed from the name of file containing the model. E.g. the model stored in "/home/user/cat.xml" will get the name "cat".
LatentSvmDetector::~LatentSvmDetector
2012-03-03 04:59:13 +08:00
-------------------------------------
2011-10-04 20:10:57 +08:00
Destructor.
.. ocv:function :: LatentSvmDetector::~LatentSvmDetector()
LatentSvmDetector::~clear
-------------------------
Clear all trained models and their names stored in an class object.
.. ocv:function :: void LatentSvmDetector::clear()
LatentSvmDetector::load
-----------------------
Load the trained models from given `` .xml `` files and return `` true `` if at least one model was loaded.
2013-03-22 23:31:36 +08:00
.. ocv:function :: bool LatentSvmDetector::load( const vector<String>& filenames, const vector<String>& classNames=vector<String>() )
2012-05-28 15:36:14 +08:00
:param filenames: A set of filenames storing the trained detectors (models). Each file contains one model. See examples of such files here /opencv_extra/testdata/cv/latentsvmdetector/models_VOC2007/.
2011-10-04 20:10:57 +08:00
:param classNames: A set of trained models names. If it's empty then the name of each model will be constructed from the name of file containing the model. E.g. the model stored in "/home/user/cat.xml" will get the name "cat".
LatentSvmDetector::detect
-------------------------
Find rectangular regions in the given image that are likely to contain objects of loaded classes (models)
and corresponding confidence levels.
2012-05-28 15:36:14 +08:00
.. ocv:function :: void LatentSvmDetector::detect( const Mat& image, vector<ObjectDetection>& objectDetections, float overlapThreshold=0.5f, int numThreads=-1 )
2011-10-04 20:10:57 +08:00
:param image: An image.
:param objectDetections: The detections: rectangulars, scores and class IDs.
:param overlapThreshold: Threshold for the non-maximum suppression algorithm.
:param numThreads: Number of threads used in parallel version of the algorithm.
2012-05-28 15:36:14 +08:00
2011-10-04 20:10:57 +08:00
LatentSvmDetector::getClassNames
--------------------------------
2012-04-14 05:50:59 +08:00
Return the class (model) names that were passed in constructor or method `` load `` or extracted from models filenames in those methods.
2011-10-04 20:10:57 +08:00
2013-03-22 23:31:36 +08:00
.. ocv:function :: const vector<String>& LatentSvmDetector::getClassNames() const
2011-10-04 20:10:57 +08:00
LatentSvmDetector::getClassCount
--------------------------------
Return a count of loaded models (classes).
2012-05-29 04:11:38 +08:00
.. ocv:function :: size_t LatentSvmDetector::getClassCount() const
2012-05-28 15:36:14 +08:00
.. [Felzenszwalb2010] Felzenszwalb, P. F. and Girshick, R. B. and McAllester, D. and Ramanan, D. *Object Detection with Discriminatively Trained Part Based Models* . PAMI, vol. 32, no. 9, pp. 1627-1645, September 2010
2011-09-30 17:54:31 +08:00