diff --git a/Command-Line-Usage.md b/Command-Line-Usage.md index d6d95b4..361e089 100644 --- a/Command-Line-Usage.md +++ b/Command-Line-Usage.md @@ -54,9 +54,10 @@ ____________________________________ Following examples use this image which has text in multiple languages. ![eurotext.png](http://dev.blog.fairway.ne.jp/wp-content/uploads/2014/04/eurotext.png) - ## Using One Language +Add '-l LANG' to the command where LANG is three character language code from the list of supported languages. If this is not given then English language is assumed by default. + tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng Output @@ -76,6 +77,8 @@ Output ## Using Multiple Languages +Add '-l LANG[+LANG]' to the command line to use multiple languages together for recognition + tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-engdeu -l eng+deu Output @@ -103,6 +106,8 @@ This creates a pdf with the image and a separate searchable text layer with the ## HOCR output +Use 'hocr' config file by adding hocr at the end of the command to get the HOCR output. + tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng hocr Output @@ -154,6 +159,8 @@ Output ## TSV output (Currently available in 3.05-dev in master branch on github) +Use 'tsv' config file by adding tsv at the end of the command to get the TSV output. + tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng tsv Output