mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2025-07-21 03:26:15 +08:00
Updated Command Line Usage (markdown)
parent
b16a625b4a
commit
2cf28035b8
@ -54,9 +54,10 @@ ____________________________________
|
|||||||
Following examples use this image which has text in multiple languages.
|
Following examples use this image which has text in multiple languages.
|
||||||
|
|
||||||

|

|
||||||
|
|
||||||
## Using One Language
|
## Using One Language
|
||||||
|
|
||||||
|
Add '-l LANG' to the command where LANG is three character language code from the list of supported languages. If this is not given then English language is assumed by default.
|
||||||
|
|
||||||
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng
|
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng
|
||||||
|
|
||||||
Output
|
Output
|
||||||
@ -76,6 +77,8 @@ Output
|
|||||||
|
|
||||||
## Using Multiple Languages
|
## Using Multiple Languages
|
||||||
|
|
||||||
|
Add '-l LANG[+LANG]' to the command line to use multiple languages together for recognition
|
||||||
|
|
||||||
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-engdeu -l eng+deu
|
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-engdeu -l eng+deu
|
||||||
|
|
||||||
Output
|
Output
|
||||||
@ -103,6 +106,8 @@ This creates a pdf with the image and a separate searchable text layer with the
|
|||||||
|
|
||||||
## HOCR output
|
## HOCR output
|
||||||
|
|
||||||
|
Use 'hocr' config file by adding hocr at the end of the command to get the HOCR output.
|
||||||
|
|
||||||
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng hocr
|
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng hocr
|
||||||
|
|
||||||
Output
|
Output
|
||||||
@ -154,6 +159,8 @@ Output
|
|||||||
|
|
||||||
## TSV output (Currently available in 3.05-dev in master branch on github)
|
## TSV output (Currently available in 3.05-dev in master branch on github)
|
||||||
|
|
||||||
|
Use 'tsv' config file by adding tsv at the end of the command to get the TSV output.
|
||||||
|
|
||||||
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng tsv
|
tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng tsv
|
||||||
|
|
||||||
Output
|
Output
|
||||||
|
Loading…
Reference in New Issue
Block a user