mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2025-07-21 03:26:15 +08:00
Updated Command Line Usage (markdown)
parent
b16a625b4a
commit
2cf28035b8
@ -54,9 +54,10 @@ ____________________________________
The following examples use this image, which contains text in multiple languages.

## Using One Language

Add '-l LANG' to the command, where LANG is the three-character language code from the list of supported languages. If it is omitted, English is assumed by default.

tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng

Output
@ -76,6 +77,8 @@ Output
## Using Multiple Languages

Add '-l LANG[+LANG]' to the command line to use multiple languages together for recognition.

tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-engdeu -l eng+deu

Output
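The value given to '-l' for multiple languages is the language codes joined with `+`. As a minimal sketch, assuming a hypothetical helper named `join_langs` (it is not part of Tesseract), a script could build that value from a list of codes:

```shell
# join_langs is a hypothetical helper, not part of Tesseract.
# It joins its arguments with '+' to form the value passed to -l.
join_langs() {
  # Run in a subshell so the IFS change does not leak to the caller.
  ( IFS='+'; printf '%s\n' "$*" )
}

# Example use (requires tesseract and the matching language data files):
#   tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-engdeu -l "$(join_langs eng deu)"
```

Keeping the join in one place may help when the list of languages comes from a configuration file or another script.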
@ -103,6 +106,8 @@ This creates a pdf with the image and a separate searchable text layer with the
## HOCR output

Use the 'hocr' config file by adding hocr at the end of the command to get HOCR output.

tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng hocr

Output
@ -154,6 +159,8 @@ Output
## TSV output (Currently available in 3.05-dev in master branch on GitHub)

Use the 'tsv' config file by adding tsv at the end of the command to get TSV output.

tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng tsv

Output
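TSV output can be filtered with standard text tools. The sketch below is an assumption-laden example rather than part of Tesseract: it assumes the file is tab-separated with one header row, that column 11 holds the word confidence and column 12 the recognized text, and that the result was written to `./testing/eurotext-eng.tsv`. Check the header row of your own file before relying on these positions. `tsv_words` is a hypothetical helper name.

```shell
# tsv_words MIN_CONF: read TSV on stdin and print the recognized words
# whose confidence is at least MIN_CONF.
# Hypothetical helper; assumes a header row, confidence in column 11
# and text in column 12.
tsv_words() {
  awk -F '\t' -v min="$1" 'NR > 1 && $12 != "" && $11 + 0 >= min { print $12 }'
}

# Example use, assuming the output file name described above:
#   tsv_words 60 < ./testing/eurotext-eng.tsv
```

Rows with an empty text field are skipped, so only entries that carry recognized text are printed.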