mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2024-11-24 02:59:07 +08:00
577e8a8b93
Add PAGE XML export and documentation. To generate PAGE XML output just add 'page' to the tesseract command. The output is outputname + '.page.xml' to avoid conflicts with ALTO export. The output can be customized with the flags: tessedit_create_page_polygon and tessedit_create_page_wordlevel. Co-authored-by: Stefan Weil <sw@weilnetz.de>
4 lines
67 B
Plaintext
4 lines
67 B
Plaintext
tessedit_create_page_xml 1
|
|
# page_xml_polygon 1
|
|
# page_xml_level 0
|