mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2024-12-14 08:39:27 +08:00
dabf3c299f
Text files should end with a LF, but not additional empty lines. Signed-off-by: Stefan Weil <sw@weilnetz.de>
19 lines
1.2 KiB
Plaintext
19 lines
1.2 KiB
Plaintext
This file documents the original source of various examples used for testing.
|
|
|
|
hebrew.png - Sample from Hebrew OCR with Nikud project by Adi Oz and Vered Shani
|
|
project URL - http://www.cs.bgu.ac.il/~nlpproj/hocr/
|
|
direct link to image - http://www.cs.bgu.ac.il/~nlpproj/hocr/images/image00.png
|
|
|
|
hebtypo.jpg - Sample from OCR and Hebrew on the Web project at Universiteit van Amsterdam
|
|
project URL - http://cf.uba.uva.nl/en/collections/rosenthaliana/menasseh/hebtypo.html
|
|
direct link to image - http://cf.uba.uva.nl/en/collections/rosenthaliana/menasseh/gif/hebtypo.jpg
|
|
|
|
DuTillet1004Pg2LG.jpg - Sample from Hebrew Matthew Project with parallel texts in Hebrew & Greek
|
|
as well as English page/chapter labels with Arabic numerals - test with -l heb+grc+eng
|
|
project URL - http://www.torahresource.com/Dutillet.html
|
|
direct link to image - http://www.torahresource.com/DuTillet/DuTillet1004Pg2LG.jpg
|
|
|
|
hebrew-nikud-genesis-1-2.png - Genesis 1-2 Hebrew example from OCR forum
|
|
forum post - https://community.logos.com/forums/p/16124/277997.aspx
|
|
direct link to image - https://community.logos.com/cfs-filesystemfile.ashx/__key/CommunityServer.Discussions.Components.Files/77/4578.Gen.png
|