install data files; small fix of INSTALL, README; removed ABOUT-NLS (NLS not used at the moment)

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@667 d0cd1f9f-072b-0410-8dd7-cf729c803f20
This commit is contained in:
zdenop@gmail.com 2012-02-05 16:25:40 +00:00
parent 0a50c9ca5c
commit 425c2b8205
4 changed files with 54 additions and 1078 deletions

1068
ABOUT-NLS

File diff suppressed because it is too large Load Diff

View File

@ -7,7 +7,8 @@ unlimited permission to copy, distribute and modify it.
Basic Installation Basic Installation
================== ==================
These are generic installation instructions. These are generic installation instructions. First you need to run
`./autogen.sh', that creates `configure' script.
The `configure' shell script attempts to guess correct values for The `configure' shell script attempts to guess correct values for
various system-dependent variables used during compilation. It uses various system-dependent variables used during compilation. It uses

19
README
View File

@ -21,15 +21,16 @@ in this distribution is now licensed under the Apache License:
** limitations under the License. ** limitations under the License.
Dependencies and Licenses: Dependencies and Licenses
========================== =========================
Leptonica is required. (www.leptonica.com). Tesseract no longer compiles Leptonica is required. (www.leptonica.com). Tesseract no longer compiles
without Leptonica. without Leptonica.
Libtiff is no longer required as a direct dependency.
Installing and Running Tesseract Installing and Running Tesseract
--------------------------------
All Users Do NOT Ignore! All Users Do NOT Ignore!
The tarballs are split into pieces. The tarballs are split into pieces.
@ -57,8 +58,8 @@ Instructions for using the training tools are documented separately at
Tesseract wiki http://code.google.com/p/tesseract-ocr/w/list Tesseract wiki http://code.google.com/p/tesseract-ocr/w/list
Windows: Windows
-------- -------
Please use installer (for 3.00 and above). Tesseract is library with Please use installer (for 3.00 and above). Tesseract is library with
command line interface. If you need GUI, please check AddOns wiki page command line interface. If you need GUI, please check AddOns wiki page
@ -83,8 +84,8 @@ on AddOns wiki page:
http://code.google.com/p/tesseract-ocr/wiki/AddOns#Tesseract_3.0x http://code.google.com/p/tesseract-ocr/wiki/AddOns#Tesseract_3.0x
Non-Windows (or Cygwin): Non-Windows (or Cygwin)
------------------------ -----------------------
You have to tell Tesseract through a standard unix mechanism where to You have to tell Tesseract through a standard unix mechanism where to
find its data directory. You must either: find its data directory. You must either:
@ -110,8 +111,8 @@ libtesseract_api.
History: History
======== =======
The engine was developed at Hewlett Packard Laboratories Bristol and The engine was developed at Hewlett Packard Laboratories Bristol and
at Hewlett Packard Co, Greeley Colorado between 1985 and 1994, with some at Hewlett Packard Co, Greeley Colorado between 1985 and 1994, with some
more changes made in 1996 to port to Windows, and some C++izing in 1998. more changes made in 1996 to port to Windows, and some C++izing in 1998.

View File

@ -1,2 +1,44 @@
datadir = @datadir@/tessdata datadir = @datadir@/tessdata
SUBDIRS = configs tessconfigs SUBDIRS = configs tessconfigs
EXTRA_DIST = makedummies
data_DATA = ./bul.traineddata ./mlt.traineddata ./chr.traineddata \
./slk.traineddata ./dan-frak.traineddata ./eng.traineddata \
./ces.traineddata ./afr.traineddata ./swa.traineddata \
./kan.traineddata ./bel.traineddata ./ind.traineddata \
./lit.traineddata ./nld.traineddata ./osd.traineddata \
./mkd.traineddata ./est.traineddata ./fra.traineddata \
./hin.traineddata ./lat_lid.traineddata ./nor.traineddata \
./por.traineddata ./ron.traineddata ./swe.traineddata \
./pol.traineddata ./ara.traineddata ./tel.traineddata \
./ell.traineddata ./mal.traineddata ./vie.traineddata \
./heb.traineddata ./deu.traineddata ./eus.traineddata \
./ita_old.traineddata ./rus.traineddata ./sqi.traineddata \
./spa.traineddata ./glg.traineddata ./slk-frak.traineddata \
./equ.traineddata ./hrv.traineddata ./frk.traineddata \
./cat.traineddata ./lav.traineddata ./ukr.traineddata \
./enm.traineddata ./dan.traineddata ./fin.traineddata \
./ben.traineddata ./srp.traineddata ./tha.traineddata \
./hun.traineddata ./tgl.traineddata ./frm.traineddata \
./slv.traineddata ./chi_sim.traineddata ./tam.traineddata \
./tur.traineddata ./epo.traineddata ./msa.traineddata \
./kor.traineddata ./isl.traineddata ./jpn.traineddata \
./chi_tra.traineddata ./ita.traineddata ./spa_old.traineddata \
./deu-frak.traineddata ./aze.traineddata ./fra.cube.lm \
./ita.tesseract_cube.nn ./eng.cube.word-freq ./rus.cube.lm \
./spa.cube.size ./fra.cube.nn ./fra.cube.params ./rus.cube.size \
./fra.cube.fold ./eng.cube.size ./ita.cube.bigrams \
./eng.tesseract_cube.nn ./rus.cube.params ./hin.cube.nn \
./spa.cube.params ./hin.cube.lm ./fra.cube.word-freq \
./spa.cube.word-freq ./ara.cube.nn ./ara.cube.word-freq \
./spa.cube.fold ./eng.cube.nn ./eng.cube.params ./eng.cube.lm \
./ita.cube.size ./hin.tesseract_cube.nn ./ita.cube.lm \
./fra.cube.bigrams ./ara.cube.fold ./spa.cube.bigrams \
./hin.cube.word-freq ./rus.cube.word-freq ./ita.cube.word-freq \
./fra.tesseract_cube.nn ./rus.cube.fold ./ara.cube.size \
./eng.cube.fold ./ita.cube.params ./ara.cube.params ./ita.cube.fold \
./ara.cube.bigrams ./hin.cube.params ./hin.cube.fold ./spa.cube.lm \
./ita.cube.nn ./fra.cube.size ./eng.cube.bigrams ./ara.cube.lm \
./rus.cube.nn ./spa.cube.nn ./hin.cube.bigrams