Commit Graph

5641 Commits

Author SHA1 Message Date
Nick White
306610c5ec Use shell quoting rather than pluses to separate font arguments in tesstrain.sh
The way tesstrain.sh handled font names was really weird, using '+'
signs as a delimiter. However quoting arguments is a much more
straightforward, standard and sensible way to do things.

So whereas previously one would have used this:
  --fontlist Times New Roman + Arial Black
Now they should be specified like this:
  --fontlist "Times New Roman" "Arial Black"
2015-10-30 13:26:45 +00:00
zdenop
b882590491 Merge pull request #65 from ws233/master
Type mismatch on 64bit platforms
2015-10-28 20:02:20 +01:00
zdenop
b16b8a84bf Merge pull request #119 from amitdo/osd-script-name
OSD: Print script name instead of meaningless script id
2015-10-28 10:14:56 +01:00
amitdo
dcfdd5c035 OSD: Print script name instead of meaningless script id 2015-10-28 09:50:28 +02:00
zdenop
86407964b7 Update INSTALL.GIT.md 2015-10-11 17:16:42 +02:00
Zdenko Podobný
bf43d6a6e9 improve cmake instruction and rename it to INSTALL.GIT.md 2015-10-11 17:09:01 +02:00
Zdenko Podobný
0eefed6a14 cmake - add initial cmake instruction to INSTALL.GIT ; rename cmake output tesseractmain to tesseract; updage badges links 2015-10-10 17:26:32 +02:00
zdenop
bb15031266 Merge pull request #109 from egorpugin/master
CMake improvements
2015-10-10 14:49:26 +02:00
Egor Pugin
f3b20a0f4d Merge branch 'master' of github.com-egorpugin:egorpugin/tesseract 2015-10-09 18:22:16 +03:00
Egor Pugin
0788098b80 Fix Linux build with Leptonica_BUILD_DIR. 2015-10-09 18:22:04 +03:00
Egor Pugin
8e157dbb7e Fix build. 2015-10-09 18:18:02 +03:00
Egor Pugin
dfb5aa5c34 Add Leptonica_BUILD_DIR option for CMake find_package in Config mode. Set correct .so name on Linux. 2015-10-09 18:12:02 +03:00
zdenop
896db80f26 Merge pull request #108 from johnteslade/unicharset_extractor
Unicharset extractor problems with wchar
2015-10-05 21:49:03 +02:00
Egor Pugin
a614edbe94 Merge branch 'master' of github.com:egorpugin/tesseract 2015-10-05 22:33:24 +03:00
Egor Pugin
93d1c66e4e Add FindLeptonica.cmake module to find installed liblept on *nix systems. 2015-10-05 22:33:10 +03:00
John Slade
379da1f2e0 training/unicharset_extractor.cpp: Print whether WCTYPE is included
Character properties are autogenerated only if wctype is found on the
system.  However, it is not possible to know if a version of
unicharset_extractor was compiled with this support (especially if it
was installed as a pre-compiled binary).

This commit adds a print to the usage details to output if the binary
was compiled with wctype support.
2015-10-05 11:54:24 +01:00
John Slade
2517ffefe7 configure.ac: Detect wchar_t using wchar.h header
The wchar_t type is defined in `wchar.h` and if this header is not
included by autoconf the detection of the type will fail.  This type is
required by `unicharset_extractor` to autogenerate the character
properties.

This problem was detected when running under Fedora 21.
2015-10-05 11:20:34 +01:00
zdenop
dd8c129973 Merge pull request #107 from egorpugin/master
Initial CMake build system implementation
2015-10-02 20:10:11 +02:00
Egor Pugin
f369585f56 Merge branch 'master' of github.com:tesseract-ocr/tesseract 2015-10-02 12:02:04 +03:00
zdenop
2e7a633f06 Merge pull request #105 from syk0saje/patch-1
adds sudo to "make install" command
2015-09-28 22:16:32 +02:00
Pepe Bawagan
bd917b9fee adds sudo to "make install" command
for consistency with instructions that show up while installing
2015-09-29 00:24:05 +08:00
zdenop
0d61f0c05a Merge pull request #95 from stweil/master
Fix a bunch of typos in comments, documentation and strings
2015-09-15 09:32:54 +02:00
Stefan Weil
00a4e06be9 wordrec: Fix typos in comments
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:23:48 +02:00
Stefan Weil
425c2391b2 Java: Fix typos in comments and strings
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:18:44 +02:00
Stefan Weil
4d2fd0f8c9 Doxyfile: Fix typo in comment (found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:17:48 +02:00
Stefan Weil
97d47a406d dict: Fix typos in comments and strings
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:16:42 +02:00
Stefan Weil
55c81cb193 cutil: Fix typos in comments
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:16:13 +02:00
Stefan Weil
5378679dce cube: Fix typos in comments
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:14:03 +02:00
Stefan Weil
55fde61a8f classify: Fix typos in comments and strings
All of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:12:06 +02:00
Stefan Weil
539b7fbbab ccutil: Fix typos in comments and strings
Most of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:09:18 +02:00
Stefan Weil
bef8cad38d ccstruct: Fix typos in comments and strings
Most of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:02:00 +02:00
Stefan Weil
318b88daa6 ccmain: Fix typos in comments and strings
Most of them were found by codespell.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 21:59:16 +02:00
Stefan Weil
11b2a4d9af api: Fix typos in comments (all found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 21:54:27 +02:00
Stefan Weil
48171dea5f COPYING: Fix typo found by codespell
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 21:52:54 +02:00
Nick White
b389e2782c Set default exposure settings for grc training 2015-09-11 13:30:05 +01:00
Nick White
ac6050def0 Remove NUMBER_DAWG_FACTOR and WORD_DAWG_FACTOR from grc rules
These aren't used anywhere, and are difficult to calculate for grc,
so leave them as the default.
2015-09-11 09:29:55 +01:00
zdenop
b216f6f66b Merge pull request #92 from nickjwhite/bettertesstrain
Improve tesstrain.sh script
2015-09-10 19:00:26 +02:00
Nick White
714d2cc4ae Use different font list for grc training
This font list contains a selection fonts produced by the Greek Font
Society <http://greekfontsociety.gr>, and is the result of testing
with a large corpus of a variety of scanned works.
2015-09-10 15:34:44 +01:00
Nick White
de789ac8ea Use mktemp to create workspace directory
mktemp is a better idea for security, as well as enabling users to
specify a different directory using the TMPDIR environment variable,
which is useful if /tmp is a small tmpfs.

Also fix a bug where the first few log messages were failing as the
workspace directory wasn't been created early enough.
2015-09-10 15:05:07 +01:00
Nick White
c0133ecfa6 Add --exposures option to tesstrain.sh
This flag can be used to specify multiple different exposure levels
for a training. There was some code already in tesstrain_utils.sh
to deal with multiple exposure levels, so it looks like this
functionality was always intended.

The default usage does not change, with exposure level 0 being the
only one used if --exposures is not used.
2015-09-10 14:57:17 +01:00
zdenop
da703a0a78 Merge pull request #91 from tfmorris/scrollview-convenience
Simplify build and run of ScrollView
2015-09-10 08:36:08 +02:00
zdenop
f25d63ea91 Merge pull request #90 from tfmorris/32bitlong-overflow
Add ULL to constants which overflow 32 bits
2015-09-10 08:35:38 +02:00
Tom Morris
b10fe909bc Merge branch 'scrollview-convenience' of github.com:tfmorris/tesseract into scrollview-convenience 2015-09-09 17:24:30 -04:00
Tom Morris
4d92667e89 Simplify build and run of ScrollView 2015-09-09 17:21:58 -04:00
Tom Morris
a1e14ea93c Add ULL to constants which overflow 32 bits 2015-09-09 16:51:45 -04:00
Egor Pugin
25136e40ea Restore ICU_INCLUDE_DIRS for OS X. 2015-09-07 13:02:22 +03:00
Egor Pugin
5e7ffe9e56 Remove unused option. 2015-09-07 12:57:02 +03:00
Egor Pugin
252080dbca Do static build with some tools on appveyor. 2015-09-07 12:50:20 +03:00
Egor Pugin
670e0fafb3 Hide pango and cairo includes from targets that do not use it. 2015-09-07 12:49:08 +03:00
Egor Pugin
da3852dc77 Fix cygwin build. 2015-09-07 02:49:18 +03:00