2177 Commits

Author SHA1 Message Date
Stefan Weil
61f96981e5 training: Fix typos in comments (found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-03 09:16:02 +02:00
Stefan Weil
b9365cdff1 api: Fix typo in comment
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-03 09:14:00 +02:00
Shreeshrii
10cf508442 Changed syntax for AM_CPPFLAGS
as per suggestion in https://github.com/tesseract-ocr/tesseract/pull/1088#discussion_r136700733

Not ALL of these are needed currently.
2017-09-03 08:09:18 +05:30
Shreeshrii
3b05b3e28b revert blank line at end
as not needed after 8bb5a89d5a
2017-09-02 20:00:45 +05:30
Stefan Weil
6ac5d0ba8f dawg: Fix typos and file information in file header
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-30 21:48:06 +02:00
Egor Pugin
9c2fa0d165 Merge pull request #1100 from Sineaggi/master
cmake install fixes
2017-08-30 02:49:24 +03:00
Clayton Walker
69b9d298c7 Fix version.h not being installed
2017-08-29 15:14:54 -06:00
Clayton Walker
52cac3a42e Fix library being named liblibtesseract on non win32 platforms
2017-08-29 15:14:13 -06:00
zdenop
27d25e9c99 Merge pull request #1094 from stweil/spaces
Add missing spaces in help text
2017-08-24 08:16:53 +02:00
Stefan Weil
b016c48d06 Add missing spaces in help text
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-23 19:12:41 +02:00
zdenop
1e3bf29cf6 Merge pull request #1092 from stweil/fixtext
Don't add empty line to text output
2017-08-21 12:05:01 +02:00
Stefan Weil
8bb5a89d5a Don't add empty line to text output
Empty lines in text output are needed to separate paragraphs,
but there should not be an empty line at the end of the text.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-21 09:47:35 +02:00
zdenop
de98a68dd0 Merge pull request #1089 from stweil/text
Fix hint for training build
2017-08-19 22:32:32 +02:00
Stefan Weil
742b303548 Fix hint for training build
* Fix grammar.
* Fix text ("after building ..." is not necessary).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-19 21:35:55 +02:00
Shreeshrii
6773e8b909 add blank lines to match OCRed text
2017-08-19 19:17:08 +05:30
Shreeshrii
267734bc34 Changes needed for adding make check for unittest
2017-08-19 19:01:46 +05:30
Shreeshrii
f3dc156f22 updated version of apiexample_test
2017-08-19 18:46:23 +05:30
Shreeshrii
7b409a1bfa unittest testfile
2017-08-19 18:43:57 +05:30
Shreeshrii
436ad77e44 Create readme.md
2017-08-19 18:43:10 +05:30
Shreeshrii
88e4c62b39 Add files via upload
2017-08-19 18:42:06 +05:30
theraysmith
6f13d75534 Merge pull request #1051 from stweil/googletest
Add GoogleTest infrastructure
2017-08-17 20:19:46 -07:00
zdenop
3847b7dd74 Merge pull request #1085 from KindDragon/patch-1
Added CMake option to use system ICU library
2017-08-17 08:37:33 +02:00
Arkady Shapkin
d171488e21 Added CMake option to use system ICU library
2017-08-17 02:50:54 +03:00
zdenop
7afa05a03e Merge pull request #1072 from stweil/listlangs
List available languages recursively
2017-08-13 14:50:42 +02:00
zdenop
197b89b6ac Merge pull request #1077 from chrismamo1/chore/cleanup-compiler-warnings
WIP: Chore/cleanup compiler warnings
2017-08-13 14:50:26 +02:00
zdenop
3755a29abb Merge pull request #1076 from chrismamo1/bug/listlangs-without-eng
move code around so that list-langs will work without an English traineddata file
2017-08-13 14:50:10 +02:00
chrismamo1
6f281c36a7 fix a problem I introduced in a previous commit
2017-08-12 18:09:22 -05:00
chrismamo1
7111167497 fix a set-but-not-used warning and add casts for comparing signed+unsigned numbers
2017-08-12 17:53:28 -05:00
chrismamo1
b89bb09f9b fix a set but not used warning and cleanup some old code from 2007
2017-08-12 17:48:33 -05:00
chrismamo1
f9b51d7983 suppress a strict aliasing warning; the original author was very clear about the nature of the problematic code
2017-08-12 17:36:50 -05:00
chrismamo1
5fd3e22f74 move code around so that list-langs will work without an English traineddata file
2017-08-12 17:15:27 -05:00
Stefan Weil
cc0d87c5b8 List available languages recursively
Tesseract supports hierarchies of languages and uses them since
the new files best/*.traineddata were added.

Now `tesseract --list-langs` also shows any traineddata files in
subdirectories of the tessdata directory.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-10 18:55:38 +02:00
Egor Pugin
efa50daf5a Merge pull request #1070 from stweil/resolution
Change default resolution from 70 to 300 dpi
2017-08-08 23:05:14 +03:00
Stefan Weil
0720b3f38b Change default resolution from 70 to 300 dpi
The default resolution is used for images without an explicit resolution
or with an unreasonable resolution (smaller than 70 or larger than 2400).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-08 16:48:10 +02:00
Ray Smith
5f5e85e4a0 Fixed lack of error on non-existent traineddata
2017-08-07 09:58:43 -07:00
Ray Smith
0a91498195 Improved error message on missing optional config
2017-08-07 09:50:49 -07:00
Ray Smith
4b3c5f6c35 Added check for non-empty traineddata flag
2017-08-07 09:43:30 -07:00
Egor Pugin
c67c2e9f41 Add combine_lang_model to cmake and cppan builds.
2017-08-06 14:46:32 +03:00
zdenop
08ec5775a1 Merge pull request #1064 from stweil/win32
Fix broken build for Windows
2017-08-04 10:50:01 +02:00
Stefan Weil
cdec915e17 Fix broken build for Windows
Windows does not provide a mkdir function with two parameters.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-04 10:18:35 +02:00
Ray Smith
8e55e52be7 Harder unittest that uses file i/o and string manipulation
2017-08-03 15:51:18 -07:00
Ray Smith
4572940639 Portability fix to help tests compile with the same code in both Google and github
2017-08-03 15:42:26 -07:00
Ray Smith
2fbcba62e5 Initial push of one simple unittest
2017-08-02 17:35:29 -07:00
Ray Smith
77c44cdecd Added convert to int and directory listing to combine_tessdata
2017-08-02 14:53:07 -07:00
Ray Smith
2ef1aeaeb4 Added AVX2 and AVX512 detector
2017-08-02 14:15:50 -07:00
Ray Smith
39b168a0b6 Removed errors introduced by git merge
2017-08-02 14:12:45 -07:00
Ray Smith
4e9665debf Added ADAM optimizer, unless git screwed it up, cos there is no diff
2017-08-02 14:03:50 -07:00
Ray Smith
2633fef0b6 Part 2 of separating out the unicharset from the LSTM model, fixing command line for training
2017-08-02 13:29:23 -07:00
Egor Pugin
61adbdfa4b Merge pull request #1054 from tdhintz/master
std::max build fix.
2017-07-27 02:49:21 +03:00
Hintz
67314ea9bd Merge pull request #1 from tdhintz/tdhintz-stdmax-patch
Define std::max under VS2017 x64
2017-07-26 16:40:08 -05:00