Thijs Leegwater
f061503a14
Added JPEG quality option parameter (-c jpg_quality=n)
2018-01-11 09:11:30 +01:00
Stefan Weil
c4d8f27019
Fix compiler warning (-Wchar-subscript) ( #1259 )
...
ccstruct/seam.cpp:66:26: warning:
array subscript has type 'char' [-Wchar-subscripts]
Fix it by using an unsigned index and use the same type for related values.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-01-08 21:26:25 +01:00
Egor Pugin
000d027a9f
Rename tesseract library.
2018-01-05 18:51:35 +03:00
Amit D
bc668da042
Update README.md ( #1239 )
2017-12-20 08:14:18 +01:00
Josh Reid
cdc35338c5
Added check if input PSM value is outside of range ( #1236 )
...
Wrote a function to throw an error if PSM is outside 0-13 or OEM is outside 0-5.
fixes #1234
2017-12-14 11:37:44 +01:00
Egor Pugin
eba0ae3b88
Merge pull request #1218 from hsen-dev/master
...
fixed missing include for std::back_inserter.
2017-11-24 17:31:27 +03:00
Ria
d751305804
fixed missing include for std::back_inserter.
...
with Visual Studio 2015 RTM:
Error C2039: 'back_inserter': is not a member of 'std'
Error C3861: 'back_inserter': identifier not found
need "iterator" with Visual Studio 2015 (vc14).
#include <iterator>
2017-11-23 11:37:35 +03:30
Stefan Weil
ebbfc3ae8d
Improve robustness of function LoadDataFromFile ( #1207 )
...
ftell returns a long value which can be negative when an error occurred.
It returns LONG_MAX for directories.
Both cases were not handled by the old code.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-11-10 15:46:38 +01:00
Stefan Weil
f3c4b894dc
Fix help message for unicharset_extractor ( #1206 )
...
If unicharset_extractor was called without any argument,
a help message was printed by tesseract::ParseCommandLineFlags.
Replace that by the local help message which is better.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-11-10 15:45:35 +01:00
Amit D
ad5ee18415
Make font size estimation work with the lstm engine ( #1173 )
...
**Partial** fix for issue #1074
2017-10-20 10:07:16 +02:00
ivanzz1001
fb359fc981
Update unicharset_extractor.cpp ( #1153 )
...
* change IsWhitespace to IsUTF8Whitespace
To solve "Phase UP: Generating unicharset and unichar properties files" ERROR #1147
please reference: [#1147 ](https://github.com/tesseract-ocr/tesseract/issues/1147 )
* Update unicharset_extractor.cpp
fix the "Phase UP: Generating unicharset and unichar properties files" ERROR
* Update unicharset_extractor.cpp
fix "Phase UP: Generating unicharset and unichar properties files" ERROR #1147
* Update unicharset_extractor.cpp
fix the encoding invalid problem and fix the comment
2017-10-13 11:46:42 +02:00
Egor Pugin
1b0379c257
Merge pull request #1163 from cysp/bugfix/leptonica-pkgconfig
...
Add Leptonica's pkg-config-found library directory to the search path
2017-10-03 16:05:51 +03:00
Scott Talbot
a538cd126b
Add Leptonica's pkg-config-found library directory to the search path
2017-10-03 21:15:44 +11:00
Egor Pugin
1b4fb3a762
Update appveyor.yml
2017-09-26 17:01:52 +03:00
zdenop
2cc531e6bf
Merge pull request #1140 from stweil/pagebreak
...
Remove Tesseract parameter "include_page_breaks" and use FF by default
2017-09-19 08:41:08 +02:00
Stefan Weil
aa6eb6bd46
Remove Tesseract parameter "include_page_breaks" and use FF by default
...
Now Tesseract adds a page break (normally form feed) by default.
It is still possible to suppress page breaks by setting an empty
page_separator.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-19 07:34:32 +02:00
zdenop
3bb573aeb9
Merge pull request #1139 from stweil/revert
...
Revert "change type to UChar32 to fix IsValidCodepoint"
2017-09-18 08:38:35 +02:00
Stefan Weil
07f1400e6f
Revert "change type to UChar32 to fix IsValidCodepoint"
...
This reverts commit a404c9cdb3
.
That code no longer matched the specification (see code comment).
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-18 07:42:00 +02:00
zdenop
3b62badd77
Merge pull request #1134 from Shreeshrii/fixvalidcodepoint
...
change type to UChar32 to fix IsValidCodepoint
2017-09-17 15:38:58 +02:00
Shree Devi Kumar
a404c9cdb3
change type to UChar32 to fix IsValidCodepoint
2017-09-16 14:10:34 +05:30
zdenop
e62e8f5f80
Merge pull request #1109 from mingodad/mingodad-fix-interword-spaces
...
Fix to preserve_interword_spaces work again
2017-09-15 08:45:21 +02:00
zdenop
188e1fc2c1
Merge pull request #1128 from vidiecan/issue_1127
...
Returns the correct dictionary if lstm only used
2017-09-14 20:38:49 +02:00
jm
2a77d5ad69
returns the correct dictionary if lstm only used
2017-09-14 13:03:22 +02:00
zdenop
d962d344d9
Merge pull request #1126 from stweil/unittest
...
Fix unittest/apiexample_test
2017-09-13 14:11:17 +02:00
Stefan Weil
67f706f9ef
unittest/apiexample_test: Use googletest API for error handling
...
Use the Google C++ Testing Framework instead of calling fprint / exit.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-13 13:50:39 +02:00
Stefan Weil
b42df389be
unittest/apiexample_test: Report missing test image
...
This avoids a runtime error (SIGSEGV caused by nullptr) for that case.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-13 13:43:49 +02:00
Stefan Weil
5c7600c282
unittest/apiexample_test: Fix test for out-of-tree builds
...
The test expects to find phototest.tif and phototest.txt
in directory ../testing. Create symbolic links if those
files don't exist there.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-13 13:29:16 +02:00
Stefan Weil
8655524772
unittest/apiexample_test: Use "C" locale
...
We cannot assume that the locale "en_US.UTF-8" is always available.
Using the "C" locale should work better.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-13 13:26:09 +02:00
zdenop
7e4f5faa72
Merge pull request #1123 from stweil/googletest
...
unittest: Fix path for libtesseract in out of tree builds
2017-09-12 20:44:01 +02:00
Stefan Weil
4ae227df53
unittest: Fix path for libtesseract in out of tree builds
...
The library is provided in the build path (which is not
the same as the source path for out of tree builds).
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-12 17:24:34 +02:00
zdenop
04b0397c5f
Merge pull request #1120 from amitdo/multiplelibs
...
Autotools build: Remove the option 'USING_MULTIPLELIBS'
2017-09-12 09:02:13 +02:00
zdenop
876344f8d0
Merge pull request #1121 from Shreeshrii/install-git-instructions
...
Add missing information
2017-09-12 09:01:16 +02:00
Shreeshrii
c55e487ea4
Add additional information
...
- minimum Leptonica version for 4.0x
- sudo ldconfig
2017-09-12 11:32:27 +05:30
amitdo
a905548ed6
Autotools build: Remove the option 'USING_MULTIPLELIBS'
...
Libtool's convenience libraries should never be installed. Fixes #985 .
2017-09-11 15:03:53 +03:00
zdenop
dd422c36e6
Merge pull request #1118 from Shreeshrii/fixoverwrite
...
fix accidental overwrite because of using old version
2017-09-11 12:22:48 +02:00
Shree Devi Kumar
4e9c975859
fix accidental overwrite using old version
2017-09-11 14:45:25 +05:30
zdenop
7569c899f7
Merge pull request #1116 from Shreeshrii/fixunittest
...
delete gunit.h; add merge_unicharsets to build
2017-09-11 09:33:02 +02:00
Shreeshrii
9a038f893a
Add merge_unicharsets to build
2017-09-10 21:51:52 +05:30
Shree Devi Kumar
f6c22dbf66
remove duplicate - gunit.h
2017-09-10 14:50:59 +05:30
Egor Pugin
c50e4f6238
Fix cppan dependency.
2017-09-10 02:28:05 +03:00
Egor Pugin
36e0d2093a
Fix windows build.
2017-09-09 21:25:25 +03:00
Egor Pugin
63e1862049
Disable travis os x builds as they do not work well on travis-side.
2017-09-09 21:13:14 +03:00
Ray Smith
3493785f7d
Fixed apiexample and intsimdmatrix tests and prepared Makefile.am for more tests
2017-09-08 17:34:31 +01:00
Ray Smith
9d258e20d3
Fixed build of unicharset_extractor
2017-09-08 15:33:03 +01:00
Ray Smith
fc6a390c6c
Added intsimdmatrix as a generic integer matrixdotvector function with AVX2 and SSE specializations
2017-09-08 15:06:19 +01:00
Ray Smith
ad74e8a69c
Fixed integer overflow error
2017-09-08 12:46:48 +01:00
Ray Smith
4cf123e099
Added ability to randomly rotate images upside-down during training for training OSD
2017-09-08 12:42:57 +01:00
Ray Smith
3e63918f9d
Fixed order of characters in ligatures of RTL languages issue #648
2017-09-08 11:55:11 +01:00
Ray Smith
a912967cc3
Rewrote unicharset_extractor to use the new string normalizer and read plain text as well as box files.
2017-09-08 11:49:57 +01:00
Ray Smith
c773eb5784
Fixed rendering of Thai and units of char spacing
2017-09-08 10:29:03 +01:00