Commit Graph

4500 Commits

Author SHA1 Message Date
zdenop
2b688983f6 cmake: fix typo 2019-11-11 19:10:01 +01:00
zdenop
2d6f38eebf fix using bilevel tiff in pdf output 2019-11-10 16:11:52 +01:00
Shreeshrii
99dfa8a680 Add separator and training_iteration to checkpoint name (#2752)
* Add separator and training_iteration to checkpoint name
* specify modelname_N.NN_NN_NN.checkpoint for intermediate checkpoint
2019-11-09 12:22:40 +01:00
Egor Pugin
944c1d9bff
Merge pull request #2751 from stweil/master
Fix issue #2748
2019-11-08 21:43:23 +03:00
Stefan Weil
ac46b286a4 Fix issue #2748
Commit 94d0f77f56 tried to fix issue #2741
but created a new problem.

This commit should fix both old and new issue.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-08 17:12:20 +01:00
Stefan Weil
0406f7706d Use BRT_UNKNOWN instead of BRT_NOISE to initialize ColPartition::blob_type_
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-08 07:40:06 +01:00
Stefan Weil
9b46a67efa Use "C" locale for printing parameters
This fixes a test for the Python wrapper `tesserocr` (python setup.py test).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-04 19:21:20 +01:00
Stefan Weil
41c3e93cb7 Add missing libraries in configuration for pkg-config
This fixes linker errors in third-party software like tesserocr for builds
which use any of these libraries.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-02 19:00:05 +01:00
Egor Pugin
ab836dbb31
Merge pull request #2743 from DavidMaung/master
Exposed the text2image option --ptsize to tesstrain.sh.
2019-11-02 17:09:51 +03:00
Egor Pugin
5d5ae6a963
Merge pull request #2745 from stweil/format-headers
Format API headers
2019-11-02 11:44:56 +03:00
Egor Pugin
ceea079eff
Merge pull request #2744 from stweil/master
Fix issue #2741
2019-11-02 11:44:36 +03:00
Stefan Weil
90db9b5224 Format API header files
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-02 07:54:55 +01:00
Stefan Weil
29dcfb081a Use relative includes for API headers in API headers
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-02 07:54:55 +01:00
Stefan Weil
a306cd7370 Fail if no valid lstmf file was written (fix issue #2741)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-01 21:52:45 +01:00
Stefan Weil
94d0f77f56 Don't create an empty lstmf file
If Tesseract cannot find text in the input image, it should not write
an empty lstmf file. This problem was reported in issue #2741.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-11-01 21:43:26 +01:00
maungd@battelle.org
3d7afb69ea Exposed the text2image option --ptsize to tesstrain.sh. Text2image has the
option --ptsize which defaults to 12.  This option is not exposed through
tesstrain.sh; thus, you cannot use tesstrain.sh to explore training with
different font sizes.  I made a small modification to expose the --ptsize
option to tesstrain.sh.  It defaults to 12 if not specified.
2019-11-01 15:10:58 -04:00
zdenop
6f31c3b849 Merge branch 'master' of https://github.com/tesseract-ocr/tesseract 2019-11-01 09:29:10 +01:00
zdenop
dbcff0ae88 cmake: fix clan openmp build on windows 2019-11-01 09:28:51 +01:00
zdenop
a5716692b2 cmake: allow build on Raspbian GNU/Linux 9 (stretch) 2019-11-01 09:28:08 +01:00
Egor Pugin
0d96ff3617
Merge pull request #2740 from stweil/master
Use pre-calculated lookup tables for all C++ compilers
2019-11-01 01:13:42 +03:00
Egor Pugin
92e543538a
Update .travis.yml 2019-11-01 00:47:36 +03:00
Stefan Weil
b5498c70fa Use pre-calculated lookup tables for all C++ compilers
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-31 20:26:01 +01:00
Egor Pugin
2bcc9d8093 Remove cppan build. 2019-10-30 21:37:38 +03:00
Stefan Weil
ca87b06d59 Fix build for Intel Compiler (issue #2736)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-30 10:09:44 +01:00
Stefan Weil
20a50e9bcb Fix typo in comment
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-30 10:06:31 +01:00
Stefan Weil
6a3df91edf Update cppan build for new include file structure
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 18:09:08 +01:00
Stefan Weil
e3e7a9bf33 Use #include <tesseract/*.h> for unittest
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 18:01:18 +01:00
Egor Pugin
c7c250c8b4 Add include dir explicitly. 2019-10-29 14:57:32 +03:00
Egor Pugin
2a37f5dd62 Update includes to use <>. 2019-10-29 14:50:11 +03:00
Stefan Weil
7bd94a9354 Remove old code for no longer existing src/api/*.h
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 12:47:04 +01:00
Egor Pugin
9e324938ab Update includes to use <>. 2019-10-29 14:31:38 +03:00
Stefan Weil
629b05d978 Update README.md and other documentation for new include file structure
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 12:26:41 +01:00
Stefan Weil
7ef20bb0e6 Use flat make for include/tesseract
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 12:01:51 +01:00
Stefan Weil
061eccd6ae Rename tesseract/tess_version.h -> tesseract/version.h
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 11:47:31 +01:00
Egor Pugin
301704b3a3 Merge branch 'master' of github.com-egorpugin:tesseract-ocr/tesseract 2019-10-29 13:29:48 +03:00
Egor Pugin
8d10857973 Update sw build. 2019-10-29 13:27:12 +03:00
Stefan Weil
0481cebfd9 Support sw build with new include file structure
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 11:14:37 +01:00
Egor Pugin
aeb98d9ff2
Merge pull request #2735 from amitdo/move-headers2
Move all public headers to include/tesseract
2019-10-29 12:36:40 +03:00
Stefan Weil
26ba7e2f81 Fix #include path of public headers for unittest
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 08:41:47 +01:00
Stefan Weil
17b0951736 Support cmake with new include file structure
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-29 08:33:55 +01:00
amitdo
2f8884a64e Fix autotools build 2019-10-28 21:23:58 +02:00
amitdo
e1bae15547 Fix #include path of public headers 2019-10-28 19:10:30 +02:00
amitdo
dfede8ac01 Move all public headers to include/tesseract 2019-10-28 18:50:31 +02:00
zdenop
cede5b34e7
Add pageseg_apply_music_mask option to allow disabling the musi… (#2732)
Add pageseg_apply_music_mask option to allow disabling the music mask
2019-10-27 17:02:05 +01:00
zdenop
4a37cde0d9 fix inverting (Bilevel BW png) in pdf; fixes # 2059 2019-10-27 14:15:12 +01:00
Nat
52bc15acd9 Add pageseg_apply_music_mask option to allow disabling the music mask 2019-10-24 11:44:05 -05:00
Egor Pugin
048f729785 Merge branch 'master' of github.com-egorpugin:tesseract-ocr/tesseract 2019-10-23 23:30:12 +03:00
Egor Pugin
401e60c54c
Merge pull request #2728 from egorpugin/master
Remove TESS_CALL.
2019-10-23 23:29:38 +03:00
Shree
df6b1ce452 remove legacy parameter disable_character_fragments from lstm.train 2019-10-23 13:15:16 +02:00
Egor Pugin
c727b556f0 Remove unneeded TESS_API from source file. 2019-10-23 13:26:46 +03:00