Noah Metzger
e8b9c10d07
Clean up lstm_choice_mode and cut it down to 2 modes instead of 4
...
Signed-off-by: Noah Metzger <noah.metzger@bib.uni-mannheim.de>
2019-09-02 11:09:53 +02:00
Stefan Weil
fdf4067296
Fix warnings from LGTM
...
This fixes three LGTM warnings:
Multiplication result may overflow 'float' before it is converted to 'double'.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-30 22:04:24 +02:00
Stefan Weil
4a434809b0
fuzzer-api: Use optional macro LIB_FUZZING_ENGINE for build
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-30 15:32:33 +02:00
Stefan Weil
c460d19316
Add missing TensorFlow libraries for fuzzer-api
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-30 14:54:04 +02:00
Stefan Weil
dc90741f1b
Fix crash when function lookup tables are accessed with NaN
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-30 13:42:09 +02:00
zdenop
d889a38f80
Merge pull request #2627 from stweil/master
...
capi: Add missing PSM_RAW_LINE to TessPageSegMode
2019-08-25 15:36:43 +02:00
Stefan Weil
7968f50fe6
capi: Add missing PSM_RAW_LINE to TessPageSegMode
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-25 09:08:09 +02:00
zdenop
0ded672067
fix typo
2019-08-18 18:47:32 +02:00
Egor Pugin
0a3a351cb3
Merge pull request #2620 from stweil/simd
...
simd: Check whether the OS supports FMA, AVX, ...
2019-08-17 08:31:54 +03:00
Stefan Weil
00cff79f7f
simd: Check whether the OS supports FMA, AVX, ...
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-16 22:51:17 +02:00
Stefan Weil
43b2e9513b
lstmtrainer: Fix diagnostic message
...
Signed character values must be converted to unsigned integers for %x.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-15 14:31:32 +02:00
Stefan Weil
100d8cd29b
lstmtester: Add missing space in log messages
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-14 14:12:47 +02:00
Stefan Weil
a86251c62b
classify/Makefile: Fix inconsistent style
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-13 21:35:59 +02:00
Egor Pugin
423a188513
Export some classify vars.
2019-08-13 20:12:21 +03:00
Stefan Weil
46e2a0f106
Remove more code for builds with disabled legacy engine
...
Now the Tesseract library no longer includes unused code.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-13 17:49:10 +02:00
Stefan Weil
f43ca88f29
[sw] Update build for commit e84cb24def
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 19:36:41 +02:00
Egor Pugin
f71e58c557
[sw] Try to fix build.
2019-08-12 19:50:22 +03:00
Egor Pugin
73f713519c
Merge pull request #2614 from stweil/training
...
Move source files which are used for training only to src/training
2019-08-12 19:35:50 +03:00
Egor Pugin
23afe833f9
Merge pull request #2613 from stweil/unused
...
Remove unused code
2019-08-12 19:33:52 +03:00
Stefan Weil
e84cb24def
Move source files which are used for training only to src/training
...
They are moved from src/classify and src/lstm to src/training.
This reduces the size of the Tesseract library.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 17:08:08 +02:00
Stefan Weil
ba17bc8204
OpenCL: Add static attribute for kernel_src
...
It is only used in openclwrapper.cpp.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 15:13:45 +02:00
Stefan Weil
970622fbd1
Remove unused functions create_edges_window, draw_raw_edge
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 15:04:10 +02:00
Stefan Weil
23e605911f
Remove unused function truncate_path and related files
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 14:48:56 +02:00
Stefan Weil
bce585286d
Remove global array kPolyBlockNames from Tesseract library
...
It is only used in unittest/layout_test.cc after moving a test from
baseapi_test.cc to that file, so it can be made local.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 14:33:55 +02:00
Egor Pugin
c757b4ec19
Merge pull request #2612 from stweil/unicharset
...
Remove UNICHARSET::load_from_inmemory_file and related code
2019-08-12 14:50:28 +03:00
Stefan Weil
beec85e023
Remove UNICHARSET::load_from_inmemory_file and related code
...
The method was only used in unittest where it can be replaced by
UNICHARSET::load_from_file which also simplifies the code.
This allows removing the class InMemoryFilePointer and fixes a TODO.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-12 13:07:15 +02:00
Egor Pugin
ae020e7fbd
[sw] Update build script.
2019-08-10 15:46:59 +03:00
Stefan Weil
315dd9df3f
cmake: Don't link pthread on Windows
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-07 15:24:00 +02:00
Stefan Weil
ab953c1d51
unittest: Fix build and simplify build rules
...
Now more tests (those which use fileio) depend on the training build.
This is required since commit c5a50b93ce.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-07 13:58:12 +02:00
Stefan Weil
9786b7276e
Fix linker error in Appveyor CI
...
This completes commit c5a50b93ce.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-07 10:24:57 +02:00
Stefan Weil
b8079d8ce1
universalambigs: Add hack to fix builds with Microsoft compiler
...
The MS compiler only accepts string constants up to 65535 characters,
so shorten the string for that compiler to fix the compilation.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-06 15:46:07 +02:00
Egor Pugin
cb99fe9b41
[sw] Use the latest pango again.
2019-08-06 15:04:32 +03:00
Zdenko Podobný
c5a50b93ce
move fileio.cpp and fileio.h to training (this fixes the Android build)
2019-08-04 21:26:39 +02:00
zdenop
f1eb172cb6
Merge pull request #2602 from stweil/ambigs
...
Clean ambigs.h and replace octal characters by UTF-8 string in universalambigs
2019-08-04 20:08:03 +02:00
Stefan Weil
6acab45837
universalambigs: Replace octal characters by UTF-8 string
...
This improves readability and reduces the file size.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-04 19:21:59 +02:00
Stefan Weil
8127b4dd27
Clean ambigs.h
...
* Remove unused kUnigramAmbigsBufferSize and kAmbigNgramSeparator
* Move some declarations to ambigs.cpp
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-08-04 19:21:59 +02:00
Egor Pugin
c58efee4ba
Use pangocairo-1.43 for the moment. Remove private pango header.
2019-08-01 11:55:18 +03:00
Egor Pugin
f1a567e814
Try to fix #2599
2019-08-01 11:35:15 +03:00
Egor Pugin
08d50baf2c
[sw] Install tess headers.
2019-07-28 18:20:43 +03:00
Stefan Weil
23ef93ac4d
cmake: Add missing pthread library
...
It is needed for C++ threads since commit 85068be405.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-26 07:45:51 +02:00
Stefan Weil
25f2af9d1d
cmake: Add missing include directory for LibArchive
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-25 09:13:31 +02:00
Egor Pugin
8dc33b9815
[appveyor] Disable VS2019 image because it's too slow.
2019-07-19 16:00:19 +03:00
Stefan Weil
e6ca7f3ec6
hocrrenderer: Add missing escaping of special characters in HTML output
...
This converts special characters like '<' or '>' to the
correct HTML entities.
Also optimize the code a little bit.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-19 13:53:36 +02:00
Stefan Weil
2679cae5d8
Simplify code by using ClipToRange
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-19 13:37:39 +02:00
Egor Pugin
3560097fe9
Update appveyor.yml
2019-07-18 17:38:33 +03:00
Stefan Weil
4b2927ae41
LSTMRecognizer: Add non const get functions
...
This allows removing several const casts.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-18 11:26:51 +02:00
Stefan Weil
4cb3f34c09
Improve formatting of hOCR output with character boxes
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-18 11:07:18 +02:00
Stefan Weil
9195a904a7
Use auto data type for results of std::ftell
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-18 10:56:17 +02:00
Stefan Weil
4132194c49
Remove unused filesize_ from class InputBuffer
...
This also simplifies the constructors.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-18 10:48:27 +02:00
Stefan Weil
a2b13b49ff
Simplify shell code (fixes warning from Codacy)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-07-17 21:33:24 +02:00