Stefan Weil
ec87dd4d49
Abort LSTM training with integer model (fixes issue #1573)
...
Tesseract currently cannot continue LSTM training from an
integer (fast) model.
Report this to users who try it nevertheless instead of crashing
with an assertion.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-06 08:18:55 +02:00
Stefan Weil
b5d4b67a3a
Update test submodule
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-05 21:37:34 +02:00
Stefan Weil
a027dca007
Extend URI support for Tesseract with libcurl
...
libcurl not only supports HTTP and HTTPS, but also a lot of other protocols,
for example FTP and SFTP. Those protocols can also be useful for Tesseract.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-05 16:49:22 +02:00
Egor Pugin
1f437f3be8
Merge pull request #3545 from stweil/issue-3544
...
Rename processed TIFF output file and add page number if needed (fixe…
2021-09-01 15:26:06 +03:00
Stefan Weil
7fc9a34f79
Rename processed TIFF output file and add page number if needed (fixes issue #3544)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-01 14:16:05 +02:00
Robert Pösel
40fdacd485
Add missing check for __ARM_NEON
...
This makes it consistent with the intsimdmatrixneon.cpp file and allows this file to be included in builds even on non-NEON platforms (simplifies the build config).
2021-08-26 15:28:59 +02:00
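The guard described in the commit above can be illustrated with a minimal sketch. It assumes a hypothetical helper `DotProductSketch` (not a Tesseract function): the NEON intrinsics are compiled only when the compiler defines `__ARM_NEON`, so the same source file still builds on other platforms and falls back to a scalar loop there.

```cpp
#include <cstddef>

#if defined(__ARM_NEON)
#include <arm_neon.h>
#endif

// Hypothetical helper for illustration only; not Tesseract's implementation.
float DotProductSketch(const float* u, const float* v, std::size_t n) {
  float total = 0.0f;
  std::size_t i = 0;
#if defined(__ARM_NEON)
  // Vector part: process four floats per iteration.
  float32x4_t acc = vdupq_n_f32(0.0f);
  for (; i + 4 <= n; i += 4) {
    acc = vmlaq_f32(acc, vld1q_f32(u + i), vld1q_f32(v + i));
  }
  // Horizontal sum of the four accumulator lanes.
  float lanes[4];
  vst1q_f32(lanes, acc);
  total = lanes[0] + lanes[1] + lanes[2] + lanes[3];
#endif
  // Scalar part: the remainder on NEON builds, everything on other builds.
  for (; i < n; ++i) {
    total += u[i] * v[i];
  }
  return total;
}
```

Because the guard sits inside the source file, a build system can list the file unconditionally instead of selecting it per platform.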
Egor Pugin
0fb170b994
Merge pull request #3540 from stweil/tessdata_prefix
...
Fix handling of TESSDATA_PREFIX containing // (fixes issue #3527)
2021-08-24 21:53:27 +03:00
Stefan Weil
4dcd8fa591
Fix handling of TESSDATA_PREFIX containing // (fixes issue #3527)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-24 20:05:54 +02:00
Egor Pugin
e57a3113fb
Merge pull request #3539 from stweil/submodels
...
Use model prefix also for submodels
2021-08-24 16:07:27 +03:00
Stefan Weil
391e713ae8
Use model prefix also for submodels
...
Also fix a regression in the for loop which handles submodels.
Fixes: 0d91c700c0
("Modernize code in Tesseract::init_tesseract")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-24 13:41:00 +02:00
Stefan Weil
7cfcfe1101
cmake: Remove universalambigs.cpp
...
Fixes: 407346246c
("[universalambigs] Use inline variables.")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-23 07:30:03 +02:00
Stefan Weil
0d91c700c0
Modernize code in Tesseract::init_tesseract
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-23 07:30:03 +02:00
Egor Pugin
1d3d1fbc62
Move member function bodies into class template.
2021-08-20 12:42:40 +03:00
Egor Pugin
c539328d7d
Merge branch 'master' of github.com-egorpugin:tesseract-ocr/tesseract
2021-08-20 12:38:12 +03:00
Egor Pugin
407346246c
[universalambigs] Use inline variables.
2021-08-20 12:38:03 +03:00
Stefan Weil
7acda5cb6c
Fix cloning of Image with pix_ == nullptr (issue #537)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-18 19:22:23 +02:00
Egor Pugin
feb32ecbe5
Merge branch 'master' of github.com-egorpugin:tesseract-ocr/tesseract
2021-08-18 18:15:05 +03:00
Egor Pugin
6056c84977
[image] Mark PIX** cast explicit to prevent implicit bool checks in ternary operators.
2021-08-18 18:14:47 +03:00
Stefan Weil
547164edae
Create new pre-release 5.0.0-beta-20210815
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-15 17:07:11 +02:00
Egor Pugin
536112ce6f
[sw] Fix build.
2021-08-12 22:46:45 +03:00
Stefan Weil
59271470b4
Remove unneeded type cast
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-12 20:55:14 +02:00
Stefan Weil
aaec341449
Avoid call of ColumnFinder::DisplayBlocks (small optimization)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-12 15:23:44 +02:00
Stefan Weil
6da7d6fcda
Optimize check for non empty string and fix code
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-12 14:45:22 +02:00
Stefan Weil
92cae8f194
Optimize check for non empty string
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-12 14:44:45 +02:00
Stefan Weil
63c12a9ee5
unittest: Enable more code for tatweel_test without requiring Tensorflow
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-12 14:12:53 +02:00
Egor Pugin
c1180a8bc0
Merge pull request #3520 from stweil/unused
...
Remove some unused code
2021-08-10 23:36:34 +03:00
Stefan Weil
3ef403c345
Compile LSTM::PrintW and LSTM::PrintDW conditionally
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-10 22:04:57 +02:00
Stefan Weil
5d99041f5d
Remove unused function Wordrec::merge_fragments
...
Also remove further functions which are now unused.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-10 22:04:57 +02:00
Stefan Weil
f1c8df0ce9
Remove unused global variable fx_debug
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-10 22:04:57 +02:00
Egor Pugin
3178c49729
Merge pull request #3517 from stweil/alto
...
Write image filename in ALTO output and reduce size of renderer classes
2021-08-08 00:17:31 +03:00
Stefan Weil
16fd1439fa
Write image filename in ALTO output
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-07 22:14:03 +02:00
Stefan Weil
5f10fed5d9
Reduce size of TessResultRenderer
...
Changing the member order reduces the size from 72 to 64 bytes
on 64-bit Linux.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-07 22:14:03 +02:00
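The size reduction reported for TessResultRenderer above is the usual effect of alignment padding. The following sketch uses two hypothetical structs (the member names are invented and are not the real TessResultRenderer members) to show how grouping small members after pointer-sized ones can shrink a type.

```cpp
#include <cstddef>

// Hypothetical layout: small members interleaved with pointers, so the
// compiler typically inserts padding after each bool.
struct InterleavedLayout {
  bool first_flag;
  void* next;
  bool second_flag;
  const char* title;
};

// Same members, with the pointers first and the bools grouped at the end,
// so the two bools can share a single padded slot.
struct GroupedLayout {
  void* next;
  const char* title;
  bool first_flag;
  bool second_flag;
};

// Bytes saved by the reordering on the current platform.
constexpr std::size_t SavedBytes() {
  return sizeof(InterleavedLayout) - sizeof(GroupedLayout);
}
```

On a typical 64-bit Linux ABI the two structs occupy 32 and 24 bytes respectively. The exact numbers depend on the platform's alignment rules, which is why the commit message qualifies its figures with the platform.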
Stefan Weil
95223cfaab
cmake: Link tiff library only for Windows
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-07 19:57:24 +02:00
Stefan Weil
2215174951
unittest: Fix compiler warning for unused function
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-07 18:14:50 +02:00
Egor Pugin
3a68a80eed
Merge pull request #3516 from stweil/abseil
...
Remove submodule abseil
2021-08-07 15:05:29 +03:00
Egor Pugin
33fcb99d3a
[sw] Do not build arm neon file.
2021-08-07 13:40:47 +03:00
Stefan Weil
49f410ced3
unittest: Remove dependency on absl::StripAsciiWhitespace()
...
This removes the last dependency on Abseil, so that submodule
is now removed completely.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:10 +02:00
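As an illustration of the kind of replacement these abseil-removal commits call for, here is a minimal standard-library-only sketch of whitespace stripping. The name `StripAsciiWhitespaceSketch` is hypothetical, and the code is not taken from the Tesseract unit tests. It assumes C++17 for `std::string_view`.

```cpp
#include <string_view>

// True for the six ASCII whitespace characters.
inline bool IsAsciiSpace(char c) {
  return c == ' ' || c == '\t' || c == '\n' || c == '\v' || c == '\f' ||
         c == '\r';
}

// Hypothetical stand-in for a strip helper: returns a view of `s` without
// leading and trailing ASCII whitespace. The view refers to the caller's data.
inline std::string_view StripAsciiWhitespaceSketch(std::string_view s) {
  while (!s.empty() && IsAsciiSpace(s.front())) {
    s.remove_prefix(1);
  }
  while (!s.empty() && IsAsciiSpace(s.back())) {
    s.remove_suffix(1);
  }
  return s;
}
```

Returning a view avoids a copy, but the result is only valid while the original string is alive.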
Stefan Weil
87707bb8b0
unittest: Remove dependency on absl::StrSplit()
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
f407345cbe
unittest: Remove dependency on absl::StrJoin()
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
61b8e301dd
unittest: Remove dependency on absl::StrCat()
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
8486f59493
unittest: Remove dependency on absl::StrFormat()
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
fe5ca9dad9
unittest: Remove dependency on absl::GetCurrentTimeNanos()
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
6b8b1f0007
unittest: Remove some dependencies on abseil
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:59:09 +02:00
Stefan Weil
d50baec7a7
cmake: Add dotproductneon.cpp
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-06 20:55:40 +02:00
zdenop
7975fec2fd
Add new cmake option -DFAST_FLOAT=ON for faster LSTM with float (#3514)
...
Co-authored-by: Stefan Weil <sw@weilnetz.de>
2021-08-05 21:35:54 +02:00
Stefan Weil
4c8799ac40
codeql-analysis: Disable analysis of Python code
...
It should be enabled again (together with the analysis for Java)
as soon as it no longer compiles all the C++ code as well.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-03 11:53:51 +02:00
Stefan Weil
a73e7b97a4
Add float dotproduct implementation for NEON
...
Signed-off-by: Stefan Weil <stefan.weil@bib.uni-mannheim.de>
2021-08-03 10:35:22 +02:00
Stefan Weil
bb4a1219d7
Improve setting of dot product functions via environment variable
...
Apply the settings which are selected by environment variable DOTPRODUCT
after the autodetection which detects the available SIMD hardware.
'accelerate', 'fma' and 'std::inner_product' now no longer change
the setting for intSimdMatrix to 'generic' because they don't provide
their own implementation for it.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-03 10:34:33 +02:00
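The ordering described in the commit above (autodetect first, then apply the DOTPRODUCT override) can be sketched as follows. The `Selection` struct, the function names and the "avx2" label in the usage note are assumptions made for this illustration. Only the environment variable name and the three override values come from the commit message, and values other than those three are simply ignored here.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical record of which implementations are in use.
struct Selection {
  std::string dot_product;      // implementation used for the dot product
  std::string int_simd_matrix;  // implementation used for intSimdMatrix
};

// Applies an override on top of the autodetected selection. The three values
// below only replace the dot product function; intSimdMatrix keeps whatever
// the autodetection chose.
Selection ApplyDotProductOverride(Selection detected,
                                  const char* override_value) {
  if (override_value == nullptr) {
    return detected;  // no override requested
  }
  const std::string value(override_value);
  if (value == "accelerate" || value == "fma" ||
      value == "std::inner_product") {
    detected.dot_product = value;
  }
  return detected;
}

// Reads the override from the environment after autodetection has run.
Selection SelectionFromEnvironment(const Selection& detected) {
  return ApplyDotProductOverride(detected, std::getenv("DOTPRODUCT"));
}
```

For example, with a detected selection of {"avx2", "avx2"} and an override of "fma", the dot product becomes "fma" while intSimdMatrix stays "avx2".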
Stefan Weil
2786a887cd
Update codeql-analysis.yml for Tesseract autotools build
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-01 13:23:28 +02:00
Stefan Weil
ebae27435a
Update codeql-analysis.yml for Tesseract autotools build
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-08-01 13:17:45 +02:00