5567 Commits

Author SHA1 Message Date
Stefan Weil
93348a83a3 Remove scripts for training
They were replaced by Python3 scripts (part of the tesstrain repository).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-18 10:47:44 +02:00
Stefan Weil
5eb2e86635 Fix some typos (found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-17 15:18:43 +02:00
nagadomi
42e4b91132 Refactor ObjectCache::DeleteUnusedObjects with reverse iterator
2021-05-17 14:50:30 +02:00
Stefan Weil
d4bcbbd6de Add test for TessBaseAPI::ClearPersistentCache to unit tests
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-17 09:28:21 +02:00
Egor Pugin
83a27d357a Merge pull request #3431 from nagadomi/fix_segv
Fix crash in ObjectCache::DeleteUnusedObjects
2021-05-17 07:50:00 +03:00
nagadomi
dc4a8a6ce0 Fix crash in ObjectCache::DeleteUnusedObjects
2021-05-17 10:25:17 +09:00
Stefan Weil
1c77e50b97 travis: Use libleptonica-dev from Ubuntu focal instead of local build
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-16 11:56:12 +02:00
Stefan Weil
f6a67c52d8 Simplify list of required packages for CI builds
libcairo2-dev is automatically installed as a direct dependency of libpango1.0-dev.
libicu-dev is an indirect dependency and also installed automatically.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-16 11:23:58 +02:00
Stefan Weil
0c4e2f1cb5 Fix comment in code
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-16 07:47:19 +02:00
Stefan Weil
364431dbb9 Remove unneeded dependency autoconf-archive from CI builds
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-16 07:45:12 +02:00
Stefan Weil
19db25e5e5 Update submodule abseil to tagged release 20210324.1
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-15 15:08:01 +02:00
Amit D
bbfd240d46 Update README.md
2021-05-15 15:06:47 +02:00
Stefan Weil
57b7974292 Remove an arbitrary limit for the image size
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-15 15:03:22 +02:00
Stefan Weil
a0cf117c5d Fix compiler warning in binarization code (uninitialized local variable)
Simplify the code also a little bit.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-15 15:03:22 +02:00
Stefan Weil
bf84fb9f2d Optimize code for binarization
Some code is only needed for Otsu or even not at all.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-15 15:03:22 +02:00
Stefan Weil
4b5dd25b84 Fix compiler warning
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-15 15:03:22 +02:00
Stefan Weil
bf3421ff12 Fix autoconf build for latest MacOS (Intel and M1)
On latest MacOS 11.3 the system header file "ostream" includes a file
named "version".

The macro DEFAULT_INCLUDES adds the source root to the list of include
directories by default. As MacOS uses a case insensitive file system,
the compiler finds and includes the file "VERSION" there which causes
compiler errors and a failing build process.

Setting an empty DEFAULT_INCLUDES fixes that, but requires moving
config_auto.h to another directory in the include search path.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-13 17:22:58 +02:00
Stefan Weil
12c29639fc Add conditional compilation with GRAPHICS_DISABLED
This fixes a compiler warning when GRAPHICS_DISABLED is defined.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-13 17:22:24 +02:00
Stefan Weil
6c7df2acf3 Fix warning from autoconf
autoconf reported this warning:

    configure.ac:263: warning: AC_CHECK_HEADERS("tensorflow/core/framework/graph.pb.h"): you should use literals

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-05-13 12:57:20 +02:00
Egor Pugin
95689f5c5c Merge pull request #3422 from nickjwhite/lstmevalmoreinfo
lstmeval: Print char and word error rates for each line tested
2021-05-11 15:25:16 +03:00
Nick White
ad7010a5eb lstmeval: Only print char and word error rates for verbosity 2/3
2021-05-11 13:15:35 +01:00
Nick White
4787414d88 lstmeval: Print char and word error rates for each line tested
2021-05-11 10:54:34 +01:00
Egor Pugin
43747d6ea8 Postfix for #3418.
2021-05-10 15:06:27 +03:00
Egor Pugin
e7c01a6f15 Merge pull request #3418 from amitdo/thresholder
Add more binarization options
2021-05-10 14:45:03 +03:00
Amit Dovev
21e76c7a13 Convert enum ThreshMethod to enum class
2021-05-09 18:49:09 +03:00
Egor Pugin
176d0927bd Allow explicit casts of Image to Pix**.
2021-05-07 21:30:42 +03:00
Amit Dovev
11c73c9481 Add more binarization options
Use functions from Leptonica to provide more binarization options. The new options are: 1) Adaptive Otsu and 2) Sauvola (Tiled).
2021-05-07 16:48:26 +03:00
Egor Pugin
65118b2e3a [misc] Fix variable type. Fixes warning.
2021-05-04 16:12:40 +03:00
Egor Pugin
346b77c94e Remove unneeded header.
2021-05-04 16:10:52 +03:00
Egor Pugin
4fbe9f1de2 Revert d6cdc52. Fixes #3412.
2021-05-04 00:51:39 +03:00
zdenop
a9682bb6e2 Merge pull request #3410 from ulb-sachsen-anhalt/fix/fix-typo-icu
[doc][fix] typo libicu-devel
2021-04-30 13:32:54 +02:00
Uwe Hartwig
930c909db5 [doc][fix] typo libicu-devel
2021-04-30 13:01:32 +02:00
Egor Pugin
ff92aa6f16 Merge pull request #3408 from GerHobbelt/patch-2
fix compile error: PrintFontsTable() is for legacy builds only
2021-04-30 00:33:46 +03:00
Ger Hobbelt
bd8adff829 fix compile error: PrintFontsTable() is for legacy builds only
# Conflicts:
#	googletest
2021-04-29 23:27:20 +02:00
Egor Pugin
723eb135c5 Merge pull request #3406 from Lucas-C/add-print-fonts-table-arg
Adding --print-fonts-table parameter & tessedit_font_id configuration option
2021-04-29 13:54:58 +03:00
Lucas Cimon
b852d658cb Adding --print-fonts-table parameter & tessedit_font_id configuration option
2021-04-29 11:25:40 +02:00
zdenop
aa8f654416 Update cmake-win64.yml
fix png link
2021-04-29 10:12:43 +02:00
Stefan Weil
2e2a5b3ef4 Improved fix for issue #3405
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-27 22:15:36 +02:00
Stefan Weil
0b7fc068d2 Revert "Fix double free. Closes #3405."
This reverts commit 3997cf54d2.
It will be replaced by a simpler fix.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-27 22:15:18 +02:00
Egor Pugin
3a195e5b05 Misc.
2021-04-27 22:08:29 +03:00
Egor Pugin
3997cf54d2 Fix double free. Closes #3405.
2021-04-27 22:08:06 +03:00
Egor Pugin
e3ac1835e0 Remove unneeded ctor.
2021-04-23 04:26:18 +03:00
Egor Pugin
a7f938d28e Make FontSet just a vector.
2021-04-23 04:25:45 +03:00
Egor Pugin
4ae5a7d6b5 Properly init font set.
2021-04-23 04:05:59 +03:00
Egor Pugin
048e63c02b Replace FontSet struct with vector. It may be improved further (remove pointer?).
2021-04-23 02:38:25 +03:00
Egor Pugin
d6cdc521e5 Remove unused headers.
2021-04-23 02:06:06 +03:00
Stefan Weil
740d10b61b Fix issue #3404 (empty page regression)
The regression was caused by a bug in commit 5db92b26aa.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-22 20:51:23 +02:00
Stefan Weil
66a963b50a Remove two assertions which are triggered by fuzzing
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-20 19:04:49 +02:00
Stefan Weil
26c21a6db4 Fix some compiler warnings with GRAPHICS_DISABLED
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-20 07:58:31 +02:00
Stefan Weil
6d0595b443 Fix memory leak (OSS-Fuzz issue 33220)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-04-19 20:59:18 +02:00