Commit Graph

5997 Commits

Author SHA1 Message Date
Amit D
d37dd73439
Fix broken msys2 build with gcc 11
Fix #3672.
2021-12-05 08:57:49 +02:00
Amit D
9508530aa3
Update unittest-disablelegacy.yml 2021-12-03 15:11:10 +02:00
Egor Pugin
c540102620
Merge pull request #3669 from pkubaj/patch-1
Correctly detect amd64 and i386 on FreeBSD
2021-12-02 08:01:31 +03:00
pkubaj
fb74c7eac1
Correctly detect amd64 and i386 on FreeBSD
FreeBSD uses amd64 and i386 names.
2021-12-02 03:46:41 +00:00
Amit D
4abb0e08bc
README: Latest stable version is 5.0.0 2021-12-01 22:36:29 +02:00
Stefan Weil
924b746e6f Create new release 5.0.0
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-30 18:48:16 +01:00
Egor Pugin
b5d33a104b
Merge pull request #3664 from stweil/classify
Fix some compiler warnings and avoid float / double conversions in class Classify
2021-11-28 23:04:01 +03:00
Egor Pugin
bb155a1bb4
Merge pull request #3663 from stweil/clang7
Allow compilation with clang-7
2021-11-28 23:02:41 +03:00
Stefan Weil
eb089c1346 autobuild: Fix autogen.sh (reduce build time)
After running autogen.sh and configure, the following make had to
run autoreconf because of dependencies which needed an update.

This is fixed by running aclocal twice.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-28 19:22:58 +01:00
Stefan Weil
a1f40cadc1 Avoid some unnecessary conversions from float to double
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-28 18:55:27 +01:00
Stefan Weil
5e8d877262 Modernize code in class Classify
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-28 18:44:20 +01:00
Stefan Weil
ffe2038ea6 Allow compilation with clang-7
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-28 10:45:46 +01:00
Stefan Weil
839f528b9a Remove unused GenericVector::contains_index, UnicityTable::contains_id
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-28 09:54:59 +01:00
Stefan Weil
8b21e4f0b8 Remove member function GenericVector<T>::contains
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-27 09:40:36 +01:00
Stefan Weil
739057c586 Remove member function UnicityTable<T>::contains
It was only used once, and the code using it can be simplified.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-27 09:40:17 +01:00
Egor Pugin
3313bb794b
Merge pull request #3657 from stweil/bcer
Limit BCER to interval [0,1]
2021-11-25 13:43:47 +03:00
Stefan Weil
99aea21336 Limit BCER to interval [0,1]
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-25 08:04:26 +01:00
Egor Pugin
515e9906d4
Update sw.yml 2021-11-24 18:41:06 +03:00
Egor Pugin
6f399c0df1
[ci] Add vs2022 to sw workflow. 2021-11-24 14:17:08 +03:00
Stefan Weil
ee29fca9ce Create new release 5.0.0-rc3
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-22 22:01:18 +01:00
Amit D
2087c45f20
Update unittest-disablelegacy.yml 2021-11-22 21:28:29 +02:00
Stefan Weil
2c4665466e Format code with clang-format
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-22 19:47:39 +01:00
Bernhard Liebl
555aa55f05 Add RowAttributes getter to PageIterator
[sw]: Cherry-picked commit from 4.1 branch

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-22 19:47:39 +01:00
Stefan Weil
b649222de3 Fix resultiterator_test with --disable-legacy
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-20 14:58:36 +01:00
Stefan Weil
5f27310d22 Fix some compiler warnings with --disable-legacy
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-20 14:57:08 +01:00
Amit D
47abbaa48f
Training: Fix compiler warnings (#3650)
warning: format ‘%c’ expects argument of type ‘int’, but argument 2 has type ‘tesseract::Validator::CharClass’ [-Wformat=]
2021-11-19 21:01:04 +02:00
Amit D
34b4391227
Update unittest-disablelegacy.yml 2021-11-19 11:05:20 +02:00
Amit D
75253a24c7
Improve the disable legacy build (#3649)
resultiterator_test: Disable some parts of EasyTest.
2021-11-19 10:52:18 +02:00
Stefan Weil
455feb35f2 Replace char error by BCER in more training messages
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-18 21:34:16 +01:00
Amit D
ff11f5dc65
Improve disable legacy build (#3648)
resultiterator_test: Disable SmallCapDropCapTest

Co-authored-by: Shree Devi Kumar <5095331+Shreeshrii@users.noreply.github.com>
2021-11-18 16:07:55 +02:00
Stefan Weil
981c167f8c Improve result message from lstmeval
Old message:

    At iteration 0, stage 0, BCER eval=2.553356, BWER eval=5.586173

New message:

    BCER eval=2.553356, BWER eval=5.586173

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-17 09:02:49 +01:00
Stefan Weil
c716ebdc42
Improve training messages (issue #3560) (#3644)
The old messages could wrongly be interpreted as CER / WER values,
but Tesseract training currently uses simple bag of characters /
bag of words error rates (see LSTMTrainer::ComputeCharError,
LSTMTrainer::ComputeWordError).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-17 09:39:23 +02:00
Stefan Weil
ef3bf98cc1 lstmtrainer: Fix comment
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-15 20:19:54 +01:00
Stefan Weil
83ad8a18de Clean code with clang-tidy (performance-move-const)
Command used:

    clang-tidy --checks="-*,performance-move-const-arg"

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-15 20:18:29 +01:00
Stefan Weil
f48620fffb scrollview: Add const attributes
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-15 20:17:59 +01:00
Stefan Weil
66dc90bc5f Create new release 5.0.0-rc2
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 20:04:23 +01:00
Stefan Weil
f0b8c0254b stepblob: Fix some warnings from clang-tidy
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 16:40:38 +01:00
Stefan Weil
25cdca6492 combine_tessdata: Print "Version:" instead of "Version string:"
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 16:38:52 +01:00
Stefan Weil
d8d63fd71b Optimize performance with clang-tidy
The code was partially formatted with clang-format and optimized with

    clang-tidy --checks="-*,perfor*" --fix src/*/*.cpp

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 15:54:04 +01:00
Stefan Weil
e5011c545a Remove unused function ScrollView::AwaitEventAnyWindow
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 12:10:37 +01:00
Stefan Weil
37b33749da ScrollView: Fix memory leak and modernize code
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-14 10:34:20 +01:00
Stefan Weil
371ee2232e Remove spaces at line endings and empty last lines
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 22:45:47 +01:00
Stefan Weil
e18826cfab Fix some compiler warnings and modernize code in class TrainingSampleSet
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 22:33:22 +01:00
Stefan Weil
6360e60877 Modernize code in TessBaseAPI::Init
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 21:43:46 +01:00
Stefan Weil
03f2cfdf02 Show tessdata directory when listing models
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 21:43:01 +01:00
Stefan Weil
c2ee0cd06f Fix listing of languages
The last fix for OCR with more than one model introduced
a regression for `tesseract --list-langs`.

Fixes: 9091055783 ("Fix loading of additional model files")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 21:34:29 +01:00
Stefan Weil
ebce8ab2eb combine_tessdata: Support -dl and -ld options
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 11:33:10 +01:00
Stefan Weil
905795041f Fix new GitHub action CIFuzz
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 09:56:26 +01:00
Stefan Weil
3378d79ae6 Add new GitHub action CIFuzz
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-13 09:42:04 +01:00
Stefan Weil
5884036ecd Don't use compiler flags -march=native -mtune=native in autoconf builds
Using those flags is not acceptable for Linux distributions
because the resulting code then depends on the build
infrastructure, so the build result is not deterministic.

It is still possible to use those compiler flags by specifying
CXXFLAGS.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-11-11 12:29:51 +01:00