Commit Graph

6065 Commits

Author SHA1 Message Date
Stefan Weil
a9c3f6d87f ccmain/paragraphs: Make local function UnicodeFor and fix signed/unsigned
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 20:00:30 +02:00
Stefan Weil
4c36e2e29a Fix compiler warnings in TWERD::MergeBlobs and optimize code
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 20:00:30 +02:00
Stefan Weil
0cdcd0f02b Remove unused code
Fixes: 766b7bd620 ("Don't drop words with low certainty")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 20:00:30 +02:00
Stefan Weil
ca0e68f046 Avoid implicit conversions from float to double
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 20:00:30 +02:00
Stefan Weil
9315d4c7e2 Change size and count arguments in TFile from int to size_t
This matches standard functions like fread, fwrite.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 20:00:30 +02:00
Stefan Weil
85cb6678fa Replace new / delete by std::unique_ptr and std::vector in class Classify
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 14:08:12 +02:00
Stefan Weil
5d903da1ce Replace new / delete by std::vector in class Wordrec
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 14:07:25 +02:00
Stefan Weil
467f24c0b6 Replace new / delete by std::vector in class Trie
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 14:06:48 +02:00
Stefan Weil
ed1100832c Replace new / delete by std::vector in class WERD_CHOICE
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-10 14:05:47 +02:00
Stefan Weil
0aad8b8619 Fix build with OpenCL and add namespace to OpenCL code
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-06 07:51:03 +02:00
Stefan Weil
571a8fe1d7 autotools: Use AM_LDFLAGS for framework Accelerate
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-10-06 07:40:54 +02:00
Egor Pugin
7874c3fa88
Merge pull request #3589 from Shreeshrii/main
Actions CI: cmake - fix macos for -lcurl, add macos-11 to matrix
2021-10-02 23:34:37 +03:00
Shree Devi Kumar
653313e7fb Actions CI: cmake - fix macos for -lcurl, add macos-11 to matrix 2021-10-02 15:59:02 +00:00
Egor Pugin
ae8e748000
Merge pull request #3581 from MohanSha/patch-1
added table of contents
2021-09-30 23:23:27 +03:00
Mohan Sha
7d23e0272e
added table of contents 2021-10-01 00:33:05 +05:30
Amit D
0cb9c40528
Add configurable variables to control thresholding (#3577) 2021-09-29 23:17:22 +03:00
zdenop
c4ad9b7bbf cmake: fix curl linking if CURL_LIBRARIES is not available 2021-09-25 12:31:05 +02:00
zdenop
da084ea9e6 cmake: fix copy&paste error 2021-09-25 12:26:06 +02:00
zdenop
ebb214c443 destroy temporary page_pix 2021-09-25 10:26:31 +02:00
zdenop
9a53758bc9 cmake: improve formating 2021-09-25 10:10:38 +02:00
zdenop
b294e3fc92
Merge pull request #3578 from adaptech-cz/improve-cmake
cmake: Improve configuration
2021-09-25 10:03:38 +02:00
Robert Pösel
362ed9b5e7 cmake: Improve configuration
- Use correct library name in TesseractConfig.cmake on all platforms
- Expose Tesseract_VERSION and Tesseract_VERSION_* variables in TesseractConfig.cmake
2021-09-24 22:15:46 +02:00
Egor Pugin
749beb6b14
Merge pull request #3576 from tesseract-ocr/Shreeshrii-patch-1
Update vcpkg.yml
2021-09-24 22:50:47 +03:00
zdenop
6b4447b931 cmake: remove REQUIRED during finding leptonica library, as we raise FATAL_ERROR later 2021-09-24 20:43:36 +02:00
zdenop
a4a14cb92b fix vcpkg action 2021-09-24 20:30:10 +02:00
Shreeshrii
e7d7ca86d6
Update vcpkg.yml
As suggested by @zdenop  in https://github.com/tesseract-ocr/tesseract/issues/3574#issuecomment-926031923
2021-09-24 21:35:54 +05:30
Stefan Weil
f9d17598a8 Make automake builds less noisy by default
The old commit only silenced parts of the build,
while the new one silences the whole build.

Fixes: 47af1282f4
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-24 15:21:26 +02:00
Amit D
adaaef87a4
Fix wrong tiles parameters in Sauvola (#3570)
Thanks to Robert Sachunsky @bertsky that pointed out the issue.
2021-09-23 10:26:07 +03:00
Amit D
6998c0ed71
Merge pull request #3571 from MerlijnWajer/hocr-write-scan-res
hocrrenderer: write scan_res property to the ocr_page
2021-09-22 09:22:26 +03:00
Stefan Weil
6ef6e36f78 codeql: Run apt-get update before trying to install Ubuntu packages
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-22 07:04:33 +02:00
Merlijn Wajer
ca177e72f3 hocrrenderer: write scan_res property to the ocr_page
This will make Tesseract emit the DPI of the document, if known at OCR
time. This is requird to properly interpret the x_fsize (font size)
property of words, since Tesseract scales the font size to the DPI.

See issue #3326 (https://github.com/tesseract-ocr/tesseract/issues/3326)
2021-09-21 11:02:52 +02:00
zdenop
19cc9afb25 cmake: add Tesseract_LIBRARY_DIRS 2021-09-19 18:34:24 +02:00
zdenop
89d86d6eee cmake: improve configuration,
prefer find_package instead of PKG_CONFIG for leptonica and tiff
2021-09-19 18:28:45 +02:00
zdenop
2d397a8551 cmake: improve libarchive support - prefer cmake function instead of pkg-config 2021-09-19 11:17:40 +02:00
zdenop
eafbb2b22a cmake: add option to build with libcurl support 2021-09-19 11:13:21 +02:00
zdenop
ff5f59cf24 cmake: fix cygwin GNU c++ build; fixes #2379 2021-09-19 11:06:19 +02:00
Egor Pugin
9ac988b27a
Merge pull request #3568 from adaptech-cz/fix-cmake-linking
Fix linking Tesseract to project using CMake on Linux
2021-09-17 23:15:39 +03:00
Robert Pösel
d7528e7cea Fix linking Tesseract to project using CMake on Linux
In the past Tesseract library was wrongly named as "liblibtesseract" so it was needed to use "-llibtesseract" as argument for linker. The name was fixed in commit 52cac3a42e but the ${Tesseract_LIBRARIES} variable still wrongly holds "libtesseract" and causes linker error when one is using it in target_link_libraries(...). This commit fixes that.
2021-09-17 21:35:30 +02:00
zdenop
0c49ee18cd fix visibility compilation 2021-09-17 17:38:01 +02:00
zdenop
4bdeda4af7
Merge pull request #3566 from stweil/issue3564
Simplify function LoadTrainingData and fix mastertrainer_test
2021-09-17 10:45:21 +02:00
Stefan Weil
638045133f Simplify function LoadTrainingData and fix mastertrainer_test
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-17 08:24:50 +02:00
Egor Pugin
2b4c3599de
Merge pull request #3565 from stweil/issue3564
Fix crash of shapeclustering (fixes #3564)
2021-09-17 00:11:49 +03:00
Stefan Weil
d87e08f266 Fix crash of shapeclustering (fixes #3564)
Fixes: 4415209fd6 ("Remove tessopt. This fixes mastertrainer test in shared build")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-16 22:31:09 +02:00
Stefan Weil
75f167ac8c Create new pre-release 5.0.0-beta-20210916
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-16 06:35:29 +02:00
Stefan Weil
386dd8a0c0 Update (master branch was renamed to main)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-13 07:42:46 +02:00
Stefan Weil
60fd2b4aba CI: Link basicapitest with Accelerate framework for MacOS
Clean also some other compiler options for basicapitest.

Fixes: 3ab8dcbf72 ("Use Apple Accelerate framework [...]")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-06 17:19:55 +02:00
Stefan Weil
e5e12f2856 Disable HAVE_FRAMEWORK_ACCELERATE for compilers which fail to compile with it
g++-10 and g++-11 throw compiler errors in builds with the
Accelerate framework, so disable it for all GNU compilers
before version 12 (which still has to be tested).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-06 17:15:46 +02:00
Egor Pugin
35dee4646f
Merge pull request #3549 from stweil/issue1573
Abort LSTM training with integer model (fixes issue #1573)
2021-09-06 13:11:46 +03:00
Stefan Weil
ec87dd4d49 Abort LSTM training with integer model (fixes issue #1573)
Tesseract currently cannot continue LSTM training from an
integer (fast) model.

Report this to users who try it nevertheless instead of crashing
with an assertion.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-06 08:18:55 +02:00
Stefan Weil
b5d4b67a3a Update test submodule
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2021-09-05 21:37:34 +02:00