6512 Commits

Author SHA1 Message Date
Stefan Weil
4a49e0340f Remove trailing whitespace from PAGE XML <Unicode> output
Fixes: 577e8a8b93 ("Add PAGE XML renderer / export (#4214)")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-05-03 08:48:40 +02:00
Sadra Barikbin
d461867812 Refactor function Textord::clean_noise_from_row (#4225)
Co-authored-by: Stefan Weil <sw@weilnetz.de>
2024-04-26 07:53:31 +02:00
Sadra Barikbin
a1837f18e0 Remove the variable 2024-04-26 07:48:57 +02:00
Sadra Barikbin
77c99e0782 A few typo fix 2024-04-26 07:48:27 +02:00
Sadra Barikbin
072b5d47c1 Apply the change 2024-04-26 07:47:22 +02:00
Sadra Barikbin
6802705cf0 Apply the changes 2024-04-26 07:46:30 +02:00
Stefan Weil
d6f0073c75 Remove unused local variables in PAGE renderer
Fixes: 577e8a8b93 ("Add PAGE XML renderer / export (#4214)")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-24 10:33:39 +02:00
Stefan Weil
549b8767e9 Support training without lstmf files
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-24 10:32:47 +02:00
Stefan Weil
88771ca4c6 Add missing new function prototypes for PAGE renderer to C API
Fixes: 577e8a8b93 ("Add PAGE XML renderer / export (#4214)")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-24 10:22:27 +02:00
Jan Kamlah
577e8a8b93 Add PAGE XML renderer / export (#4214)
Add PAGE XML export and documentation.
To generate PAGE XML output just add 'page' to the tesseract command.

The output is outputname + '.page.xml' to avoid conflicts with ALTO export.

The output can be customized with the flags:
tessedit_create_page_polygon and tessedit_create_page_wordlevel.

Co-authored-by: Stefan Weil <sw@weilnetz.de>
2024-04-19 21:12:39 +02:00
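The naming rule in the commit message above can be sketched in a few lines of C++. This is only an illustration, not code from the commit: the helper names are hypothetical, and the `.xml` suffix used for ALTO output is an assumption made here to show why a distinct `.page.xml` suffix avoids a clash.

```cpp
#include <cassert>
#include <string>

// Output path described in the commit message: outputname + ".page.xml".
std::string PageXmlPath(const std::string &outputname) {
  return outputname + ".page.xml";
}

// Assumption (not stated in the commit message): ALTO export writes
// outputname + ".xml", which is what the PAGE suffix avoids colliding with.
std::string AltoXmlPath(const std::string &outputname) {
  return outputname + ".xml";
}

// Hypothetical helper that builds "tesseract <image> <outputname> page",
// i.e. the usual command with 'page' appended. It does no shell quoting,
// so it is only suitable for paths without spaces or special characters.
std::string PageXmlCommand(const std::string &image,
                           const std::string &outputname) {
  assert(!image.empty() && !outputname.empty());
  return "tesseract " + image + " " + outputname + " page";
}
```

For example, `PageXmlCommand("scan.png", "scan")` yields a command whose PAGE output would land in `scan.page.xml` under the rule above.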
Mahesh Madhav
bae520ea00 Facilitate vectorization for generic build (#4223)
* Update intsimdmatrix.cpp

Unroll loop into chunks of four operations to facilitate compilers recognizing vectorization opportunity.

* Apply suggestions from code review

Co-authored-by: Stefan Weil <sw@weilnetz.de>
2024-04-16 19:28:55 +02:00
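The unrolling idea in the commit above can be illustrated with a small integer dot product. This is a minimal sketch under assumptions, not the actual change to intsimdmatrix.cpp: the function names and signatures are hypothetical, and whether a given compiler vectorizes either loop depends on its version and flags.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Reference loop: one multiply-accumulate per iteration.
std::int32_t DotProductSimple(const std::int8_t *u, const std::int8_t *v,
                              std::size_t n) {
  std::int32_t total = 0;
  for (std::size_t i = 0; i < n; ++i) {
    total += static_cast<std::int32_t>(u[i]) * v[i];
  }
  return total;
}

// Same result, with the main loop unrolled into chunks of four independent
// products. The four products do not depend on each other, which makes the
// pattern easier for a compiler to map onto SIMD lanes.
std::int32_t DotProductUnrolled4(const std::int8_t *u, const std::int8_t *v,
                                 std::size_t n) {
  std::int32_t total = 0;
  std::size_t i = 0;
  for (; i + 4 <= n; i += 4) {
    const std::int32_t p0 = static_cast<std::int32_t>(u[i]) * v[i];
    const std::int32_t p1 = static_cast<std::int32_t>(u[i + 1]) * v[i + 1];
    const std::int32_t p2 = static_cast<std::int32_t>(u[i + 2]) * v[i + 2];
    const std::int32_t p3 = static_cast<std::int32_t>(u[i + 3]) * v[i + 3];
    total += p0 + p1 + p2 + p3;
  }
  // Tail loop for the remaining zero to three elements.
  for (; i < n; ++i) {
    total += static_cast<std::int32_t>(u[i]) * v[i];
  }
  return total;
}

// Convenience wrapper for equally sized vectors.
std::int32_t DotProduct(const std::vector<std::int8_t> &u,
                        const std::vector<std::int8_t> &v) {
  assert(u.size() == v.size());
  return DotProductUnrolled4(u.data(), v.data(), u.size());
}
```

Both variants compute the same sum, so the unrolled form can be checked against the simple loop on arbitrary inputs.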
Stefan Weil
d5e000bc58 Remove unsupported OpenCL code and related API functions (#4220)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-11 19:15:39 +03:00
Stefan Weil
912deb3978 Fix comments for function ReCachePages
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-04 11:08:53 +02:00
zdenop
87a152c0eb cmake: target_compile_features instead of set_property 2024-03-30 11:25:27 +01:00
Amit D
2b07505e0e issue-bug.yml: Add a link to Supported Operating Systems page 2024-03-17 05:08:20 +02:00
Amit D
cc3c507003 Update autotools.yml
Remove the gcc-8 job.
2024-03-11 06:50:05 +02:00
Amit D
872ca6db73 Update cmake.yml
Remove the gcc-8 job
2024-03-11 06:45:54 +02:00
Egor Pugin
d9ae5c356b Merge pull request #4205 from stweil/docker
Remove broken Dockerfile
2024-03-09 16:17:47 +03:00
Stefan Weil
398ea8a86f Remove broken Dockerfile
It was used for Travis CI which is no longer supported.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 14:08:46 +01:00
Stefan Weil
baf8e80068 Fix CID 1534954 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
4c0f911092 Fix CID 1534957 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
151cf07086 Fix CID 1534940 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
f00d4b22dc Fix CID 1534933, 1534937 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
479789b379 Fix CID 1534948 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
d172f9b802 Fix CID 1534956 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
ddd2ec24a8 Fix CID 1534932, 1534942 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
7465da9058 Fix CID 1510987 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
aa02fb300b Fix CID 1534944 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
5a9a841f40 Fix CID 1534941, 1534950, 1534952 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
bee3b01619 Fix CID 1534943, 1534947, 1534949, 1534955 (Use of auto that causes a copy)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
Stefan Weil
8dfa2dee86 Fix CID 1534953 (COPY_INSTEAD_OF_MOVE)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 13:50:52 +01:00
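The series of Coverity fixes above addresses two patterns: copying an object that is not used afterwards (COPY_INSTEAD_OF_MOVE) and `auto` declarations that silently copy. The sketch below shows the general shape of both fixes. It does not reproduce any of the actual changes; `WordList` and `TotalLength` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical holder type, not taken from the Tesseract sources.
class WordList {
 public:
  // Pattern that a COPY_INSTEAD_OF_MOVE report points at: `words` is a local
  // by-value parameter that is never used again, yet it is copied.
  void SetWordsByCopy(std::vector<std::string> words) { words_ = words; }

  // Fix: move the parameter into the member, transferring its buffer
  // instead of duplicating every string.
  void SetWordsByMove(std::vector<std::string> words) {
    words_ = std::move(words);
  }

  const std::vector<std::string> &words() const { return words_; }

  const std::string &at(std::size_t i) const {
    assert(i < words_.size());
    return words_[i];
  }

 private:
  std::vector<std::string> words_;
};

// `for (auto word : ...)` would copy each string on every iteration;
// `const auto &` binds a reference and avoids the copy.
std::size_t TotalLength(const WordList &list) {
  std::size_t total = 0;
  for (const auto &word : list.words()) {
    total += word.size();
  }
  return total;
}
```

Both setters leave the object in the same state; the difference is only in how much work is done to get there, which is why such fixes are usually behavior-preserving.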
Stefan Weil
77fc53c579 unittest-macos.yml: Install libtool (fixes macos-arm-14-clang-unittest)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 12:00:56 +01:00
Stefan Weil
bcc1a3b45b Rename frk -> deu_latf (ISO 639-3, ISO 15924)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:25:28 +01:00
Amit D
a76ca9b545 README.md: Add a link to the 'supported compilers' page 2024-03-08 08:39:05 +02:00
Amit D
6cce24fad8 unittest-macos.yml: Add a job that will run on Apple silicon
https://github.com/actions/runner-images/issues/9254
2024-03-08 06:47:03 +02:00
Amit D
8f667335da Update unittest-disablelegacy.yml 2024-03-06 14:01:17 +02:00
Amit D
bd3c1a2ae0 Update unittest-disablelegacy.yml 2024-03-05 17:27:19 +02:00
Amit D
cdd783eca2 Update unittest-disablelegacy.yml 2024-03-05 17:21:31 +02:00
Stefan Weil
8690d5ac37 Remove unneeded constant kNumSingleQuoteUnicodes
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-05 14:12:50 +01:00
Stefan Weil
6a608e5251 Remove unneeded constant kNumDoubleQuoteUnicodes
Fixes: 2fbb81070f ("normstrngs: add more hyphens and quotes")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-05 14:11:07 +01:00
Stefan Weil
6b99ec616f Remove unneeded constant kNumHyphenPuncUnicodes
Fixes: 2fbb81070f ("normstrngs: add more hyphens and quotes")
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-05 14:05:26 +01:00
Robert Sachunsky
a08a6ac217 fix initializer syntax 2024-03-05 13:12:10 +01:00
Robert Sachunsky
2fbb81070f normstrngs: add more hyphens and quotes 2024-03-05 13:12:10 +01:00
Amit D
f7d4b39125 Update unittest-disablelegacy.yml 2024-03-05 10:01:12 +02:00
Amit D
6231549b42 cmake.yml: Remove the clang++ 14 on Linux job 2024-03-04 16:21:28 +02:00
Amit D
cb9afbd85f autotools.yml: Remove the clang++ 14 on Linux job 2024-03-04 16:03:36 +02:00
Amit D
dfb2b226bd Update msys2.yml: Drop the 32-bit job 2024-03-04 15:40:36 +02:00
Stefan Weil
aa7c342297 Update GitHub actions/checkout to v4
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-03 22:02:01 +01:00
Egor Pugin
94bd98b7ef [pdfrenderer] Add some debugging capabilities. Do not compress symbol table and text data during debug. 2024-02-23 23:49:40 +03:00
Egor Pugin
b8961a7375 [pdfrenderer] Do not include space symbol into the word pdf box.
See #2879.
2024-02-23 23:48:39 +03:00