Stefan Weil
b94bbd6e83
Update version handling ( #1408 )
ccutil/version.h is no longer needed.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-03-22 21:49:47 +01:00
Stefan Weil
aa6eb6bd46
Remove Tesseract parameter "include_page_breaks" and use FF by default
Now Tesseract adds a page break (normally form feed) by default.
It is still possible to suppress page breaks by setting an empty
page_separator.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-19 07:34:32 +02:00
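The page-break behavior described in aa6eb6bd46 can be illustrated with a minimal sketch. `JoinPages` is a hypothetical helper, not part of Tesseract's API, and appending the separator after every page is an assumption made for illustration. Only the default form feed and the empty-separator opt-out come from the commit message.

```cpp
#include <string>
#include <vector>

// Hypothetical helper (not Tesseract's API): concatenates page texts and
// appends page_separator after each page unless the separator is empty.
std::string JoinPages(const std::vector<std::string>& pages,
                      const std::string& page_separator = "\f") {
  std::string out;
  for (const std::string& page : pages) {
    out += page;
    if (!page_separator.empty()) {
      out += page_separator;  // form feed (0x0C) by default
    }
  }
  return out;
}
```

Passing an empty string as the separator yields the pages back to back, which corresponds to suppressing page breaks.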
Ray Smith
da03e4e910
Fixes from pull of cleanups: clang-tidied, reviewed, fixed new bugs, and restored needed code that had been deleted. Probably breaks the build because it includes some changes to the UTF-8/UTF-32 conversion.
2017-07-14 09:30:14 -07:00
Stefan Weil
84396707a8
Fix crash if output file could not be opened
This error case results in fout_ == nullptr.
Closing a nullptr file is not allowed.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-05-13 17:27:07 +02:00
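The fix in 84396707a8 amounts to guarding the close call. The following is a rough sketch of that pattern, not the actual renderer code: `OutputFile` and `ok()` are hypothetical names, and only the `fout_` member name comes from the commit message.

```cpp
#include <cstdio>

// Hypothetical class for illustration; it is not Tesseract's renderer.
class OutputFile {
 public:
  explicit OutputFile(const char* path) : fout_(std::fopen(path, "wb")) {}

  ~OutputFile() {
    // Passing a null stream to fclose is undefined behavior, so only
    // close a stream that was actually opened.
    if (fout_ != nullptr) {
      std::fclose(fout_);
    }
  }

  // Non-copyable: exactly one object owns the stream.
  OutputFile(const OutputFile&) = delete;
  OutputFile& operator=(const OutputFile&) = delete;

  bool ok() const { return fout_ != nullptr; }

 private:
  std::FILE* fout_;  // nullptr if the file could not be opened
};
```

With the guard in place, an object whose file failed to open can still be destroyed safely. Callers are expected to check `ok()` before writing.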
Raf Schietekat
8aa0a2dd48
RAII: *::GetUNLVText()
2017-05-11 02:02:37 +02:00
Raf Schietekat
1dab23916f
RAII: *::GetBoxText()
2017-05-11 02:02:37 +02:00
Raf Schietekat
b7b68a65dd
RAII: *::GetTSVText()
2017-05-11 02:02:37 +02:00
Raf Schietekat
a1fff874b4
RAII: *::GetHOCRText()
2017-05-11 02:02:37 +02:00
Raf Schietekat
f75665c34f
RAII: TessBaseAPI::GetUTF8Text()
2017-05-11 02:02:37 +02:00
Ray Smith
2c837dffc3
Result of clang tidy on recent merge
2016-11-07 10:46:33 -08:00
Steffen Rehberg
29d971eb0c
Fix text box width/height calculation
In Tesseract's coordinate system, width is just right - left, cf. slide #2 of
github.com/tesseract-ocr/docs/blob/master/das_tutorial2016/2ArchitectureAndDataStructures.pdf
2016-06-25 12:40:28 +02:00
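The calculation referred to in 29d971eb0c can be sketched as below. `Box`, `BoxWidth` and `BoxHeight` are hypothetical names used only for illustration. The commit message states the rule for width alone, so computing height as `bottom - top` is an assumption by analogy, as is the choice of coordinates that grow to the right and downward.

```cpp
#include <cassert>

// Hypothetical box type for illustration; edges are pixel coordinates
// with left <= right and top <= bottom.
struct Box {
  int left;
  int top;
  int right;
  int bottom;
};

// Width is simply right - left, with no "+ 1" correction.
int BoxWidth(const Box& b) {
  assert(b.right >= b.left);
  return b.right - b.left;
}

// Height by analogy (assumption): bottom - top.
int BoxHeight(const Box& b) {
  assert(b.bottom >= b.top);
  return b.bottom - b.top;
}
```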
Zdenko Podobný
66f37f0cd3
add copyright to renderer.cpp and pdfr.cpp
2016-03-18 19:43:45 +01:00
Tom Morris
6700edd8bc
Cleanup TSV renderer
Remove all references to hocr, hocr.tsv, etc. Remove dead code for font
info, input filename, HTML escapes. Improved comments. Fixed
indentation.
2016-03-01 13:41:19 -05:00
Sundar M. Vaidya
b1e4a82b0b
Render output in TSV format.
2016-03-01 12:30:39 -05:00
Sundar M. Vaidya
4d13892f5b
Adds TessHOcrTsvRenderer class for rendering HOCR info in tsv format.
2016-03-01 12:13:42 -05:00
amitdo
6bbcb50dd9
Added osd renderer for psm 0.
Works for single page and multi-page.
2015-10-30 20:09:00 +02:00
Robert Theis
aa6a0b12f9
Remove extraneous line feed
2015-08-12 18:02:35 -07:00
Ray Smith
03f3c9dc88
Misc fixes missed from previous commits
2015-05-12 18:13:15 -07:00
Zdenko Podobný
4c7c960bfd
fix issue 1417
2015-02-07 22:22:20 +01:00
zdenop
1156098567
Add font info to hocr output - fix issue 1219
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1132 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-03 16:22:12 +00:00
zdenop
905e6162b9
put info about (API) version; fix typo
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1117 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-22 18:31:42 +00:00
theraysmith@gmail.com
25a8c7b720
Enabled streaming input and output of multi-page documents
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1105 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-21 15:46:21 +00:00
theraysmith@gmail.com
4c3475ad2e
Fixed fmemopen portability problem
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@890 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-10-10 02:07:26 +00:00
zdenop@gmail.com
af319b4d90
fix for windows build - part 1
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@883 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-09-25 09:56:49 +00:00
theraysmith@gmail.com
88ea81c89e
Added renderer to API
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@869 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-09-20 19:39:59 +00:00