tesseract/src/ccmain
Noah Metzger 91c7504a35 Added a feature to enrich the hOCR output with glyph confidences
With the parameter -c glyph_confidences=true, the user can enrich the
hOCR output with additional information: for every timestep of the LSTM,
Tesseract then lists all glyphs that were considered, together with
their confidences.
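
As a rough illustration of how the parameter might be passed on the command line, here is a minimal Python sketch that only assembles the argument list and does not run anything. The helper name `build_tesseract_command` and the file names `page.png` / `page` are hypothetical; the sketch assumes the usual `tesseract imagename outputbase [options] [configfile]` invocation, with the `hocr` config selecting hOCR output, and a Tesseract build that includes this commit.

```python
import shlex


def build_tesseract_command(image_path, output_base, glyph_confidences=True):
    """Assemble the argv for a tesseract run that writes hOCR output.

    Hypothetical helper: it only builds the argument list and never
    invokes tesseract itself.
    """
    cmd = ["tesseract", image_path, output_base]
    if glyph_confidences:
        # Parameter named in the commit message; assumed to be set via -c.
        cmd += ["-c", "glyph_confidences=true"]
    # "hocr" is the config file that switches the output format to hOCR.
    cmd.append("hocr")
    return cmd


if __name__ == "__main__":
    # Print the command as a single shell-quoted line for inspection.
    print(shlex.join(build_tesseract_command("page.png", "page")))
```

The resulting list could be handed to `subprocess.run` on a machine where a suitable `tesseract` binary is installed.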

The format of the hOCR output changes slightly: there is now a line break
after every word, which makes the output easier for humans to read.

Signed-off-by: Noah Metzger <noah.metzger@bib.uni-mannheim.de>
2018-07-25 18:18:58 +02:00
adaptions.cpp adaptions: Remove unneeded include statement 2018-06-25 14:28:04 +02:00
applybox.cpp Merge branch 'master' into disable-legacy 2018-07-04 11:14:33 +03:00
control.cpp Add missing include statements and clean some include statements 2018-07-07 16:24:53 +02:00
control.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
docqual.cpp More fixes BOOL8 -> bool 2018-05-21 00:40:58 +03:00
docqual.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
equationdetect.cpp Fix some typos (most found by codespell) 2018-05-27 18:49:43 +02:00
equationdetect.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
fixspace.cpp Add missing include statements and clean some include statements 2018-07-07 16:24:53 +02:00
fixspace.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
fixxht.cpp Replace tabs by blanks in source code 2018-07-03 16:29:14 +02:00
linerec.cpp Added a feature to enrich the hOCR output with glyph confidences 2018-07-25 18:18:58 +02:00
ltrresultiterator.cpp WordFontAttributes: Check that word != nullptr earlier. Fix #1665 2018-06-13 23:38:27 +03:00
ltrresultiterator.h Clean some include statements 2018-06-23 21:15:54 +02:00
Makefile.am Fix build with legacy engine disabled 2018-07-04 17:56:42 +02:00
mutableiterator.h Use default keyword instead of empty ctors/dtors. 2018-05-21 13:35:46 +03:00
osdetect.cpp Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
osdetect.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
output.cpp Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
output.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
pageiterator.cpp Use std::max/std::min instead of MAX/MIN macros. 2018-05-20 17:49:48 +03:00
pageiterator.h Move sources into src dir. Update build scripts. 2018-04-25 11:02:54 +03:00
pagesegmain.cpp Add an option to compile tesseract without the code of the legacy OCR engine 2018-07-03 18:49:42 +03:00
pagewalk.cpp Add missing file change. 2018-05-21 00:43:22 +03:00
par_control.cpp Fix file endings 2018-04-25 19:35:33 +02:00
paragraphs_internal.h Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
paragraphs.cpp Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
paragraphs.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
paramsd.cpp Clean more include files and include statements 2018-06-24 19:45:12 +02:00
paramsd.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
pgedit.cpp Fix some comments 2018-07-07 21:19:01 +02:00
pgedit.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
recogtraining.cpp Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
reject.cpp Remove unused iterator 2018-07-05 21:20:27 +02:00
reject.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
resultiterator.cpp Added a feature to enrich the hOCR output with glyph confidences 2018-07-25 18:18:58 +02:00
resultiterator.h Added a feature to enrich the hOCR output with glyph confidences 2018-07-25 18:18:58 +02:00
superscript.cpp Fix compiler warnings [-Wmissing-prototypes] 2018-07-05 16:03:02 +02:00
tessbox.cpp Remove empty tessbox.h 2018-06-24 19:45:12 +02:00
tessedit.cpp Merge branch 'master' into disable-legacy 2018-07-04 11:14:33 +03:00
tessedit.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00
tesseractclass.cpp Added a feature to enrich the hOCR output with glyph confidences 2018-07-25 18:18:58 +02:00
tesseractclass.h Added a feature to enrich the hOCR output with glyph confidences 2018-07-25 18:18:58 +02:00
tessvars.cpp Replace tabs by blanks in source code 2018-07-03 16:29:14 +02:00
tessvars.h Replace tabs by blanks in source code 2018-07-03 16:29:14 +02:00
tfacepp.cpp Rename all C-style headers (e.g. <stdio.h>) to C++ style (<cstdio>). 2018-05-20 00:52:04 +03:00
thresholder.cpp Add missing include statement 2018-07-07 21:19:00 +02:00
thresholder.h Move sources into src dir. Update build scripts. 2018-04-25 11:02:54 +03:00
werdit.cpp Clean more include files and include statements 2018-06-24 19:45:12 +02:00
werdit.h Clean more include files and include statements 2018-06-24 19:45:12 +02:00