tesseract/unittest
Stefan Weil 085f6b2572 Use std::list for paragraph models
Signed-off-by: Stefan Weil <sw@weil.de>
2020-12-28 21:03:29 +01:00
fuzzers Add build script for oss-fuzz fuzzers 2020-05-08 17:37:37 +02:00
syntaxnet Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
third_party/utf Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
util/utf8 Fix code from tensorflow/models/research/syntaxnet/util/utf8 2019-06-28 07:30:40 +02:00
apiexample_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
applybox_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
baseapi_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
baseapi_thread_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
bitvector_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
cleanapi_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
colpartition_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
commandlineflags_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
cycletimer.h CycleTimer: Add missing initialization (CID 1396168) 2018-10-16 16:53:23 +02:00
dawg_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
denorm_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
doubleptr.h Move doubleptr.h to unittests as it is used only there. 2020-12-28 02:32:27 +03:00
equationdetect_test.cc Simplify code for equationdetect_test 2020-12-23 17:03:06 +01:00
fileio_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
heap_test.cc Move doubleptr.h to unittests as it is used only there. 2020-12-28 02:32:27 +03:00
imagedata_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
include_gunit.h Move doubleptr.h to unittests as it is used only there. 2020-12-28 02:32:27 +03:00
indexmapbidi_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
intfeaturemap_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
intsimdmatrix_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
lang_model_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
layout_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
ligature_table_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
linlsq_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
loadlang_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
log.h unittest: Fix and enable baseapi_thread_test 2019-06-30 20:21:56 +02:00
lstm_recode_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
lstm_squashed_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_test.cc Remove member functions STRING::string and StringParam::string (continued) 2019-09-25 10:07:51 +02:00
lstm_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
lstmtrainer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
Makefile.am Fix dependency on tmp directory for unittest programs 2020-12-24 08:54:54 +01:00
mastertrainer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
matrix_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
networkio_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
normstrngs_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
normstrngs_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
nthitem_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
osd_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
pagesegmode_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
pango_font_info_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
paragraphs_test.cc Use std::list for paragraph models 2020-12-28 21:03:29 +01:00
params_model_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
progress_test.cc [test] Fix include order. 2020-12-28 20:36:04 +03:00
qrsequence_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
README.md Provide more details for sucessfull running of unittests. 2019-10-20 22:15:21 +02:00
recodebeam_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
rect_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
resultiterator_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
scanutils_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
shapetable_test.cc [test] Correctly use assert instead of expect. 2020-12-28 03:24:05 +03:00
stats_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
stridemap_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
stringrenderer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tablefind_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tablerecog_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tabvector_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tatweel_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tesseract_leaksanitizer.supp Add more patterns to suppress memory leaks from libfontconfig 2020-12-04 13:30:58 +01:00
textlineprojection_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tfile_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
unichar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
unicharcompress_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
unicharset_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_grapheme_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_indic_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_khmer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_myanmar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validator_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00

Unit Testing for Tesseract

Requirements

Files and structure


├── langdata_lstm
│   ├── common.punc
│   ├── common.unicharambigs
│   ├── desired_bigrams.txt
│   ├── eng
│   │   ├── desired_characters
│   │   ├── eng.config
│   │   ├── eng.numbers
│   │   ├── eng.punc
│   │   ├── eng.singles_text
│   │   ├── eng.training_text
│   │   ├── eng.unicharambigs
│   │   ├── eng.wordlist
│   │   └── okfonts.txt
│   ├── extended
│   │   └── extended.config
│   ├── extendedhin
│   │   └── extendedhin.config
│   ├── font_properties
│   ├── forbidden_characters_default
│   ├── hin
│   │   ├── hin.config
│   │   ├── hin.numbers
│   │   ├── hin.punc
│   │   └── hin.wordlist
│   ├── kan
│   │   └── kan.config
│   ├── kor
│   │   └── kor.config
│   ├── osd
│   │   └── osd.unicharset
│   └── radical-stroke.txt
├── tessdata
│   ├── ara.traineddata
│   ├── chi_tra.traineddata
│   ├── eng.traineddata
│   ├── heb.traineddata
│   ├── hin.traineddata
│   ├── jpn.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── vie.traineddata
├── tessdata_best
│   ├── eng.traineddata
│   ├── fra.traineddata
│   ├── kmr.traineddata
│   └── osd.traineddata
├── tessdata_fast
│   ├── eng.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── script
│       └── Latin.traineddata
└── tesseract
    ├── abseil
    ...
    ├── test
    ├── unittest
    └── VERSION
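
The tree above implies that the data directories sit next to the tesseract checkout. As a minimal sketch, the following shell function reports which of those top-level directories are missing under a given root. The function name `check_layout` is hypothetical and not part of the repository; the directory names are taken from the tree.

```shell
# Hypothetical helper (not part of tesseract): report which of the
# top-level directories from the tree above are missing under a root.
check_layout() {
  root="$1"
  missing=0
  for dir in langdata_lstm tessdata tessdata_best tessdata_fast tesseract; do
    if [ ! -d "$root/$dir" ]; then
      echo "missing: $dir"
      missing=1
    fi
  done
  # Return 0 only when every directory was found.
  return "$missing"
}
```

When run from inside the tesseract folder, the call would be `check_layout ..`, assuming the layout shown above.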

Fonts

Run tests

To run the tests, run the following commands in the tesseract folder:

autoreconf -fiv
git submodule update --init
export TESSDATA_PREFIX=/prefix/to/path/to/tessdata
make check
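
Because the commands above depend on `TESSDATA_PREFIX`, it may help to check the variable before running `make check`. The sketch below assumes the prefix should point at a directory containing the files listed under tessdata in the tree above; whether every test needs every file is not stated here. The function name `check_tessdata_prefix` is hypothetical.

```shell
# Hypothetical helper (not part of tesseract): confirm TESSDATA_PREFIX is
# set and holds the traineddata files listed under tessdata in the tree.
check_tessdata_prefix() {
  if [ -z "${TESSDATA_PREFIX:-}" ]; then
    echo "TESSDATA_PREFIX is not set"
    return 1
  fi
  status=0
  for lang in ara chi_tra eng heb hin jpn kmr osd vie; do
    if [ ! -f "$TESSDATA_PREFIX/$lang.traineddata" ]; then
      echo "missing: $lang.traineddata"
      status=1
    fi
  done
  # Return 0 only when every listed file was found.
  return "$status"
}
```

One way to use it is `check_tessdata_prefix && make check`, so the tests start only when the data files are in place.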