tesseract/unittest
2021-01-15 13:04:38 +00:00
..
fuzzers Add include path for leptonica for fuzzer build 2021-01-07 22:30:09 +01:00
syntaxnet Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
third_party/utf Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
util/utf8 Fix code from tensorflow/models/research/syntaxnet/util/utf8 2019-06-28 07:30:40 +02:00
apiexample_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
applybox_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
baseapi_test.cc Remove unnecessary subtest with missing input image 2021-01-14 15:38:55 +01:00
baseapi_thread_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
bitvector_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
capiexample_c_test.c Improve C API. Add tests. 2021-01-02 03:57:25 +03:00
capiexample_test.cc Improve C API. Add tests. 2021-01-02 03:57:25 +03:00
cleanapi_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
colpartition_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
commandlineflags_test.cc [test] Remove set locale as it causes errors on some systems. It includes grouping for numbers, then pid and some other numbers in gtest are formatted incorrectly. 2020-12-30 16:30:40 +03:00
cycletimer.h CycleTimer: Add missing initialization (CID 1396168) 2018-10-16 16:53:23 +02:00
dawg_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
denorm_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
doubleptr.h Move doubleptr.h to unittests as it is used only there. 2020-12-28 02:32:27 +03:00
equationdetect_test.cc [test] Disable loading equ.traineddata in equationsdetect test until IdentifySpecialText is turned back on. 2020-12-30 14:12:49 +03:00
fileio_test.cc Replace snprintf by strncpy (fix compiler warning) 2021-01-08 19:01:26 +01:00
heap_test.cc Remove genericvector.h from public API 2020-12-28 21:03:29 +01:00
imagedata_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
include_gunit.h Improve CHECK macro for unittest 2020-12-30 20:08:34 +01:00
indexmapbidi_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
intfeaturemap_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
intsimdmatrix_test.cc [test] Fix intsimdmatrix test. Update result value based on updated TRand engine. 2020-12-31 03:28:36 +03:00
lang_model_test.cc More std::vector. 2021-01-07 13:57:57 +03:00
layout_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
ligature_table_test.cc Fix ligature_table_test 2021-01-10 23:22:45 +01:00
linlsq_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
list_test.cc Add some basic tests for ELIST 2021-01-03 22:02:51 +01:00
loadlang_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
log.h unittest: Fix and enable baseapi_thread_test 2019-06-30 20:21:56 +02:00
lstm_recode_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
lstm_squashed_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_test.cc More std::vector. 2021-01-07 13:57:57 +03:00
lstm_test.h More std::vector. 2021-01-07 13:57:57 +03:00
lstmtrainer_test.cc Delete TESSDATA_BEST_DIR macro 2021-01-08 20:25:26 +01:00
Makefile.am Do not run textlineprojection_test with disable-legacy, uses OSD 2021-01-15 13:04:38 +00:00
mastertrainer_test.cc [training] More unique ptrs. 2021-01-05 17:03:26 +03:00
matrix_test.cc Order and clean include statements 2020-12-30 10:50:39 +01:00
networkio_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
normstrngs_test.cc Remove strngs.h from public API 2020-12-28 21:03:29 +01:00
normstrngs_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
nthitem_test.cc Order and clean include statements 2020-12-30 10:50:39 +01:00
osd_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
pagesegmode_test.cc Remove helpers.h from public API 2020-12-31 09:06:16 +01:00
pango_font_info_test.cc Fix pango_font_info_test 2021-01-10 23:22:45 +01:00
paragraphs_test.cc Replace GenericVector<RowInfo> by std::vector<RowInfo> 2020-12-30 12:14:43 +01:00
params_model_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
progress_test.cc [test] Fix include order. 2020-12-28 20:36:04 +03:00
qrsequence_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
README.md Provide more details for sucessfull running of unittests. 2019-10-20 22:15:21 +02:00
recodebeam_test.cc More std::vector. 2021-01-07 13:57:57 +03:00
rect_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
resultiterator_test.cc Remove GenericVector::compare_callback. This fixes several tests after previous commit. 2020-12-31 17:26:40 +03:00
scanutils_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
shapetable_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
stats_test.cc Remove genericvector.h from public API 2020-12-28 21:03:29 +01:00
stridemap_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
stringrenderer_test.cc Fix stringrenderer_test 2021-01-10 23:22:45 +01:00
tablefind_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tablerecog_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tabvector_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tatweel_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
tesseract_leaksanitizer.supp Add more patterns to suppress memory leaks from libfontconfig 2020-12-04 13:30:58 +01:00
textlineprojection_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
tfile_test.cc Use old genericvector.h 2020-12-30 07:10:29 +01:00
unichar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
unicharcompress_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
unicharset_test.cc More std::vector. 2021-01-07 13:57:57 +03:00
validate_grapheme_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_indic_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_khmer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_myanmar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validator_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00

Unit Testing for Tesseract

Requirements

Files and structure


├── langdata_lstm
│   ├── common.punc
│   ├── common.unicharambigs
│   ├── desired_bigrams.txt
│   ├── eng
│   │   ├── desired_characters
│   │   ├── eng.config
│   │   ├── eng.numbers
│   │   ├── eng.punc
│   │   ├── eng.singles_text
│   │   ├── eng.training_text
│   │   ├── eng.unicharambigs
│   │   ├── eng.wordlist
│   │   └── okfonts.txt
│   ├── extended
│   │   └── extended.config
│   ├── extendedhin
│   │   └── extendedhin.config
│   ├── font_properties
│   ├── forbidden_characters_default
│   ├── hin
│   │   ├── hin.config
│   │   ├── hin.numbers
│   │   ├── hin.punc
│   │   └── hin.wordlist
│   ├── kan
│   │   └── kan.config
│   ├── kor
│   │   └── kor.config
│   ├── osd
│   │   └── osd.unicharset
│   └── radical-stroke.txt
├── tessdata
│   ├── ara.traineddata
│   ├── chi_tra.traineddata
│   ├── eng.traineddata
│   ├── heb.traineddata
│   ├── hin.traineddata
│   ├── jpn.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── vie.traineddata
├── tessdata_best
│   ├── eng.traineddata
│   ├── fra.traineddata
│   ├── kmr.traineddata
│   └── osd.traineddata
├── tessdata_fast
│   ├── eng.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── script
│       └── Latin.traineddata
└── tesseract
    ├── abseil
    ...
    ├── test
    ├── unittest
    └── VERSION

Fonts

Run tests

To run the tests, do the following in tesseract folder

autoreconf -fiv
git submodule update --init
export TESSDATA_PREFIX=/prefix/to/path/to/tessdata
make check