tesseract/unittest
Egor Pugin fd8907471c Improve C API. Add tests.
1. Add simple C API test in C++ program.
2. Add simple C API test in C program.
3. Fix including capi.h in C++ files.
2021-01-02 03:57:25 +03:00
fuzzers Add build script for oss-fuzz fuzzers 2020-05-08 17:37:37 +02:00
syntaxnet Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
third_party/utf Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
util/utf8 Fix code from tensorflow/models/research/syntaxnet/util/utf8 2019-06-28 07:30:40 +02:00
apiexample_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
applybox_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
baseapi_test.cc Fix baseapi test. Use C++ regex instead of gtest ones. 2020-12-30 01:28:50 +03:00
baseapi_thread_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
bitvector_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
capiexample_c_test.c Improve C API. Add tests. 2021-01-02 03:57:25 +03:00
capiexample_test.cc Improve C API. Add tests. 2021-01-02 03:57:25 +03:00
cleanapi_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
colpartition_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
commandlineflags_test.cc [test] Remove set locale as it causes errors on some systems. It includes grouping for numbers, then pid and some other numbers in gtest are formatted incorrectly. 2020-12-30 16:30:40 +03:00
cycletimer.h CycleTimer: Add missing initialization (CID 1396168) 2018-10-16 16:53:23 +02:00
dawg_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
denorm_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
doubleptr.h Move doubleptr.h to unittests as it is used only there. 2020-12-28 02:32:27 +03:00
equationdetect_test.cc [test] Disable loading equ.traineddata in equationsdetect test until IdentifySpecialText is turned back on. 2020-12-30 14:12:49 +03:00
fileio_test.cc Improve some unit tests. 2020-12-28 01:11:13 +03:00
heap_test.cc Remove genericvector.h from public API 2020-12-28 21:03:29 +01:00
imagedata_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
include_gunit.h Improve CHECK macro for unittest 2020-12-30 20:08:34 +01:00
indexmapbidi_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
intfeaturemap_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
intsimdmatrix_test.cc [test] Fix intsimdmatrix test. Update result value based on updated TRand engine. 2020-12-31 03:28:36 +03:00
lang_model_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
layout_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
ligature_table_test.cc Use TESS_API for every public symbol. Public symbol is exported from the library. This also applies to unit test and training symbols. Users will be limited to public api, but set of exported symbols will be wider still. 2020-12-31 16:32:29 +03:00
linlsq_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
loadlang_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
log.h unittest: Fix and enable baseapi_thread_test 2019-06-30 20:21:56 +02:00
lstm_recode_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
lstm_squashed_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_test.cc Use old genericvector.h 2020-12-30 07:10:29 +01:00
lstm_test.h Remove helpers.h from public API 2020-12-31 09:06:16 +01:00
lstmtrainer_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
Makefile.am Make autotools builds for unittest less noisy by default 2020-12-31 18:17:25 +01:00
mastertrainer_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
matrix_test.cc Order and clean include statements 2020-12-30 10:50:39 +01:00
networkio_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
normstrngs_test.cc Remove strngs.h from public API 2020-12-28 21:03:29 +01:00
normstrngs_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
nthitem_test.cc Order and clean include statements 2020-12-30 10:50:39 +01:00
osd_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
pagesegmode_test.cc Remove helpers.h from public API 2020-12-31 09:06:16 +01:00
pango_font_info_test.cc Use TESS_API for every public symbol. Public symbol is exported from the library. This also applies to unit test and training symbols. Users will be limited to public api, but set of exported symbols will be wider still. 2020-12-31 16:32:29 +03:00
paragraphs_test.cc Replace GenericVector<RowInfo> by std::vector<RowInfo> 2020-12-30 12:14:43 +01:00
params_model_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
progress_test.cc [test] Fix include order. 2020-12-28 20:36:04 +03:00
qrsequence_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
README.md Provide more details for sucessfull running of unittests. 2019-10-20 22:15:21 +02:00
recodebeam_test.cc Remove helpers.h from public API 2020-12-31 09:06:16 +01:00
rect_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
resultiterator_test.cc Remove GenericVector::compare_callback. This fixes several tests after previous commit. 2020-12-31 17:26:40 +03:00
scanutils_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
shapetable_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
stats_test.cc Remove genericvector.h from public API 2020-12-28 21:03:29 +01:00
stridemap_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
stringrenderer_test.cc Use TESS_API for every public symbol. Public symbol is exported from the library. This also applies to unit test and training symbols. Users will be limited to public api, but set of exported symbols will be wider still. 2020-12-31 16:32:29 +03:00
tablefind_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tablerecog_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tabvector_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
tatweel_test.cc Run more unittests without requiring tensorflow 2020-12-30 20:10:26 +01:00
tesseract_leaksanitizer.supp Add more patterns to suppress memory leaks from libfontconfig 2020-12-04 13:30:58 +01:00
textlineprojection_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
tfile_test.cc Use old genericvector.h 2020-12-30 07:10:29 +01:00
unichar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
unicharcompress_test.cc Make tmp directory for all unit tests 2020-12-30 18:17:58 +01:00
unicharset_test.cc Use old genericvector.h 2020-12-30 07:10:29 +01:00
validate_grapheme_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_indic_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_khmer_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validate_myanmar_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00
validator_test.cc Add / fix namespace tesseract for unittest 2020-12-27 10:54:43 +01:00

Unit Testing for Tesseract

Requirements

Files and structure


├── langdata_lstm
│   ├── common.punc
│   ├── common.unicharambigs
│   ├── desired_bigrams.txt
│   ├── eng
│   │   ├── desired_characters
│   │   ├── eng.config
│   │   ├── eng.numbers
│   │   ├── eng.punc
│   │   ├── eng.singles_text
│   │   ├── eng.training_text
│   │   ├── eng.unicharambigs
│   │   ├── eng.wordlist
│   │   └── okfonts.txt
│   ├── extended
│   │   └── extended.config
│   ├── extendedhin
│   │   └── extendedhin.config
│   ├── font_properties
│   ├── forbidden_characters_default
│   ├── hin
│   │   ├── hin.config
│   │   ├── hin.numbers
│   │   ├── hin.punc
│   │   └── hin.wordlist
│   ├── kan
│   │   └── kan.config
│   ├── kor
│   │   └── kor.config
│   ├── osd
│   │   └── osd.unicharset
│   └── radical-stroke.txt
├── tessdata
│   ├── ara.traineddata
│   ├── chi_tra.traineddata
│   ├── eng.traineddata
│   ├── heb.traineddata
│   ├── hin.traineddata
│   ├── jpn.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── vie.traineddata
├── tessdata_best
│   ├── eng.traineddata
│   ├── fra.traineddata
│   ├── kmr.traineddata
│   └── osd.traineddata
├── tessdata_fast
│   ├── eng.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── script
│       └── Latin.traineddata
└── tesseract
    ├── abseil
    ...
    ├── test
    ├── unittest
    └── VERSION
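
The layout above places the data directories next to the `tesseract` checkout. A minimal sketch of a pre-flight check for that layout follows; the function name `check_test_layout` is hypothetical and not part of the repository, and the list of directories is taken only from the tree shown above.

```shell
#!/bin/sh
# Hypothetical helper (not part of the tesseract repository).
# Reports which of the sibling directories from the tree above are
# missing under the given parent directory (default: "..").
check_test_layout() {
    parent="${1:-..}"
    missing=0
    for dir in langdata_lstm tessdata tessdata_best tessdata_fast tesseract; do
        if [ ! -d "$parent/$dir" ]; then
            echo "missing: $parent/$dir" >&2
            missing=1
        fi
    done
    # Returns 0 when every directory exists, 1 otherwise.
    return "$missing"
}
```

Called from inside the `tesseract` folder as `check_test_layout ..`, it prints one line per missing directory and returns a non-zero status if any are absent.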

Fonts

Run tests

To run the tests, run the following commands in the tesseract folder:

autoreconf -fiv
git submodule update --init
export TESSDATA_PREFIX=/prefix/to/path/to/tessdata
make check
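
Before `make check`, it can help to confirm that `TESSDATA_PREFIX` points at a directory that actually contains trained data. The sketch below assumes the variable names the `tessdata` directory itself, as in the `export` line above. The function name `check_tessdata` is hypothetical, and checking only `eng` and `osd` is an illustrative choice based on the tree above, not a statement of what the tests require.

```shell
#!/bin/sh
# Hypothetical helper (not part of the tesseract repository).
# Verifies that the directory given as $1, or TESSDATA_PREFIX if no
# argument is passed, contains eng.traineddata and osd.traineddata.
check_tessdata() {
    dir="${1:-${TESSDATA_PREFIX:-}}"
    if [ -z "$dir" ]; then
        echo "TESSDATA_PREFIX is not set" >&2
        return 1
    fi
    for lang in eng osd; do
        if [ ! -f "$dir/$lang.traineddata" ]; then
            echo "missing: $dir/$lang.traineddata" >&2
            return 1
        fi
    done
    return 0
}
```

One possible use is `check_tessdata && make check`, so the test run starts only when the data files are found.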