tesseract/unittest
Robin Watts 872816897a Rejig intsimdmatrix to reduce FP ops.
Avoid 1) floating point division by 127, 2) conversion of
bias to double, 3) FP addition, in favour of 1) integer
multiplication by 127, and 2) integer addition.

(Also costs extra work in the serialisation/deserialisation of
the scale values, and conversion of weights to int formats, but
these are all one offs).
2020-10-12 04:30:46 -07:00
fuzzers Add build script for oss-fuzz fuzzers 2020-05-08 17:37:37 +02:00
syntaxnet Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
third_party/utf Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
util/utf8 Fix code from tensorflow/models/research/syntaxnet/util/utf8 2019-06-28 07:30:40 +02:00
apiexample_test.cc Don't check for limits.h (now unused) 2020-06-20 10:39:13 +02:00
applybox_test.cc Use const char* for filename parameters 2020-07-07 14:20:09 +02:00
baseapi_test.cc Add TessBaseAPI::GetPageRes again 2020-05-04 14:03:39 +02:00
baseapi_thread_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
bitvector_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
cleanapi_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
colpartition_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
commandlineflags_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
cycletimer.h CycleTimer: Add missing initialization (CID 1396168) 2018-10-16 16:53:23 +02:00
dawg_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
denorm_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
equationdetect_test.cc unittest: Fix and enable equationdetect_test 2019-06-30 12:39:54 +02:00
fileio_test.cc unittest: Add fileio_test 2018-10-12 08:13:39 +02:00
heap_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
imagedata_test.cc Remove member functions STRING::string and StringParam::string (continued) 2019-09-25 10:07:51 +02:00
include_gunit.h unittest: Fix and enable stridemap_test 2019-06-30 15:30:03 +02:00
indexmapbidi_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
intfeaturemap_test.cc Fix more build error for compilation without legacy engine 2019-05-16 18:12:55 +02:00
intsimdmatrix_test.cc Rejig intsimdmatrix to reduce FP ops. 2020-10-12 04:30:46 -07:00
lang_model_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
layout_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
ligature_table_test.cc unittest: Fix and enable ligature_table_test 2019-06-11 16:40:23 +02:00
linlsq_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
loadlang_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
log.h unittest: Fix and enable baseapi_thread_test 2019-06-30 20:21:56 +02:00
lstm_recode_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_squashed_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_test.cc Remove member functions STRING::string and StringParam::string (continued) 2019-09-25 10:07:51 +02:00
lstm_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
lstmtrainer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
Makefile.am Improve disabled legacy engine build 2020-10-12 11:47:16 +01:00
mastertrainer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
matrix_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
networkio_test.cc unittest: Fix and enable networkio_test 2019-06-30 15:28:42 +02:00
normstrngs_test.cc unittest: Update comments in normstrngs_test.cc 2020-07-07 11:29:48 +02:00
normstrngs_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
nthitem_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
osd_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
pagesegmode_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
pango_font_info_test.cc Remove unused functions FontUtils::GetAllRenderableCharacters 2020-10-03 12:04:40 +02:00
paragraphs_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
params_model_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
progress_test.cc Don't check for limits.h (now unused) 2020-06-20 10:39:13 +02:00
qrsequence_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
README.md Provide more details for sucessfull running of unittests. 2019-10-20 22:15:21 +02:00
recodebeam_test.cc unittest: Remove unused local variables 2020-06-29 16:19:23 +02:00
rect_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
resultiterator_test.cc unittest: Remove unused local variables 2020-06-29 16:19:23 +02:00
scanutils_test.cc unittest: Fix two issues reported by Coverity Scan (CID 1402761, 1402755) 2019-07-10 16:38:30 +02:00
shapetable_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
stats_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
stridemap_test.cc unittest: Fix and enable stridemap_test 2019-06-30 15:30:03 +02:00
stringrenderer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
tablefind_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
tablerecog_test.cc Remove redundant method from class GenericVector 2020-01-18 11:30:14 +01:00
tabvector_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
tatweel_test.cc unittest: Fix and enable tatweel_test 2019-07-07 10:59:27 +02:00
textlineprojection_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
tfile_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unichar_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unicharcompress_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unicharset_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
validate_grapheme_test.cc unittest: Fix and enable validate_grapheme_test 2019-01-23 13:54:27 +01:00
validate_indic_test.cc unittest: Fix and enable validate_indic_test 2019-01-23 13:54:27 +01:00
validate_khmer_test.cc unittest: Fix and enable validate_khmer_test 2019-01-23 13:54:27 +01:00
validate_myanmar_test.cc unittest: Fix and enable validate_myanmar_test 2019-01-23 13:54:27 +01:00
validator_test.cc unittest: Format code 2018-09-29 09:19:13 +02:00

Unit Testing for Tesseract

Requirements

Files and structure


├── langdata_lstm
│   ├── common.punc
│   ├── common.unicharambigs
│   ├── desired_bigrams.txt
│   ├── eng
│   │   ├── desired_characters
│   │   ├── eng.config
│   │   ├── eng.numbers
│   │   ├── eng.punc
│   │   ├── eng.singles_text
│   │   ├── eng.training_text
│   │   ├── eng.unicharambigs
│   │   ├── eng.wordlist
│   │   └── okfonts.txt
│   ├── extended
│   │   └── extended.config
│   ├── extendedhin
│   │   └── extendedhin.config
│   ├── font_properties
│   ├── forbidden_characters_default
│   ├── hin
│   │   ├── hin.config
│   │   ├── hin.numbers
│   │   ├── hin.punc
│   │   └── hin.wordlist
│   ├── kan
│   │   └── kan.config
│   ├── kor
│   │   └── kor.config
│   ├── osd
│   │   └── osd.unicharset
│   └── radical-stroke.txt
├── tessdata
│   ├── ara.traineddata
│   ├── chi_tra.traineddata
│   ├── eng.traineddata
│   ├── heb.traineddata
│   ├── hin.traineddata
│   ├── jpn.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── vie.traineddata
├── tessdata_best
│   ├── eng.traineddata
│   ├── fra.traineddata
│   ├── kmr.traineddata
│   └── osd.traineddata
├── tessdata_fast
│   ├── eng.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── script
│       └── Latin.traineddata
└── tesseract
    ├── abseil
    ...
    ├── test
    ├── unittest
    └── VERSION
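
As a rough sketch, the top-level layout in the tree above could be verified before building. The function name `check_test_layout` is hypothetical and not part of the Tesseract repository; it only assumes that the five directories shown above sit side by side under one root.

```shell
# Hypothetical helper (not part of Tesseract): report which of the sibling
# directories from the tree above are missing under the given root.
# Prints one "missing: NAME" line per absent directory and returns
# non-zero if anything is missing.
check_test_layout() {
    root="$1"
    status=0
    for dir in langdata_lstm tessdata tessdata_best tessdata_fast tesseract; do
        if [ ! -d "$root/$dir" ]; then
            echo "missing: $dir"
            status=1
        fi
    done
    return "$status"
}
```

Calling `check_test_layout ..` from inside the `tesseract` folder would check its parent directory. The function only checks that the directories exist, not the files inside them.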

Fonts

Run tests

To run the tests, run the following commands in the tesseract folder:

autoreconf -fiv
git submodule update --init
export TESSDATA_PREFIX=/prefix/to/path/to/tessdata
make check
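
Before running `make check`, it may help to confirm that the directory named by `TESSDATA_PREFIX` actually contains the trained data you expect. The following is a minimal sketch, not part of the Tesseract build. The name `check_traineddata` is hypothetical, and the sketch assumes that `TESSDATA_PREFIX` points directly at a directory of `*.traineddata` files, as the placeholder path above suggests.

```shell
# Hypothetical helper (not part of Tesseract): check that a tessdata
# directory holds the <lang>.traineddata file for each language code given.
# Usage: check_traineddata DIR LANG...
# Prints one "missing: FILE" line per absent file and returns non-zero
# if anything is missing.
check_traineddata() {
    dir="$1"
    shift
    status=0
    for lang in "$@"; do
        if [ ! -f "$dir/$lang.traineddata" ]; then
            echo "missing: $lang.traineddata"
            status=1
        fi
    done
    return "$status"
}
```

For example, `check_traineddata "$TESSDATA_PREFIX" eng osd` would report whether the English and OSD models are present. Which languages the tests need is not stated here, so the arguments are illustrative.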