tesseract/unittest
Robin Watts 9dfdac51c6 Tweak scales array for intSimdMatrix case.
Currently, the size of the scales array is not rounded up
in the same way as the weights are. This blocks us pushing
the scale calculations into the SIMD, as when we "overread"
the end of the scale array, we potentially get errors.

Here, we adjust the intSimdMatrix stuff to ensure that the
scales array reserves enough entries to allow such overreads
to work.

This doesn't make any difference for now, but opens the way
for future optimisations.
2020-10-12 11:47:16 +01:00
..
fuzzers Add build script for oss-fuzz fuzzers 2020-05-08 17:37:37 +02:00
syntaxnet Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
third_party/utf Add code from tensorflow/models 2019-06-28 07:30:40 +02:00
util/utf8 Fix code from tensorflow/models/research/syntaxnet/util/utf8 2019-06-28 07:30:40 +02:00
apiexample_test.cc Don't check for limits.h (now unused) 2020-06-20 10:39:13 +02:00
applybox_test.cc Use const char* for filename parameters 2020-07-07 14:20:09 +02:00
baseapi_test.cc Add TessBaseAPI::GetPageRes again 2020-05-04 14:03:39 +02:00
baseapi_thread_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
bitvector_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
cleanapi_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
colpartition_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
commandlineflags_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
cycletimer.h CycleTimer: Add missing initialization (CID 1396168) 2018-10-16 16:53:23 +02:00
dawg_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
denorm_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
equationdetect_test.cc unittest: Fix and enable equationdetect_test 2019-06-30 12:39:54 +02:00
fileio_test.cc unittest: Add fileio_test 2018-10-12 08:13:39 +02:00
heap_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
imagedata_test.cc Remove member functions STRING::string and StringParam::string (continued) 2019-09-25 10:07:51 +02:00
include_gunit.h unittest: Fix and enable stridemap_test 2019-06-30 15:30:03 +02:00
indexmapbidi_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
intfeaturemap_test.cc Fix more build error for compilation without legacy engine 2019-05-16 18:12:55 +02:00
intsimdmatrix_test.cc Tweak scales array for intSimdMatrix case. 2020-10-12 11:47:16 +01:00
lang_model_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
layout_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
ligature_table_test.cc unittest: Fix and enable ligature_table_test 2019-06-11 16:40:23 +02:00
linlsq_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
loadlang_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
log.h unittest: Fix and enable baseapi_thread_test 2019-06-30 20:21:56 +02:00
lstm_recode_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_squashed_test.cc Partially fix and enable more unittests 2019-01-27 06:49:57 +00:00
lstm_test.cc Remove member functions STRING::string and StringParam::string (continued) 2019-09-25 10:07:51 +02:00
lstm_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
lstmtrainer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
Makefile.am Improve disabled legacy engine build 2020-10-12 11:47:16 +01:00
mastertrainer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
matrix_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
networkio_test.cc unittest: Fix and enable networkio_test 2019-06-30 15:28:42 +02:00
normstrngs_test.cc unittest: Update comments in normstrngs_test.cc 2020-07-07 11:29:48 +02:00
normstrngs_test.h Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
nthitem_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
osd_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
pagesegmode_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
pango_font_info_test.cc Remove unused functions FontUtils::GetAllRenderableCharacters 2020-10-03 12:04:40 +02:00
paragraphs_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
params_model_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
progress_test.cc Don't check for limits.h (now unused) 2020-06-20 10:39:13 +02:00
qrsequence_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
README.md Provide more details for sucessfull running of unittests. 2019-10-20 22:15:21 +02:00
recodebeam_test.cc unittest: Remove unused local variables 2020-06-29 16:19:23 +02:00
rect_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
resultiterator_test.cc unittest: Remove unused local variables 2020-06-29 16:19:23 +02:00
scanutils_test.cc unittest: Fix two issues reported by Coverity Scan (CID 1402761, 1402755) 2019-07-10 16:38:30 +02:00
shapetable_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
stats_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
stridemap_test.cc unittest: Fix and enable stridemap_test 2019-06-30 15:30:03 +02:00
stringrenderer_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
tablefind_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
tablerecog_test.cc Remove redundant method from class GenericVector 2020-01-18 11:30:14 +01:00
tabvector_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
tatweel_test.cc unittest: Fix and enable tatweel_test 2019-07-07 10:59:27 +02:00
textlineprojection_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
tfile_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unichar_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unicharcompress_test.cc Use #include <tesseract/*.h> for unittest 2019-10-29 18:01:18 +01:00
unicharset_test.cc Run more unittests with the user's locale 2019-05-16 18:12:55 +02:00
validate_grapheme_test.cc unittest: Fix and enable validate_grapheme_test 2019-01-23 13:54:27 +01:00
validate_indic_test.cc unittest: Fix and enable validate_indic_test 2019-01-23 13:54:27 +01:00
validate_khmer_test.cc unittest: Fix and enable validate_khmer_test 2019-01-23 13:54:27 +01:00
validate_myanmar_test.cc unittest: Fix and enable validate_myanmar_test 2019-01-23 13:54:27 +01:00
validator_test.cc unittest: Format code 2018-09-29 09:19:13 +02:00

Unit Testing for Tesseract

Requirements

Files and structure


├── langdata_lstm
│   ├── common.punc
│   ├── common.unicharambigs
│   ├── desired_bigrams.txt
│   ├── eng
│   │   ├── desired_characters
│   │   ├── eng.config
│   │   ├── eng.numbers
│   │   ├── eng.punc
│   │   ├── eng.singles_text
│   │   ├── eng.training_text
│   │   ├── eng.unicharambigs
│   │   ├── eng.wordlist
│   │   └── okfonts.txt
│   ├── extended
│   │   └── extended.config
│   ├── extendedhin
│   │   └── extendedhin.config
│   ├── font_properties
│   ├── forbidden_characters_default
│   ├── hin
│   │   ├── hin.config
│   │   ├── hin.numbers
│   │   ├── hin.punc
│   │   └── hin.wordlist
│   ├── kan
│   │   └── kan.config
│   ├── kor
│   │   └── kor.config
│   ├── osd
│   │   └── osd.unicharset
│   └── radical-stroke.txt
├── tessdata
│   ├── ara.traineddata
│   ├── chi_tra.traineddata
│   ├── eng.traineddata
│   ├── heb.traineddata
│   ├── hin.traineddata
│   ├── jpn.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── vie.traineddata
├── tessdata_best
│   ├── eng.traineddata
│   ├── fra.traineddata
│   ├── kmr.traineddata
│   └── osd.traineddata
├── tessdata_fast
│   ├── eng.traineddata
│   ├── kmr.traineddata
│   ├── osd.traineddata
│   └── script
│       └── Latin.traineddata
└── tesseract
    ├── abseil
    ...
    ├── test
    ├── unittest
    └── VERSION

Fonts

Run tests

To run the tests, do the following in tesseract folder

autoreconf -fiv
git submodule update --init
export TESSDATA_PREFIX=/prefix/to/path/to/tessdata
make check