diff --git a/4.0-with-LSTM.md b/4.0-with-LSTM.md
index ea9501a..bd40ce8 100644
--- a/4.0-with-LSTM.md
+++ b/4.0-with-LSTM.md
@@ -17,9 +17,6 @@ have information about LSTM integration in Tesseract 4.0.
 
 ## Training Tesseract LSTM engine
 
 * [TrainingTesseract 4.00](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00)
-* [TrainingTesseract 4.00 - Finetuning Example - Arabic](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Finetune)
-* [TrainingTesseract 4.00 - Replace Top Layer Example - Norwegian](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replacing-Top-Layer-Example)
-* [TrainingTesseract 4.00 - Replace Top Layer Example - Devanagari](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replace-Top-Layer)
 
 3.0 version of box files can be converted for use with LSTM training by adding a tab character at end of each line and boxes with space after each word. `Mark EOL` and `Mark EOL Bulk` functions under `Edit` in `Box Editor` tab of latest version of [jTessBoxEditor - jTessBoxEditor-2.0-Beta](https://sourceforge.net/projects/vietocr/files/jTessBoxEditor/) can be used to add the EOL tabs automatically. Insert mode can be used on last letter of each word to add a box with space. There is no automated way to do this.