Need space between words in box files

Shreeshrii 2017-05-22 21:03:11 +05:30
parent 91960b9506
commit ba2faafad0

@ -21,7 +21,7 @@ have information about LSTM integration in Tesseract 4.0.
* [TrainingTesseract 4.00 - Replace Top Layer Example - Norwegian](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replacing-Top-Layer-Example)
* [TrainingTesseract 4.00 - Replace Top Layer Example - Devanagari](https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replace-Top-Layer)
3.0 version of box files can be converted for use with LSTM training by adding a tab character at end of each line. `Mark EOL` and `Mark EOL Bulk` functions under `Edit` in `Box Editor` tab of latest version of [jTessBoxEditor - jTessBoxEditor-2.0-Beta](https://sourceforge.net/projects/vietocr/files/jTessBoxEditor/) can be used to do it automatically.
3.0 version of box files can be converted for use with LSTM training by adding a tab character at end of each line and boxes with space after each word. `Mark EOL` and `Mark EOL Bulk` functions under `Edit` in `Box Editor` tab of latest version of [jTessBoxEditor - jTessBoxEditor-2.0-Beta](https://sourceforge.net/projects/vietocr/files/jTessBoxEditor/) can be used to add the EOL tabs automatically. Insert mode can be used on last letter of each word to add a box with space. There is no automated way to do this.
## 4.0.0-alpha ppa