Add explanation to --script-dir argument for set_unicharset_properties

2025-08-01 02:18:09 +08:00 · 2018-03-04 23:29:26 +10:00 · 2018-03-04 23:29:26 +10:00 · 807150c90d
commit 807150c90d
parent 617757b465
1 changed files with 3 additions and 1 deletions
--- a/Training-Tesseract-3.03–3.05.md
+++ b/Training-Tesseract-3.03–3.05.md
@ -241,12 +241,14 @@ unicharset_extractor lang.fontname.exp0.box lang.fontname.exp1.box ...
 *New in 3.03*
-This tool, together with a set of data files, allow the addition of extra properties in the unicharset, mostly sizes obtained from fonts.
+This tool, together with a set of data files, allow the addition of extra properties in the unicharset, mostly sizes obtained from fonts. 
 ```
 training/set_unicharset_properties -U input_unicharset -O output_unicharset --script_dir=training/langdata
 ```
 `--script-dir` should point to a directory containing the relevant .unicharset file(s) for your training character set. These can be downloaded from [https://github.com/tesseract-ocr/langdata](https://github.com/tesseract-ocr/langdata)).
 After running `unicharset_extractor` and `set_unicharset_properties`, you should get a `unicharset` file with all the fields set to the right values, like in this [example](#an-example-of-the-unicharset-file).
 ## The font\_properties file