mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2025-08-01 02:18:09 +08:00
Add explanation to --script-dir argument for set_unicharset_properties
parent
617757b465
commit
807150c90d
@ -241,12 +241,14 @@ unicharset_extractor lang.fontname.exp0.box lang.fontname.exp1.box ...
|
|||||||
|
|
||||||
*New in 3.03*
|
*New in 3.03*
|
||||||
|
|
||||||
This tool, together with a set of data files, allow the addition of extra properties in the unicharset, mostly sizes obtained from fonts.
|
This tool, together with a set of data files, allow the addition of extra properties in the unicharset, mostly sizes obtained from fonts.
|
||||||
|
|
||||||
```
|
```
|
||||||
training/set_unicharset_properties -U input_unicharset -O output_unicharset --script_dir=training/langdata
|
training/set_unicharset_properties -U input_unicharset -O output_unicharset --script_dir=training/langdata
|
||||||
```
|
```
|
||||||
|
|
||||||
|
`--script-dir` should point to a directory containing the relevant .unicharset file(s) for your training character set. These can be downloaded from [https://github.com/tesseract-ocr/langdata](https://github.com/tesseract-ocr/langdata)).
|
||||||
|
|
||||||
After running `unicharset_extractor` and `set_unicharset_properties`, you should get a `unicharset` file with all the fields set to the right values, like in this [example](#an-example-of-the-unicharset-file).
|
After running `unicharset_extractor` and `set_unicharset_properties`, you should get a `unicharset` file with all the fields set to the right values, like in this [example](#an-example-of-the-unicharset-file).
|
||||||
|
|
||||||
## The font\_properties file
|
## The font\_properties file
|
||||||
|
Loading…
Reference in New Issue
Block a user