2018-03-13 02:08:16 +08:00
SET_UNICHARSET_PROPERTIES(1)
============================
:doctype: manpage
NAME
----
set_unicharset_properties - set properties about the unichars
SYNOPSIS
--------
*set_unicharset_properties* --U 'input_unicharsetfile' --script_dir '/path/to/langdata' --O 'output_unicharsetfile'
DESCRIPTION
-----------
set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.
OPTIONS
-------
'--script_dir /path/to/langdata'::
(Input) Specify the location of directory for universal script unicharsets and font xheights (type:string default:)
2017-09-17 00:47:04 +08:00
2018-03-13 02:08:16 +08:00
'--U unicharsetfile'::
(Input) Specify the location of the unicharset to load as input.
2017-09-17 00:47:04 +08:00
2018-03-13 02:08:16 +08:00
'--O unicharsetfile'::
(Output) Specify the location of the unicharset to be written with updated properties.
HISTORY
-------
2017-09-17 00:47:04 +08:00
set_unicharset_properties(1) was first made available for tesseract version 3.03.
2018-03-13 02:08:16 +08:00
RESOURCES
---------
Main web site: <https://github.com/tesseract-ocr> +
2020-02-03 18:37:41 +08:00
Information on training: <https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html>
2017-09-17 00:47:04 +08:00
2018-03-13 02:08:16 +08:00
SEE ALSO
--------
tesseract(1)
COPYING
-------
Copyright \(C) 2012 Google, Inc.
Licensed under the Apache License, Version 2.0
AUTHOR
------
The Tesseract OCR engine was written by Ray Smith and his research groups
2024-05-03 21:44:03 +08:00
at Hewlett Packard (1985-1995) and Google (2006-2018).