'\" t
.\"     Title: mftraining
.\"    Author: [FIXME: author] [see http://docbook.sf.net/el/author]
.\" Generator: DocBook XSL Stylesheets v1.75.2
.\"      Date: 09/30/2010
.\"    Manual: \ \&
.\"    Source: \ \&
.\"  Language: English
.\"
.TH "MFTRAINING" "1" "09/30/2010" "\ \&" "\ \&"
.\" -----------------------------------------------------------------
.\" * Define some portability stuff
.\" -----------------------------------------------------------------
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.\" http://bugs.debian.org/507673
.\" http://lists.gnu.org/archive/html/groff/2009-02/msg00013.html
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.ie \n(.g .ds Aq \(aq
.el       .ds Aq '
.\" -----------------------------------------------------------------
.\" * set default formatting
.\" -----------------------------------------------------------------
.\" disable hyphenation
.nh
.\" disable justification (adjust text to left margin only)
.ad l
.\" -----------------------------------------------------------------
.\" * MAIN CONTENT STARTS HERE *
.\" -----------------------------------------------------------------
.SH "NAME"
mftraining \- feature training for Tesseract
.SH "SYNOPSIS"
.sp
mftraining \-U \fIunicharset\fR \-O \fIlang\&.unicharset\fR \fIFILE\fR\&...
.SH "DESCRIPTION"
.sp
mftraining takes a list of \&.tr files, from which it generates the files inttemp (the shape prototypes) and pffmtable (the number of expected features for each character)\&. (A third file called Microfeat is also written by this program, but it is not used\&.)
.SH "OPTIONS"
.sp
\fI\-U\fR FILE The unicharset generated by unicharset_extractor
.sp
\fI\-O\fR FILE The output unicharset that will be given to combine_tessdata\&.
.SH "SEE ALSO"
.sp
tesseract(1), cntraining(1), unicharset_extractor(1), combine_tessdata(1)
.SH "COPYING"
.sp
Copyright (c) Hewlett\-Packard Company, 1988
.br
Licensed under the Apache License, Version 2\&.0
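.SH "EXAMPLES"
.sp
The synopsis and options above can be illustrated with a single command line\&. This is a minimal sketch, not a canonical recipe: it assumes a file named unicharset was already produced by unicharset_extractor(1), and both the eng prefix and the \&.tr file names are hypothetical placeholders chosen for illustration\&.
.sp
.if n \{\
.RS 4
.\}
.nf
# eng and the two \&.tr file names are hypothetical placeholders
mftraining \-U unicharset \-O eng\&.unicharset eng\&.arial\&.exp0\&.tr eng\&.times\&.exp0\&.tr
.fi
.if n \{\
.RE
.\}
.sp
Going by the description above, such a run generates inttemp and pffmtable (plus the unused Microfeat), and the file named with \-O is the unicharset later given to combine_tessdata(1)\&.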