mirror of
https://github.com/tesseract-ocr/tesseract.git
synced 2024-12-03 00:49:01 +08:00
0f9d507740
The last contribution from Google was in 2018
(see commit ce88adbf32
).
Signed-off-by: Stefan Weil <sw@weilnetz.de>
33 lines
796 B
Plaintext
33 lines
796 B
Plaintext
AMBIGUOUS_WORDS(1)
|
|
==================
|
|
:doctype: manpage
|
|
|
|
NAME
|
|
----
|
|
ambiguous_words - generate sets of words Tesseract is likely to find ambiguous
|
|
|
|
SYNOPSIS
|
|
--------
|
|
*ambiguous_words* [-l lang] 'TESSDATADIR' 'WORDLIST' 'AMBIGUOUSFILE'
|
|
|
|
DESCRIPTION
|
|
-----------
|
|
ambiguous_words(1) runs Tesseract in a special mode, and for each word
|
|
in word list, produces a set of words which Tesseract thinks might be
|
|
ambiguous with it. 'TESSDATADIR' must be set to the absolute path of
|
|
a directory containing 'tessdata/lang.traineddata'.
|
|
|
|
SEE ALSO
|
|
--------
|
|
tesseract(1)
|
|
|
|
COPYING
|
|
-------
|
|
Copyright \(C) 2012 Google, Inc.
|
|
Licensed under the Apache License, Version 2.0
|
|
|
|
AUTHOR
|
|
------
|
|
The Tesseract OCR engine was written by Ray Smith and his research groups
|
|
at Hewlett Packard (1985-1995) and Google (2006-2018).
|