From 618489290536282a59ed7a71381ca083d9a7586a Mon Sep 17 00:00:00 2001 From: Stefan Weil Date: Thu, 8 Nov 2018 13:13:13 +0100 Subject: [PATCH 1/2] Remove redundant file COPYING Most of the information was already in README.md. Add the missing hint for Leptonica, too, so the file can now be removed. Signed-off-by: Stefan Weil --- COPYING | 21 --------------------- README.md | 11 +++++++---- 2 files changed, 7 insertions(+), 25 deletions(-) delete mode 100644 COPYING diff --git a/COPYING b/COPYING deleted file mode 100644 index be5ebaed..00000000 --- a/COPYING +++ /dev/null @@ -1,21 +0,0 @@ -This package contains the Tesseract Open Source OCR Engine. -Originally developed at Hewlett Packard Laboratories Bristol and -at Hewlett Packard Co, Greeley Colorado, all the code -in this distribution is now licensed under the Apache License: - -** Licensed under the Apache License, Version 2.0 (the "License"); -** you may not use this file except in compliance with the License. -** You may obtain a copy of the License at -** http://www.apache.org/licenses/LICENSE-2.0 -** Unless required by applicable law or agreed to in writing, software -** distributed under the License is distributed on an "AS IS" BASIS, -** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. -** See the License for the specific language governing permissions and -** limitations under the License. - - -Other Dependencies and Licenses: -================================ - -Tesseract uses Leptonica library (http://leptonica.com/) which essentially -uses a BSD 2-clause license. (http://leptonica.com/about-the-license.html) diff --git a/README.md b/README.md index 522b2fbb..620042fa 100644 --- a/README.md +++ b/README.md @@ -12,10 +12,10 @@ This package contains an **OCR engine** - `libtesseract` and a **command line program** - `tesseract`. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused -on line recognition, but also still supports the legacy Tesseract OCR engine of -Tesseract 3 which works by recognizing character patterns. Compatibility with -Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). -It also needs traineddata files which support the legacy engine, for example +on line recognition, but also still supports the legacy Tesseract OCR engine of +Tesseract 3 which works by recognizing character patterns. Compatibility with +Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). +It also needs traineddata files which support the legacy engine, for example those from the tessdata repository. The lead developer is Ray Smith. The maintainer is Zdenko Podobny. @@ -99,6 +99,9 @@ Please report an issue only for a **bug**, not for asking questions. **NOTE**: This software depends on other packages that may be licensed under different open source licenses. +Tesseract uses [Leptonica library](http://leptonica.com/) which essentially +uses a [BSD 2-clause license](http://leptonica.com/about-the-license.html). + ## Latest Version of README For the latest online version of the README.md see: From 73cefff3a16ccfba1f60f31deaf0ce7c745783f9 Mon Sep 17 00:00:00 2001 From: Stefan Weil Date: Thu, 8 Nov 2018 15:14:59 +0100 Subject: [PATCH 2/2] Include LICENSE file in distribution Signed-off-by: Stefan Weil --- Makefile.am | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/Makefile.am b/Makefile.am index bdbb7e87..5a4f3163 100644 --- a/Makefile.am +++ b/Makefile.am @@ -22,7 +22,7 @@ SUBDIRS = src/arch src/ccutil src/viewer src/cutil src/opencl src/ccstruct SUBDIRS += src/dict src/classify src/wordrec src/textord src/lstm SUBDIRS += src/ccmain src/api . tessdata doc unittest -EXTRA_DIST = README.md +EXTRA_DIST = README.md LICENSE EXTRA_DIST += aclocal.m4 config configure.ac autogen.sh EXTRA_DIST += tesseract.pc.in $(TRAINING_SUBDIR) java doc langtests unlvtests EXTRA_DIST += CMakeLists.txt tesseract.pc.cmake cmake VERSION src/vs2010 cppan.yml