From ff05e909053c9e2a57c434c2f33fbf8044f2beba Mon Sep 17 00:00:00 2001 From: Shreeshrii Date: Thu, 4 May 2017 09:43:51 +0530 Subject: [PATCH] Updated Command Line Usage (markdown) --- Command-Line-Usage.md | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/Command-Line-Usage.md b/Command-Line-Usage.md index 64a96c0..71f13a7 100644 --- a/Command-Line-Usage.md +++ b/Command-Line-Usage.md @@ -1,6 +1,6 @@ ## [Tesseract 'man' page](https://github.com/tesseract-ocr/tesseract/blob/master/doc/tesseract.1.asc) -This page has not been updated for Tesseract 4.0. +This page has not been (fully) updated for Tesseract 4.0. ## Tesseract --help @@ -44,6 +44,12 @@ This page has not been updated for Tesseract 4.0. Use the config variable as part of command `-c tessedit_page_number=0 ` +## Integrate original image file and detected text into searchable PDF + + Use the config variable `-c textonly_pdf=1` and Merge your image-only and text-only PDF. + + see https://github.com/tesseract-ocr/tesseract/issues/660#issuecomment-274213632 for details + ## Simplest Invocation to OCR an image tesseract imagename outputbase