theraysmith@gmail.com
|
d8d9b390d1
|
misc fixes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@961 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2014-01-09 17:49:07 +00:00 |
|
zdenop@gmail.com
|
92c0ba06de
|
fix issue 972
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@880 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 19:54:06 +00:00 |
|
theraysmith@gmail.com
|
4d514d5a60
|
Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 15:26:50 +00:00 |
|
david.eger@gmail.com
|
eeeb4f513c
|
Provide better paragraph segmentation without having to run fully
automatic layout analysis.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@725 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-10 00:03:34 +00:00 |
|
david.eger@gmail.com
|
c0cd2cd605
|
Restore VC++ compatibility for paragraphs.cpp.
Missed a __func__ addition in the last merge.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@707 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-21 16:41:27 +00:00 |
|
david.eger@gmail.com
|
a91778397b
|
Fix Issue 645, a char signed/unsigned issue in paragraphs.cpp.
When constructing our debug strings, our simple UTF-8 processing should skip all non-ASCII chars.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@706 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-20 20:19:00 +00:00 |
|
david.eger@gmail.com
|
22331c03ec
|
Fix issue 613: assert() fail on Windows isspace() when given non-ASCII.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@671 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-10 01:44:36 +00:00 |
|
zdenop@gmail.com
|
d0c2631ec8
|
VC++2008 build fix for 3.02 version
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@665 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-03 22:23:12 +00:00 |
|
theraysmith@gmail.com
|
3a998fe7ac
|
Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic, Added paragraph detection in layout analysis/post OCR, Fixed inconsistent xheight during training and over-chopping, Added simultaneous multi-language capability, Refactored top-level word recognition module, Fixed problems with internally scaled images
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@651 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-02 02:59:49 +00:00 |
|