zdenop@gmail.com
|
7e89c8d9db
|
count lines from 1 in APPLY_BOXES error message; remove not needed file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@901 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-01 12:42:54 +00:00 |
|
theraysmith@gmail.com
|
4c3475ad2e
|
Fixed fmemopen portability problem
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@890 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-10-10 02:07:26 +00:00 |
|
zdenop@gmail.com
|
ee08f623ce
|
fix issue 967
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@886 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-29 20:48:06 +00:00 |
|
zdenop@gmail.com
|
92c0ba06de
|
fix issue 972
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@880 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 19:54:06 +00:00 |
|
theraysmith@gmail.com
|
4d514d5a60
|
Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 15:26:50 +00:00 |
|
theraysmith@gmail.com
|
b0fb616299
|
Generalized feature extractor to allow fx from greyscale
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@875 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 15:19:50 +00:00 |
|
theraysmith@gmail.com
|
dfc1a92628
|
Refactored classifier to make it easier to add new ones
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@874 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 15:16:01 +00:00 |
|
theraysmith@gmail.com
|
2aafc9df24
|
Improved sub/superscript treatment
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@872 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-20 19:49:47 +00:00 |
|
zdenop@gmail.com
|
10c1169d98
|
remove unused code (Windows related)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@860 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-08 18:21:10 +00:00 |
|
zdenop@gmail.com
|
7e14ade10d
|
print error/warning messages to stderr/debug file instead of stdout (fix issue 911)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@843 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-05-16 20:31:37 +00:00 |
|
zdenop@gmail.com
|
16e80c06ee
|
Test for empty choices at ChoiceIterator (fix issue 826)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@840 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-05-02 08:13:22 +00:00 |
|
theraysmith@gmail.com
|
64c739c8af
|
Added sparse text mode, also fixed issue 653.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@820 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-01-03 19:06:41 +00:00 |
|
theraysmith@gmail.com
|
da1047f020
|
Fixed typos and improved comments
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@753 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-21 15:31:20 +00:00 |
|
theraysmith@gmail.com
|
f23460bec4
|
Removed config_auto.h from .h files
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@748 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-21 15:26:10 +00:00 |
|
theraysmith@gmail.com
|
7b90ed28d3
|
Fixed problem with NULL STRINGs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@745 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-21 15:22:22 +00:00 |
|
theraysmith@gmail.com
|
c2dbb28376
|
Fixed issues 714, 608
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@744 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-21 15:21:24 +00:00 |
|
zdenop@gmail.com
|
60b0d10e16
|
fix for issue 690
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@736 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-08-01 21:57:49 +00:00 |
|
david.eger@gmail.com
|
eeeb4f513c
|
Provide better paragraph segmentation without having to run fully
automatic layout analysis.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@725 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-10 00:03:34 +00:00 |
|
zdenop@gmail.com
|
e606c311f5
|
fix issue Issue 684 : show correct line in failure message "Couldn't find a matching blob"
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@723 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-04-22 20:51:00 +00:00 |
|
zdenop@gmail.com
|
cd8de9157c
|
change comments to doxygen block comments (api)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@716 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-30 21:24:12 +00:00 |
|
zdenop@gmail.com
|
5958f01f5f
|
fix doxygen warnings
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@715 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-30 15:42:06 +00:00 |
|
zdenop@gmail.com
|
d4d4b8aad8
|
improve autools system (mingw+msys fix); implementation of --disable-tessdata-prefix
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@708 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-22 20:01:33 +00:00 |
|
david.eger@gmail.com
|
c0cd2cd605
|
Restore VC++ compatibility for paragraphs.cpp.
Missed a __func__ addition in the last merge.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@707 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-21 16:41:27 +00:00 |
|
david.eger@gmail.com
|
a91778397b
|
Fix Issue 645, a char signed/unsigned issue in paragraphs.cpp.
When constructing our debug strings, our simple UTF-8 processing should skip all non-ASCII chars.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@706 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-20 20:19:00 +00:00 |
|
zdenop@gmail.com
|
2972cc426b
|
+ fix VS2008 warning about "non dll-interface class tesseract::LTRResultIterator used as base for dll-interface class tesseract::ResultIterator" by making LTRResultIterator also visible.
+ Changed Project preprocessor definition of WINDLLNAME, because stringizing operator doesn't seem to work when initializing tessedit_module_name in ccutil/ccutil.cpp (which was omitted in previous fixes).
+ Update vs2008/tesshelper.py for new public header files.
patch from Tom Powers (https://groups.google.com/group/tesseract-dev/msg/6da2799cd2cb9844)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@702 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-08 21:15:13 +00:00 |
|
zdenop@gmail.com
|
2f1c112640
|
+Remove visibility from protected members of tesseract::TessBaseAPI class by applying TESS_LOCAL macro;
+Make PageIterator & ResultIterator classes visible by applying TESS_API macro;
+Fix api/Makefile.am & training/Makefile.am to allow Parallel Build Trees;
patch from Tom Powers (https://groups.google.com/group/tesseract-dev/msg/9d00579540e44055)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@701 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-07 22:04:46 +00:00 |
|
zdenop@gmail.com
|
1455bf5610
|
set tessedit_module_name for windows;
implement 'make install LANG="eng ara deu"';
more headers need to be installed: https://groups.google.com/group/tesseract-dev/msg/a4f7424377993b2e
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@700 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-06 22:41:43 +00:00 |
|
david.eger@gmail.com
|
75a9a8fae7
|
Address "RIL_PARA doesn't work" comment in issue 622.
http://code.google.com/p/tesseract-ocr/issues/detail?id=622
The core of the problem is that in PSM_SINGLE_BLOCK mode, Tesseract
doesn't run paragraph detection, so no paragraphs get generated. Here,
we make sure that even if run in a mode where no paragraphs get
generated, we treat each block as its own paragraph.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@696 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-06 20:02:57 +00:00 |
|
zdenop@gmail.com
|
97e19443a3
|
install only necessary headers, fix uninstall
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@692 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-03 13:22:51 +00:00 |
|
zdenop@gmail.com
|
30a70142a0
|
visibility - autotools part (./configure --enable-visibility)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@690 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-02 23:51:33 +00:00 |
|
zdenop@gmail.com
|
e216adab43
|
fix configure.ac; unify identifiers (WIN32 vs _WIN32)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@688 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-02 17:31:24 +00:00 |
|
zdenop@gmail.com
|
49c4ce3183
|
fix for GRAPHICS_DISABLED build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@686 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-01 22:43:51 +00:00 |
|
zdenop@gmail.com
|
6ccab83bd6
|
fixing issue 628 (replacing __MSW32__ with _WIN32) and issue 614 (reverting "class DLLSYM STRING" to "class CCUTIL_API STRING")
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@677 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-19 21:48:45 +00:00 |
|
david.eger@gmail.com
|
018f192fc2
|
Abolish populate_unichars(), fixing seg fault reported in Debian:
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=658634
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@675 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-15 01:37:00 +00:00 |
|
david.eger@gmail.com
|
22331c03ec
|
Fix issue 613: assert() fail on Windows isspace() when given non-ASCII.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@671 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-10 01:44:36 +00:00 |
|
david.eger@gmail.com
|
78a8356a76
|
Put one last bigram correction debug statement behind a debug flag.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@669 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-09 20:08:17 +00:00 |
|
zdenop@gmail.com
|
d0c2631ec8
|
VC++2008 build fix for 3.02 version
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@665 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-03 22:23:12 +00:00 |
|
david.eger@gmail.com
|
56bc885721
|
Fix some debug messaging about bigram correction -- the two lists of
alternates are not independent.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@664 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-03 19:43:25 +00:00 |
|
theraysmith@gmail.com
|
3a998fe7ac
|
Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic, Added paragraph detection in layout analysis/post OCR, Fixed inconsistent xheight during training and over-chopping, Added simultaneous multi-language capability, Refactored top-level word recognition module, Fixed problems with internally scaled images
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@651 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-02 02:59:49 +00:00 |
|
theraysmith@gmail.com
|
ac014eb27a
|
Added experimental equation detector
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@646 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-02 02:50:01 +00:00 |
|
theraysmith@gmail.com
|
ef786ad29b
|
Moved ResultIterator/PageIterator to ccmain
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@645 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-02-02 02:47:59 +00:00 |
|
zdenop@gmail.com
|
67f47008c7
|
fixed "one lib" build on linux; runautoconf renamed to autogen.sh;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@631 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-10-16 19:39:54 +00:00 |
|
zdenop@gmail.com
|
da41b96f7f
|
removed check for libtiff - leptonica is required; cleanup #ifdef/#ifndef HAVE_LIBLEPT
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@624 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-30 06:34:41 +00:00 |
|
joregan@gmail.com
|
bf4a09d72a
|
make single/multiple libraries optional -- this needs testing!!!
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@623 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-29 21:28:28 +00:00 |
|
theraysmith@gmail.com
|
4575c52ff5
|
Removed debugwin.cpp, fixing issue 448
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@613 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-18 16:45:59 +00:00 |
|
theraysmith@gmail.com
|
d5d15f32d7
|
Deleted Makefile.in from svn
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@606 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-18 16:32:44 +00:00 |
|
zdenop@gmail.com
|
9b7375edd6
|
MinGW portability solved + some code cleanup (based on cpplint)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@605 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-15 19:28:25 +00:00 |
|
zdenop@gmail.com
|
7ec3dca968
|
show page 0 for multipage tiff;
Windows: use binary mode for fopen (issue 70);
autotools: fixed cutil/Makefile.am, improved tessdata/Makefile.am;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@604 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-11 21:42:13 +00:00 |
|
zdenop@gmail.com
|
4abdfdb8fe
|
moved ccstruct/callcpp.cpp to cutil (to header file - see issue 414); moved vs2008/include/stdint.h to vs2008/port/stdint.h so we can use vs2008/include also for mingw; removed unused tessembedded.*
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@603 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-08-11 14:04:20 +00:00 |
|
theraysmith
|
3e8c0bc228
|
Various fixes, including memory leak in fixspace, font labels on output, removed some annoying debug output, fixes to initialization of parameters, general cleanup, and added Hindi
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@567 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2011-03-21 21:44:05 +00:00 |
|