Commit Graph

4863 Commits

Author SHA1 Message Date
zdenop@gmail.com
a776e0be85 TP: visibility trial - code & windows build changes (without autotools changes)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@689 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-02 17:48:45 +00:00
zdenop@gmail.com
e216adab43 fix configure.ac; unify identifiers (WIN32 vs _WIN32)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@688 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-02 17:31:24 +00:00
zdenop@gmail.com
657722aeca added missing changes for r686
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@687 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-01 23:19:35 +00:00
zdenop@gmail.com
49c4ce3183 fix for GRAPHICS_DISABLED build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@686 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-01 22:43:51 +00:00
zdenop
06b2156a99 fixed makemoredists; add --enable-embedded to configure
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@685 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-01 12:40:04 +00:00
zdenop@gmail.com
df1cbdd7d3 fix for issue 463 (GetHOCRText segfaults unless SetInputName has been called first); removed declaration of GetLastInitLanguage
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@684 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-27 17:19:20 +00:00
zdenop@gmail.com
bf7ca288ac fixed 635 (strngs.h has unnecessary include of genericvector.h)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@682 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-26 16:39:01 +00:00
zdenop@gmail.com
da121f013c vs2008 and vs2010 replaced with Tom Powers solution
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@681 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-26 15:30:05 +00:00
zdenop@gmail.com
492f9119c2 check return code of API init (issue 593)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@680 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-26 14:48:35 +00:00
zdenop@gmail.com
132909a607 fix for issue 631: gettimeofday() on windows based on leptonica l_getCurrentTime()
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@679 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-21 21:38:45 +00:00
zdenop@gmail.com
95168ef064 fix missing ";" in VS2008 project files + fix VS2010
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@678 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-20 13:12:45 +00:00
zdenop@gmail.com
6ccab83bd6 fixing issue 628 (replacing __MSW32__ with _WIN32) and issue 614 (reverting "class DLLSYM STRING" to "class CCUTIL_API STRING")
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@677 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-19 21:48:45 +00:00
zdenop@gmail.com
61611c1990 removed unnecessary conditional
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@676 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-18 09:18:06 +00:00
david.eger@gmail.com
018f192fc2 Abolish populate_unichars(), fixing seg fault reported in Debian:
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=658634



git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@675 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-15 01:37:00 +00:00
zdenop@gmail.com
53d133d83a fixed cntraning thanks to Wil Hadden; fixed installation of new manpages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@674 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-12 16:03:05 +00:00
zdenop@gmail.com
3c4fd30bb5 Fix is isinf for VC++
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@673 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-12 14:51:28 +00:00
david.eger@gmail.com
22331c03ec Fix issue 613: assert() fail on Windows isspace() when given non-ASCII.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@671 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-10 01:44:36 +00:00
david.eger@gmail.com
58e06c8c45 Update man pages for Tesseract 3.02.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@670 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-09 22:55:47 +00:00
david.eger@gmail.com
78a8356a76 Put one last bigram correction debug statement behind a debug flag.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@669 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-09 20:08:17 +00:00
zdenop@gmail.com
1355cabe7e VS2008 - fix include path for release*
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@668 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-07 13:52:33 +00:00
zdenop@gmail.com
425c2b8205 install data files; small fix of INSTALL, README; removed ABOUT-NLS (NLS not used at the moment)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@667 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-05 16:25:40 +00:00
zdenop@gmail.com
0a50c9ca5c Another VS2008 fixes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@666 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-04 22:06:40 +00:00
zdenop@gmail.com
d0c2631ec8 VC++2008 build fix for 3.02 version
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@665 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-03 22:23:12 +00:00
david.eger@gmail.com
56bc885721 Fix some debug messaging about bigram correction -- the two lists of
alternates are not independent.



git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@664 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-03 19:43:25 +00:00
theraysmith@gmail.com
09e41d32c2 Renamed RGB to ComposeRGB to fix windows macro problem
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@663 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-03 16:52:25 +00:00
theraysmith@gmail.com
d581ab7e12 New config for testing bigram correction.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@661 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 18:46:19 +00:00
david.eger@gmail.com
ad53f34e7c Added a missing header file for the 3.02 release.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@659 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 18:01:17 +00:00
theraysmith@gmail.com
e0d735b122 Remaining misc changes for 3.02
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@658 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:14:43 +00:00
theraysmith@gmail.com
23dfabcab1 Cleaned up externally used namespace by removing includes from baseapi.h
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@657 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:14:16 +00:00
theraysmith@gmail.com
6e273b71bd Cube trained data for fra, ita, rus, spa
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@656 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:08:26 +00:00
theraysmith@gmail.com
9206e92b0d Added simultaneous multi-language capability, Refactored top-level word recognition module, Blamer module added for error analysis, Tidied up constraints on control parameters, Added UNICHARSET to WERD_CHOICE to make mult-language handling easier, Added word bigram correction
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@655 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:06:39 +00:00
theraysmith@gmail.com
73adf693d5 Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic, Refactored top-level word recognition module, Added simultaneous multi-language capability.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@654 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:03:56 +00:00
theraysmith@gmail.com
e33ae59f4d Fixed training leaks and randomness
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@653 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:02:16 +00:00
theraysmith@gmail.com
01026af5a2 Refactored top-level word recognition module, Blamer module added for error analysis, Added word bigram correction
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@652 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 03:01:38 +00:00
theraysmith@gmail.com
3a998fe7ac Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic, Added paragraph detection in layout analysis/post OCR, Fixed inconsistent xheight during training and over-chopping, Added simultaneous multi-language capability, Refactored top-level word recognition module, Fixed problems with internally scaled images
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@651 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:59:49 +00:00
theraysmith@gmail.com
5bc5e2a0b4 Added simultaneous multi-language capability, Added support for ShapeTable in classifier and training, Refactored class pruner, Added new uniform classifier API, Added new training error counter
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@650 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:57:42 +00:00
theraysmith@gmail.com
fdd4ffe85e Fixed endian bug in dawg reader, Added word bigram correction,
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@649 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:56:18 +00:00
theraysmith@gmail.com
6e3d810c1d Major improvements to layout analysis for better image detection, diacritic detection, better textline finding, better tabstop finding
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@648 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:53:04 +00:00
theraysmith@gmail.com
04068c7055 Removed dead memory mangagement code
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@647 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:51:56 +00:00
theraysmith@gmail.com
ac014eb27a Added experimental equation detector
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@646 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:50:01 +00:00
theraysmith@gmail.com
ef786ad29b Moved ResultIterator/PageIterator to ccmain
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@645 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:47:59 +00:00
zdenop@gmail.com
8225f5b846 removed BOM form strngs.h, updated NSIS script and COPYING
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@639 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-22 18:27:31 +00:00
theraysmith@gmail.com
aae3da5bf1 Last minute fixes for making the tarball
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@636 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-22 05:28:44 +00:00
zdenop@gmail.com
db2aa4e73f svpaint.cpp moved from include to source
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@632 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-16 20:23:49 +00:00
zdenop@gmail.com
67f47008c7 fixed "one lib" build on linux; runautoconf renamed to autogen.sh;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@631 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-16 19:39:54 +00:00
max.markin@gmail.com
bf3ae643e5 Fixed some warnings to make the VC2010 compiler happy:
C4355: 'this' : used in base member initializer list
C4099: type name first seen using 'class' now seen using 'struct'

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@630 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-15 22:26:34 +00:00
max.markin@gmail.com
0fef845950 VC2010: add support for dynamic linking
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@629 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-15 22:17:19 +00:00
max.markin@gmail.com
cfc7de1420 fixed debug build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@628 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-10-07 03:49:25 +00:00
zdenop@gmail.com
ab234da926 fix for issue 540
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@627 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-09-29 21:25:57 +00:00
max.markin@gmail.com
7c4461316a fixed comment
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@626 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-09-18 05:12:37 +00:00