Commit Graph

396 Commits

Author SHA1 Message Date
theraysmith
137f4806b6 Added sub/superscript, small/dropcap detection
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@547 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 01:32:20 +00:00
zdenop@gmail.com
d8a2303daf improved makemoredists script and tessdata/Makefile.am
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@546 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 13:33:45 +00:00
zdenop@gmail.com
8cf7660e3a - vs2008 removed from autoconf system (not needed for linux);
Windows build improvements:
- vs2008 library projects use individual intermediate directories (thanks fizz_20120);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@544 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 09:48:06 +00:00
zdenop@gmail.com
c707b26d5f fixed VC++2008 Express build after last changes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@543 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 12:46:41 +00:00
theraysmith
e0af88a197 Debian packages of Leptonica to allow use of 1.67
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@542 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:35:37 +00:00
theraysmith
7a54f5f950 Misc Makefile etc for 3.01
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@541 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:30:09 +00:00
theraysmith
dbcab0eed3 Traineddata for non-Eng languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@540 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:07:46 +00:00
theraysmith
5c854e03ea Cleaned up unused parameters
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@539 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:06:44 +00:00
theraysmith
c8465252e4 Rewrite of DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@538 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:05:48 +00:00
theraysmith
b98c922391 Fixed problem with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@537 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:04:02 +00:00
theraysmith
12ddf5d18e Removal of NEWDELETE + fix of problem with joined text
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@536 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:03:21 +00:00
theraysmith
23b29fbe9a Impact of DENORM rewrite + removal of NEWDELETE
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@535 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:02:28 +00:00
theraysmith
048a3efd5e Fixed bug with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@534 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:01:13 +00:00
theraysmith
eba04e7c5b Fixed debug display, training on fragments
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@533 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:00:17 +00:00
theraysmith
ef59841ebe Moved multipage code to BaseAPI and tidied up command line handling
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@532 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:58:30 +00:00
theraysmith
47dc322437 Removed serialise and NEWDELETE macro
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@531 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:39 +00:00
theraysmith
1c38920f9d Rewrote DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@530 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:08 +00:00
theraysmith
4c4d036ee4 Removed serialize and NEWDELETE macros
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@529 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:53:31 +00:00
zdenop@gmail.com
f07d5723f2 removed vs2008/dlltest/Makefile from linux configure
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@528 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-29 09:32:47 +00:00
zdenop@gmail.com
70cf76ffcb gettimeofday for windows
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@527 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 19:13:51 +00:00
zdenop@gmail.com
4523ce9f7d 3.01 code from http://github.com/jimregan/tesseract-ocr with addaptions related to Linux and Windows (VC2008) compile process
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@526 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 18:34:14 +00:00
zdenop@gmail.com
7511d76315 fixed hocr to produce valid document (acording http://validator.w3.org/) - issue http://code.google.com/p/tesseract-ocr/issues/detail?id=401
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@525 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-17 20:03:58 +00:00
zdenop@gmail.com
515ac2d3f0 WIN32: improved vcproj files - possibility to build tesseract with static linked leptonica (Release.static/Release) or dynamic linked leptonica (Release.dynamic)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@521 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-15 11:22:58 +00:00
zdenop@gmail.com
179c96beff updated vcproj files: Release|Win32 links leptonica statically, suppressed some warnings;
added include/leptonica/sudoku.h (part of leptonica 1.67);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@520 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 21:31:37 +00:00
zdenop@gmail.com
80b9e44e95 upgrade of leptonica to 1.67 version; removing unnecessary header files from vs2008;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@519 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 20:27:49 +00:00
zdenop@gmail.com
a750ffed7a fixed issue 394: The tessedit_pageseg_mode does not work; thanks sms@fritzwidmer.ch
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@517 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-02 20:52:58 +00:00
max.markin@gmail.com
459ff62293 added quotes in xcopy
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@516 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 21:26:07 +00:00
zdenop@gmail.com
fa4d4589cb fixed hocr (escape special special characters; thank to aizvorski) + hocr config)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@515 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 19:03:06 +00:00
zdenop@gmail.com
346da8c1e5 missing returns in nonvoid functions (thanks to rusnakp) issue 389;
corrected windows installation script - tesseract should be not run as start-up application;

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@514 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-26 13:35:02 +00:00
zdenop@gmail.com
21d6ea66c2 better handlig of multipage tiff (issue 380 http://code.google.com/p/tesseract-ocr/issues/detail?id=380)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@513 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-22 06:13:54 +00:00
zdenop@gmail.com
8522533689 improved NSI script: finalization for tesseract-ocr-setup-3.00.exe
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@512 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-12 19:09:59 +00:00
zdenop@gmail.com
e54ab01f19 improved NSI script: select language data based on windows system language, download and install non English languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@511 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-11 22:45:09 +00:00
zdenop@gmail.com
5e37a948ba improved NSI script: detection of installed tesseract; set environment variables TESSDATA_PREFIX, PATH; recognition of Tesseract-OCR by "Add/Remove" program in Windows Control panel
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@510 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-10 21:59:34 +00:00
joregan
825ee3a767 Add NSIS script, contributed by Sergey Bronnikov
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@509 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-07 12:33:38 +00:00
zdenop@gmail.com
c64787f835 another addition to Revision: 506 ;-)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@508 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:56:44 +00:00
zdenop@gmail.com
da06ed4075 addition to Revision: 506 ;-)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@507 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:48:42 +00:00
zdenop@gmail.com
282aa13975 *.vcproj moved to vs2008/ (bin/ and bin.dbg/ will be in vs2008/)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@506 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:38:19 +00:00
max.markin@gmail.com
2c06ebc9dc added copying leptonlib.dll to binaries folder
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@505 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-05 02:34:05 +00:00
max.markin@gmail.com
fe6e6e098e fix for vs2008 release mode compilation error in classify
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@504 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-03 18:24:40 +00:00
joregan
23504403e8 add Makefile.in to check the merge
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@502 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 03:54:40 +00:00
joregan
6f1353fdad comment more; I'll do something proper about the data packs later, but for now we just need something that'll build generally *without* requiring that the language packs be present
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@501 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 00:55:48 +00:00
joregan
fd3a4a44d3 remove the makedummies stuff
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@500 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 00:00:23 +00:00
joregan
e614075136 Zdenko caught one I missed
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@497 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 21:47:25 +00:00
joregan
f1c1cd45e4 typo
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@496 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 21:45:50 +00:00
zdenop@gmail.com
3964660093 update of VC++ project file to recent changes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@495 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 21:20:00 +00:00
joregan
eea51fca91 VC++ truly is braindamaged
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@494 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 20:40:54 +00:00
joregan
9b3d565be0 add vs2008 to the distribution
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@493 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 17:24:13 +00:00
joregan
e0b07948fc disabling gettext checks - not currently used, and something about disabling is causing subsequent autoconf checks to not run
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@492 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 16:27:39 +00:00
joregan
0c32fdea0d add BSD-licensed version of stdint.h for VC++
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@491 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 15:58:15 +00:00
joregan
f2506871f9 move include of config_auto.h to not conflict with local types. Not finished
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@490 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 15:53:40 +00:00