Commit Graph

5603 Commits

Author SHA1 Message Date
theraysmith
7121e51422 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@556 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:52:08 +00:00
theraysmith
a3f30eb5c7 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@555 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:51:34 +00:00
theraysmith
0d81f4b649 Fixed problem that was preventing pagesegmode from being set by config file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@554 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:43:38 +00:00
zdenop@gmail.com
9067a817f9 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@553 d0cd1f9f-072b-0410-8dd7-cf729c803f20 2011-03-16 14:08:25 +00:00
zdenop@gmail.com
de987bc6e1 windows build: improved vs2008/viewer.vcproj/ issue 413
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@550 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-01-03 14:29:00 +00:00
theraysmith
f040994f51 Fixed closing meta element in hocr output
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@549 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 06:25:20 +00:00
theraysmith
a7db6dada9 Fix for linking with leptonica on Linux.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@548 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 01:40:39 +00:00
theraysmith
137f4806b6 Added sub/superscript, small/dropcap detection
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@547 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 01:32:20 +00:00
zdenop@gmail.com
d8a2303daf improved makemoredists script and tessdata/Makefile.am
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@546 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 13:33:45 +00:00
zdenop@gmail.com
8cf7660e3a - vs2008 removed from autoconf system (not needed for linux);
Windows build improvements:
- vs2008 library projects use individual intermediate directories (thanks fizz_20120);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@544 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 09:48:06 +00:00
zdenop@gmail.com
c707b26d5f fixed VC++2008 Express build after last changes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@543 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 12:46:41 +00:00
theraysmith
e0af88a197 Debian packages of Leptonica to allow use of 1.67
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@542 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:35:37 +00:00
theraysmith
7a54f5f950 Misc Makefile etc for 3.01
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@541 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:30:09 +00:00
theraysmith
dbcab0eed3 Traineddata for non-Eng languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@540 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:07:46 +00:00
theraysmith
5c854e03ea Cleaned up unused parameters
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@539 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:06:44 +00:00
theraysmith
c8465252e4 Rewrite of DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@538 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:05:48 +00:00
theraysmith
b98c922391 Fixed problem with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@537 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:04:02 +00:00
theraysmith
12ddf5d18e Removal of NEWDELETE + fix of problem with joined text
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@536 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:03:21 +00:00
theraysmith
23b29fbe9a Impact of DENORM rewrite + removal of NEWDELETE
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@535 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:02:28 +00:00
theraysmith
048a3efd5e Fixed bug with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@534 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:01:13 +00:00
theraysmith
eba04e7c5b Fixed debug display, training on fragments
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@533 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:00:17 +00:00
theraysmith
ef59841ebe Moved multipage code to BaseAPI and tidied up command line handling
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@532 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:58:30 +00:00
theraysmith
47dc322437 Removed serialise and NEWDELETE macro
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@531 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:39 +00:00
theraysmith
1c38920f9d Rewrote DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@530 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:08 +00:00
theraysmith
4c4d036ee4 Removed serialize and NEWDELETE macros
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@529 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:53:31 +00:00
zdenop@gmail.com
f07d5723f2 removed vs2008/dlltest/Makefile from linux configure
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@528 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-29 09:32:47 +00:00
zdenop@gmail.com
70cf76ffcb gettimeofday for windows
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@527 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 19:13:51 +00:00
zdenop@gmail.com
4523ce9f7d 3.01 code from http://github.com/jimregan/tesseract-ocr with addaptions related to Linux and Windows (VC2008) compile process
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@526 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 18:34:14 +00:00
zdenop@gmail.com
7511d76315 fixed hocr to produce valid document (acording http://validator.w3.org/) - issue http://code.google.com/p/tesseract-ocr/issues/detail?id=401
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@525 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-17 20:03:58 +00:00
zdenop@gmail.com
515ac2d3f0 WIN32: improved vcproj files - possibility to build tesseract with static linked leptonica (Release.static/Release) or dynamic linked leptonica (Release.dynamic)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@521 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-15 11:22:58 +00:00
zdenop@gmail.com
179c96beff updated vcproj files: Release|Win32 links leptonica statically, suppressed some warnings;
added include/leptonica/sudoku.h (part of leptonica 1.67);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@520 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 21:31:37 +00:00
zdenop@gmail.com
80b9e44e95 upgrade of leptonica to 1.67 version; removing unnecessary header files from vs2008;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@519 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 20:27:49 +00:00
zdenop@gmail.com
a750ffed7a fixed issue 394: The tessedit_pageseg_mode does not work; thanks sms@fritzwidmer.ch
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@517 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-02 20:52:58 +00:00
max.markin@gmail.com
459ff62293 added quotes in xcopy
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@516 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 21:26:07 +00:00
zdenop@gmail.com
fa4d4589cb fixed hocr (escape special special characters; thank to aizvorski) + hocr config)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@515 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 19:03:06 +00:00
zdenop@gmail.com
346da8c1e5 missing returns in nonvoid functions (thanks to rusnakp) issue 389;
corrected windows installation script - tesseract should be not run as start-up application;

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@514 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-26 13:35:02 +00:00
zdenop@gmail.com
21d6ea66c2 better handlig of multipage tiff (issue 380 http://code.google.com/p/tesseract-ocr/issues/detail?id=380)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@513 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-22 06:13:54 +00:00
zdenop@gmail.com
8522533689 improved NSI script: finalization for tesseract-ocr-setup-3.00.exe
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@512 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-12 19:09:59 +00:00
zdenop@gmail.com
e54ab01f19 improved NSI script: select language data based on windows system language, download and install non English languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@511 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-11 22:45:09 +00:00
zdenop@gmail.com
5e37a948ba improved NSI script: detection of installed tesseract; set environment variables TESSDATA_PREFIX, PATH; recognition of Tesseract-OCR by "Add/Remove" program in Windows Control panel
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@510 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-10 21:59:34 +00:00
joregan
825ee3a767 Add NSIS script, contributed by Sergey Bronnikov
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@509 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-07 12:33:38 +00:00
zdenop@gmail.com
c64787f835 another addition to Revision: 506 ;-)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@508 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:56:44 +00:00
zdenop@gmail.com
da06ed4075 addition to Revision: 506 ;-)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@507 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:48:42 +00:00
zdenop@gmail.com
282aa13975 *.vcproj moved to vs2008/ (bin/ and bin.dbg/ will be in vs2008/)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@506 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-06 21:38:19 +00:00
max.markin@gmail.com
2c06ebc9dc added copying leptonlib.dll to binaries folder
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@505 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-05 02:34:05 +00:00
max.markin@gmail.com
fe6e6e098e fix for vs2008 release mode compilation error in classify
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@504 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-03 18:24:40 +00:00
joregan
23504403e8 add Makefile.in to check the merge
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@502 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 03:54:40 +00:00
joregan
6f1353fdad comment more; I'll do something proper about the data packs later, but for now we just need something that'll build generally *without* requiring that the language packs be present
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@501 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 00:55:48 +00:00
joregan
fd3a4a44d3 remove the makedummies stuff
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@500 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-01 00:00:23 +00:00
joregan
e614075136 Zdenko caught one I missed
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@497 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-09-30 21:47:25 +00:00