Commit Graph

2464 Commits

Author SHA1 Message Date
theraysmith
3e8c0bc228 Various fixes, including memory leak in fixspace, font labels on output, removed some annoying debug output, fixes to initialization of parameters, general cleanup, and added Hindi
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@567 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-21 21:44:05 +00:00
theraysmith
c81483f714 Various fixes, including memory leak in fixspace, font labels on output, removed some annoying debug output, fixes to initialization of parameters, general cleanup, and added Hindi
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@566 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-21 21:43:04 +00:00
theraysmith
96ca745384 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@565 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 22:14:53 +00:00
theraysmith
0a4d382131 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@564 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 22:14:10 +00:00
theraysmith
ec39052274 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@563 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:54:54 +00:00
theraysmith
33b346991a Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@562 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:54:32 +00:00
theraysmith
06dda0009e Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@561 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:53:53 +00:00
theraysmith
7cd3c74419 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@560 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:53:35 +00:00
theraysmith
df738bb9a4 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@559 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:53:11 +00:00
theraysmith
06c9739790 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@558 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:52:57 +00:00
theraysmith
5a779704da Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@557 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:52:38 +00:00
theraysmith
7121e51422 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@556 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:52:08 +00:00
theraysmith
a3f30eb5c7 Deleted lots of dead code, including PBLOB
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@555 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:51:34 +00:00
theraysmith
0d81f4b649 Fixed problem that was preventing pagesegmode from being set by config file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@554 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-18 21:43:38 +00:00
zdenop@gmail.com
9067a817f9 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@553 d0cd1f9f-072b-0410-8dd7-cf729c803f20 2011-03-16 14:08:25 +00:00
zdenop@gmail.com
de987bc6e1 windows build: improved vs2008/viewer.vcproj/ issue 413
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@550 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-01-03 14:29:00 +00:00
theraysmith
f040994f51 Fixed closing meta element in hocr output
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@549 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 06:25:20 +00:00
theraysmith
a7db6dada9 Fix for linking with leptonica on Linux.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@548 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 01:40:39 +00:00
theraysmith
137f4806b6 Added sub/superscript, small/dropcap detection
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@547 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-09 01:32:20 +00:00
zdenop@gmail.com
d8a2303daf improved makemoredists script and tessdata/Makefile.am
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@546 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 13:33:45 +00:00
zdenop@gmail.com
8cf7660e3a - vs2008 removed from autoconf system (not needed for linux);
Windows build improvements:
- vs2008 library projects use individual intermediate directories (thanks fizz_20120);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@544 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-12-05 09:48:06 +00:00
zdenop@gmail.com
c707b26d5f fixed VC++2008 Express build after last changes
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@543 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 12:46:41 +00:00
theraysmith
e0af88a197 Debian packages of Leptonica to allow use of 1.67
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@542 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:35:37 +00:00
theraysmith
7a54f5f950 Misc Makefile etc for 3.01
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@541 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:30:09 +00:00
theraysmith
dbcab0eed3 Traineddata for non-Eng languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@540 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:07:46 +00:00
theraysmith
5c854e03ea Cleaned up unused parameters
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@539 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:06:44 +00:00
theraysmith
c8465252e4 Rewrite of DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@538 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:05:48 +00:00
theraysmith
b98c922391 Fixed problem with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@537 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:04:02 +00:00
theraysmith
12ddf5d18e Removal of NEWDELETE + fix of problem with joined text
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@536 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:03:21 +00:00
theraysmith
23b29fbe9a Impact of DENORM rewrite + removal of NEWDELETE
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@535 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:02:28 +00:00
theraysmith
048a3efd5e Fixed bug with empty dawgs
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@534 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:01:13 +00:00
theraysmith
eba04e7c5b Fixed debug display, training on fragments
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@533 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:00:17 +00:00
theraysmith
ef59841ebe Moved multipage code to BaseAPI and tidied up command line handling
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@532 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:58:30 +00:00
theraysmith
47dc322437 Removed serialise and NEWDELETE macro
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@531 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:39 +00:00
theraysmith
1c38920f9d Rewrote DENORM
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@530 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:56:08 +00:00
theraysmith
4c4d036ee4 Removed serialize and NEWDELETE macros
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@529 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 00:53:31 +00:00
zdenop@gmail.com
f07d5723f2 removed vs2008/dlltest/Makefile from linux configure
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@528 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-29 09:32:47 +00:00
zdenop@gmail.com
70cf76ffcb gettimeofday for windows
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@527 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 19:13:51 +00:00
zdenop@gmail.com
4523ce9f7d 3.01 code from http://github.com/jimregan/tesseract-ocr with addaptions related to Linux and Windows (VC2008) compile process
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@526 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 18:34:14 +00:00
zdenop@gmail.com
7511d76315 fixed hocr to produce valid document (acording http://validator.w3.org/) - issue http://code.google.com/p/tesseract-ocr/issues/detail?id=401
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@525 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-17 20:03:58 +00:00
zdenop@gmail.com
515ac2d3f0 WIN32: improved vcproj files - possibility to build tesseract with static linked leptonica (Release.static/Release) or dynamic linked leptonica (Release.dynamic)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@521 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-15 11:22:58 +00:00
zdenop@gmail.com
179c96beff updated vcproj files: Release|Win32 links leptonica statically, suppressed some warnings;
added include/leptonica/sudoku.h (part of leptonica 1.67);

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@520 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 21:31:37 +00:00
zdenop@gmail.com
80b9e44e95 upgrade of leptonica to 1.67 version; removing unnecessary header files from vs2008;
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@519 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-13 20:27:49 +00:00
zdenop@gmail.com
a750ffed7a fixed issue 394: The tessedit_pageseg_mode does not work; thanks sms@fritzwidmer.ch
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@517 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-02 20:52:58 +00:00
max.markin@gmail.com
459ff62293 added quotes in xcopy
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@516 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 21:26:07 +00:00
zdenop@gmail.com
fa4d4589cb fixed hocr (escape special special characters; thank to aizvorski) + hocr config)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@515 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-29 19:03:06 +00:00
zdenop@gmail.com
346da8c1e5 missing returns in nonvoid functions (thanks to rusnakp) issue 389;
corrected windows installation script - tesseract should be not run as start-up application;

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@514 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-26 13:35:02 +00:00
zdenop@gmail.com
21d6ea66c2 better handlig of multipage tiff (issue 380 http://code.google.com/p/tesseract-ocr/issues/detail?id=380)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@513 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-22 06:13:54 +00:00
zdenop@gmail.com
8522533689 improved NSI script: finalization for tesseract-ocr-setup-3.00.exe
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@512 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-12 19:09:59 +00:00
zdenop@gmail.com
e54ab01f19 improved NSI script: select language data based on windows system language, download and install non English languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@511 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-10-11 22:45:09 +00:00