Commit Graph

126 Commits

Author SHA1 Message Date
Zdenko Podobný
bf3f125ad5 change links from code.google.com to github.com 2015-07-11 09:43:31 +02:00
Zdenko Podobný
ec9581d8f2 temporary add configure and Makefile.in for release 2015-07-11 09:42:43 +02:00
Ray Smith
19c26f0646 Removed warning about ligatures and fixed root font dir 2015-07-10 15:56:33 -07:00
Ray Smith
a303ab9d00 Misc fixes, mostly clang formatting, but some bug fixes in matrix, werd, and tesstrain_utils. Also updates unicharset to match traineddata files. 2015-07-09 14:28:20 -07:00
Zdenko Podobný
6a998ec5d0 fix redefinition in stringrenderer.cpp (stringrenderer.h) 2015-06-14 17:29:27 +02:00
Ray Smith
0ee178d79b Clang fixes to earlier changes and build compatability with Google environment part 2 2015-06-12 11:17:47 -07:00
Ray Smith
4c7ab0caea Fixed font lists, improved wordlist management 2015-06-12 10:56:40 -07:00
Ray Smith
ab0f4e2c38 Clang fixes to earlier changes and build compatability with Google environment 2015-06-12 10:53:21 -07:00
Jim O'Regan
16ac3b0a20 /usr/share/fonts is the wrong path on Mac 2015-05-18 09:53:14 +01:00
Ray Smith
941d87057e Fixed training build 2015-05-13 17:46:58 -07:00
Jim O'Regan
b13691fda0 Merge conflict: going with Ray's version 2015-05-13 08:54:28 +01:00
Ray Smith
b2a3924585 Major updates to training system as a result of extensive testing on 100 languages - makefile.am 2015-05-12 18:08:39 -07:00
Ray Smith
6be25156f7 Major updates to training system as a result of extensive testing on 100 languages 2015-05-12 18:04:31 -07:00
Ray Smith
164897210a Improved newlines and spaces in a box file so it works better with RTL languages. 2015-05-12 17:51:03 -07:00
Ray Smith
6b634170c1 Significant change to invisible font system
to improve correctness and compatibility with
external programs, particularly ghostscript.
We will start mapping everything to a single glyph,
rather than allowing characters to run off the end
of the font.

A more detailed design discussion is embedded into
pdfrenderer.cpp comments. The font, source code
that produces the font, and the design comments
were contributed by Ken Sharp from Artifex Software.
2015-05-12 17:33:18 -07:00
Zdenko Podobný
d1c749f6ad Fixed issue 1133 - part3 (Nick's replacement of InputBuffer-ReadLine with InputBuffer-Read) 2015-05-01 19:33:56 +02:00
Zdenko Podobný
5e754af9cb Fixed issue 1133 - part2 2015-05-01 19:12:03 +02:00
Zdenko Podobný
c0640a4bef fix cygwin build (issue 1289) 2014-09-28 23:19:52 +02:00
Zdenko Podobný
93f7899a9e fix tesstrain.sh/issue 1311. Patch from Mark Zealey <zealey@gmail.com>
https://groups.google.com/forum/#!msg/tesseract-dev/uYTr1D656-M/xLXgjKy9fywJ
2014-09-14 15:10:25 +02:00
Zdenko Podobný
769fef8c96 fix training tools build 2014-08-13 22:07:44 +02:00
theraysmith@gmail.com
e249d7bcb2 Added tesstrain.sh - a master training script
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1146 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:20:56 +00:00
theraysmith@gmail.com
c9385a2755 Added tesstrain.sh - a master training script
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1145 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:20:45 +00:00
theraysmith@gmail.com
1fc8898926 Fixed missing newlines in logging
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1144 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:20:08 +00:00
theraysmith@gmail.com
6fcede5c48 Fixed some leaks
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1143 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:19:24 +00:00
theraysmith@gmail.com
9f4d6fd668 Added ability to just list available fonts for text, and to underline words for training
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1142 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:19:06 +00:00
theraysmith@gmail.com
97080412fd Bunch of minor bug fixes/cleanups
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1106 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-21 15:48:48 +00:00
zdenop
30e5220f2e fix training build for opencl and mingw
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1103 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-16 19:36:32 +00:00
theraysmith@gmail.com
cda8e748b1 Fixed some formatting issues
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1083 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-25 01:25:42 +00:00
theraysmith@gmail.com
42bfdc21d8 Fixed issue 1134
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1082 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-25 01:07:26 +00:00
theraysmith@gmail.com
84e0f6470f Fixed issue 1132
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1081 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-24 21:46:57 +00:00
theraysmith@gmail.com
61d45d2f34 Fixed issue 1133
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1080 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-24 21:18:00 +00:00
theraysmith@gmail.com
bfb0e392d1 Applied patch to fix issue 1131
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1065 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-23 23:17:43 +00:00
theraysmith@gmail.com
cc47429587 Ugraded pango version
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1062 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-23 22:54:25 +00:00
theraysmith@gmail.com
8364f24f4b Added ability for box files to store spaces and newlines
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1060 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-23 22:52:05 +00:00
zdenop
c3b9b7c60d fix OSX build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1059 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-16 09:04:28 +00:00
theraysmith@gmail.com
07ca24aeaf Removed upper limit on trie size, fixing issue 1020.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1044 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-02-03 19:18:23 +00:00
theraysmith@gmail.com
6a10aa7985 More cleanup changes from patches
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1024 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-29 02:22:14 +00:00
zdenop@gmail.com
ac5a8a871b fix windows builds (mingw and VS2010)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1017 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-26 22:39:20 +00:00
theraysmith@gmail.com
0d93bb7cfa More code cleanup from patches and fixing warnings
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1011 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-24 21:09:59 +00:00
zdenop
9cf08ca8d3 fix build with -DGRAPHICS_DISABLED
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@981 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-11 23:08:54 +00:00
zdenop
8299e2a605 fix linux build, remove not used folder and spec file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@979 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-10 23:52:04 +00:00
theraysmith@gmail.com
f244ab3fc6 New training tool text2image
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@965 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-09 18:04:20 +00:00
theraysmith@gmail.com
0e230a9d96 New training tool text2image
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@964 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-01-09 18:01:34 +00:00
theraysmith@gmail.com
4d514d5a60 Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-09-23 15:26:50 +00:00
zdenop@gmail.com
10c1169d98 remove unused code (Windows related)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@860 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-07-08 18:21:10 +00:00
zdenop@gmail.com
b9abecfb34 Auto append dot in combine_tessdata (issue 932); provide more info for combine_tessdata utility
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@848 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-06-02 20:50:55 +00:00
theraysmith@gmail.com
59d244b06e More fixes for GRAPHICS_DISABLED from Zdenko and Ray
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@757 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-09-22 00:59:31 +00:00
zdenop
937aab009f fix issue 636
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@734 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-07-31 09:22:26 +00:00
zdenop@gmail.com
3115fbfdcb another fix MinGW+MSYS
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@709 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-24 10:14:47 +00:00
zdenop@gmail.com
d4d4b8aad8 improve autools system (mingw+msys fix); implementation of --disable-tessdata-prefix
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@708 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-03-22 20:01:33 +00:00