Commit Graph

4461 Commits

Author SHA1 Message Date
theraysmith@gmail.com
c9385a2755 Added tesstrain.sh - a master training script
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1145 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:20:45 +00:00
theraysmith@gmail.com
1fc8898926 Fixed missing newlines in logging
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1144 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:20:08 +00:00
theraysmith@gmail.com
6fcede5c48 Fixed some leaks
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1143 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:19:24 +00:00
theraysmith@gmail.com
9f4d6fd668 Added ability to just list available fonts for text, and to underline words for training
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1142 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:19:06 +00:00
theraysmith@gmail.com
b64ad05096 Improved efficiency of image processing for PDF
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1141 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:15:25 +00:00
theraysmith@gmail.com
36b55f7710 Removed unused variable
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1140 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:10:06 +00:00
theraysmith@gmail.com
c86fe22a62 Started TFile conversion to remove fmemopen
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1139 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:09:25 +00:00
theraysmith@gmail.com
d52231cff3 Started TFile conversion to remove fmemopen
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1138 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-11 23:08:46 +00:00
zdenop
c51691fdeb add parameter info to ParamUtils::PrintParams
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1137 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-10 19:08:20 +00:00
zdenop
7239cec2b4 fix off_t issue on OSX
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1136 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-10 16:42:45 +00:00
zdenop
6941bffbd2 fix typo
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1135 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-09 17:53:57 +00:00
zdenop
bce2cd5f33 enable to select pdf compression type and jpeg quality (fix issue 1263)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1134 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-08 21:18:44 +00:00
zdenop@gmail.com
6cdf70b0cf Cleanup an unused variable in ccmain/osdetect.cpp - fix issue 1229
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1133 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-04 08:29:32 +00:00
zdenop
1156098567 Add font info to hocr output - fix issue 1219
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1132 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-03 16:22:12 +00:00
zdenop
19ddc89c44 update tesseract manpage and INSTALL.SVN
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1131 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-08-02 20:59:19 +00:00
zdenop
1ea387232b fix compatibility of uninstall: MacOSX rm needs -f instead of --force
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1127 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-07-24 20:39:30 +00:00
zdenop
5b779456f9 fix compatibility with leptonica 1.71 and 1.70
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1126 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-07-24 19:11:39 +00:00
zdenop
c550aee2f9 revert commit r1122 ;-)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1123 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-07-01 22:04:56 +00:00
zdenop
bcbfb93475 fix issue 1240
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1122 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-07-01 21:57:22 +00:00
zdenop
95b7783a95 fix issue 1228: bilevel pdf output - horizontal/vertical lines removed
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1118 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-23 21:04:37 +00:00
zdenop
905e6162b9 put info about (API) version; fix typo
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1117 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-22 18:31:42 +00:00
zdenop
41bd040ef5 fix issue 1043
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1116 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-08 21:26:02 +00:00
zdenop@gmail.com
780183226c Accept Windows EOL in config file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1115 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-03 06:57:52 +00:00
rajesh.katikam@gmail.com
3ff108cf45 OpenCL fix for PixMemTiff
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1114 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-06-02 05:42:05 +00:00
zdenop
fad9de4e1b fix issue 1217: GetThresholdedImage accesses possibly NULL thresholder_
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1113 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-31 21:21:37 +00:00
zdenop
e64f555567 fix Issue 1223: TessPolyBlockType enum is outdated in C-API
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1112 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-31 20:31:48 +00:00
zdenop
36f3f76d64 fix tiff issue on windows
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1111 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-31 07:27:54 +00:00
zdenop@gmail.com
84cdcb32cc fixed windows build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1110 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-26 06:48:58 +00:00
zdenop
19c4c2f0e7 fix C-API to recent C++ API changes - thanks to Nick White
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1109 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-25 21:03:11 +00:00
zdenop
ffe52737d5 check if input file exists
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1108 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-25 19:58:00 +00:00
theraysmith@gmail.com
97080412fd Bunch of minor bug fixes/cleanups
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1106 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-21 15:48:48 +00:00
theraysmith@gmail.com
25a8c7b720 Enabled streaming input and output of multi-page documents
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1105 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-21 15:46:21 +00:00
zdenop
30e5220f2e fix training build for opencl and mingw
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1103 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-16 19:36:32 +00:00
zdenop
979f9cafe5 Add word recognition language to C-API - fix issue 1200
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1102 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-16 18:35:54 +00:00
zdenop
44b0d0e28e addition to r1100
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1101 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-11 21:24:54 +00:00
zdenop
6051e40212 fix issue 1197
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1100 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-11 21:20:38 +00:00
zdenop
2e520f2fac fix hocr/pdf output when image is provided from stdin - issue 1196
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1099 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-11 15:59:47 +00:00
zdenop
bdb912c186 escape input_file name in hOCR output - fix issue 1154
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1098 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-09 22:19:30 +00:00
zdenop
0e08cb0080 Make default language params message conditional on debug level: issue 1152
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1097 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-09 18:17:29 +00:00
zdenop
c3b6ac7f32 skip imagedata build to fix issue 1150 on Mac OS X
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1096 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-07 21:04:42 +00:00
zdenop
30f6ae6742 amendment to r1091
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1095 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-07 20:53:03 +00:00
zdenop
ee73e3b107 fix issue 123: user-words (and user-patterns) file specified by command line
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1093 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-04 21:11:00 +00:00
zdenop
bc09cd9040 fix formatting in C-API and add TessChoiceIteratorDelete
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1092 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-03 20:21:37 +00:00
zdenop
f86e9d83d4 add ChoiceIterator to C-API - fix issue 1149
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1091 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-05-03 09:29:20 +00:00
theraysmith@gmail.com
484b47bc5d Fixed tfscanf return value with * modifier
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1087 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-29 04:30:47 +00:00
theraysmith@gmail.com
c8e27cb8f8 Fixed segfault due to partial support of * modifier in tfscanf
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1086 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-29 04:03:41 +00:00
theraysmith@gmail.com
d7b089fbcf Fixed some clang errors about explicit constructors and more formatting.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1085 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-28 23:10:48 +00:00
theraysmith@gmail.com
d748d94aae Fixed bugs in scanutils that were causing accuracy degradation
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1084 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-28 23:06:41 +00:00
theraysmith@gmail.com
cda8e748b1 Fixed some formatting issues
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1083 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-25 01:25:42 +00:00
theraysmith@gmail.com
42bfdc21d8 Fixed issue 1134
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1082 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2014-04-25 01:07:26 +00:00