rajesh.katikam@gmail.com
|
bf0a83907b
|
Cleaned up configure.ac and Makefile.am in multiple folder to use OPENCL paths
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@910 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-12 10:40:40 +00:00 |
|
rajesh.katikam@gmail.com
|
983aaabaae
|
Initial version of OpenCL support added.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@909 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-11 17:43:13 +00:00 |
|
zdenop@gmail.com
|
c7ba981e04
|
fix validity of hocr output of multipage image
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@908 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-10 22:00:54 +00:00 |
|
zdenop@gmail.com
|
e66d433907
|
fix issue 938: change tessdata-dir/datadir rules; implement --tessdata-dir option
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@907 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-10 20:59:11 +00:00 |
|
zdenop@gmail.com
|
77c1b41e4e
|
fix svn:executable atribute, trailing spaces, version include
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@903 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-11-03 17:24:00 +00:00 |
|
zdenop@gmail.com
|
b15c710385
|
fix declaration of ClearResults() (VC++)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@891 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-10-10 09:06:22 +00:00 |
|
theraysmith@gmail.com
|
4c3475ad2e
|
Fixed fmemopen portability problem
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@890 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-10-10 02:07:26 +00:00 |
|
zdenop@gmail.com
|
ee08f623ce
|
fix issue 967
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@886 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-29 20:48:06 +00:00 |
|
zdenop@gmail.com
|
af319b4d90
|
fix for windows build - part 1
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@883 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-25 09:56:49 +00:00 |
|
theraysmith@gmail.com
|
4d514d5a60
|
Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-23 15:26:50 +00:00 |
|
theraysmith@gmail.com
|
88ea81c89e
|
Added renderer to API
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@869 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-09-20 19:39:59 +00:00 |
|
zdenop@gmail.com
|
b5e16669e1
|
fix issue 946/reopen issue 903
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@865 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-25 15:54:30 +00:00 |
|
zdenop@gmail.com
|
b1fd75ccf9
|
amend r:862
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@863 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-14 14:11:16 +00:00 |
|
zdenop@gmail.com
|
c45bb08a6e
|
check inputformat before getting number of pages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@862 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-14 13:58:23 +00:00 |
|
zdenop@gmail.com
|
ebd0ba8134
|
remove unused code (tesseractmain.h)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@861 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-08 18:23:47 +00:00 |
|
zdenop@gmail.com
|
10c1169d98
|
remove unused code (Windows related)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@860 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-08 18:21:10 +00:00 |
|
zdenop@gmail.com
|
b5d3d66a68
|
remove unused code(gettext)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@859 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-07 16:39:13 +00:00 |
|
zdenop@gmail.com
|
4c16ff6a1f
|
use leptonica for getting number of pages instead of own code
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@858 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-05 16:07:25 +00:00 |
|
zdenop@gmail.com
|
8a0878af3a
|
fix mingw build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@856 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-07-05 08:46:57 +00:00 |
|
zdenop@gmail.com
|
418a7ad16f
|
allow to have text file with list of images as input
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@855 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-06-27 21:53:53 +00:00 |
|
zdenop@gmail.com
|
e5628e5e1a
|
fix hOCR output - do not print empty words: issue 903
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@854 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-06-23 15:10:24 +00:00 |
|
zdenop@gmail.com
|
74dc14ebd4
|
fix copying a TessResultIterator using CAPI (issue 934)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@849 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-06-02 21:25:41 +00:00 |
|
zdenop@gmail.com
|
62b2e12b72
|
replace option -o with -c
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@841 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-05-02 17:06:14 +00:00 |
|
zdenop@gmail.com
|
7dcfd02c22
|
Allow arbitrary configuration options to be set from the command line (fix issue 893)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@837 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-04-29 20:43:14 +00:00 |
|
zdenop@gmail.com
|
1032cb1692
|
fix issue 881: capi.h redefines things from Leptonica, causing compilation failures
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@836 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-04-29 17:57:21 +00:00 |
|
zdenop@gmail.com
|
a04a5c1f42
|
Tesseract should exit with an error if ProcessPages fails (fixed issue 891)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@834 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-04-12 08:14:13 +00:00 |
|
zdenop@gmail.com
|
a6bee550e8
|
Add lang and dir attributes to each word in hOCR output (fix issue 878);
Unify usage of single quote in hOCR output
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@832 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-03-28 21:37:55 +00:00 |
|
zdenop@gmail.com
|
db52047420
|
fix issue 809: invalid hOCR output file on windows when input filename has non ascii chars.
Add release date to vs2008/doc/versions.html
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@828 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2013-02-23 15:01:21 +00:00 |
|
zdenop@gmail.com
|
37fb755d47
|
Add a command-line option (--print-parameters) to dump the parameters to stdout
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@814 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-12-23 17:54:14 +00:00 |
|
zdenop@gmail.com
|
4812fac33e
|
Fix issue 427: print result to stdout instead to file
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@813 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-12-23 17:52:42 +00:00 |
|
zdenop@gmail.com
|
8a2b5f0ead
|
Fix issue 808: Check for output file write permissions before performing lengthy OCR operation
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@812 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-12-23 17:49:15 +00:00 |
|
zdenop@gmail.com
|
42c92c3e29
|
avoid multiple tesseract inits in tesseract executable
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@811 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-12-23 17:47:06 +00:00 |
|
zdenop@gmail.com
|
9b2906c67e
|
fix issue 800: Get rid of glob() for searching available languages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@810 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-11-30 22:11:22 +00:00 |
|
zdenop@gmail.com
|
5d9fd5fb72
|
add word confidence info (x_wconf) to hocr output/fix issue 748
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@806 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-11-06 21:18:35 +00:00 |
|
theraysmith@gmail.com
|
af04ae882f
|
Made use of _ macro and stderr consistent with error messages.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@780 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-10-22 23:40:19 +00:00 |
|
zdenop@gmail.com
|
6b4970776d
|
Fixed tessdata_dir for tessseract executable.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@777 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-10-11 19:47:17 +00:00 |
|
theraysmith@gmail.com
|
605fd7488b
|
Fixed relative-to-executable tessdata location, while allowing for addition of terminating /
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@774 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-10-09 00:41:08 +00:00 |
|
zdenop@gmail.com
|
ceff3288d7
|
fix issue 764...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@768 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-27 08:43:55 +00:00 |
|
zdenop@gmail.com
|
fb91759cdc
|
fix issue 764 and clean tabulators, trim trailing spaces...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@767 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-27 08:24:46 +00:00 |
|
zdenop@gmail.com
|
23f1d16037
|
fix fox issue 346 / GetAvailableLanguagesAsVector
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@760 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-24 05:20:23 +00:00 |
|
zdenop@gmail.com
|
dc8bd4682b
|
C-API (fix issue 362)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@759 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-24 05:14:11 +00:00 |
|
theraysmith@gmail.com
|
fbf7968490
|
Fixed problem with blank pages
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@750 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-09-21 15:27:25 +00:00 |
|
zdenop@gmail.com
|
2a57976c41
|
- fix msys buil (missing -lws2_32 for library)
- remove old debian leptonica package
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@738 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-08-25 19:53:41 +00:00 |
|
zdenop@gmail.com
|
306a8216e1
|
fix creating box file from empty image (issue 516)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@737 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-08-03 22:32:17 +00:00 |
|
zdenop@gmail.com
|
c8eedb25a6
|
added ocr-capabilities for hocr conformity; XHTML 1.0 Transitional conformity; improved hocr output readability
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@729 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-28 20:44:23 +00:00 |
|
david.eger@gmail.com
|
6a9a3ddcb2
|
Zdeno pointed out that ocr_line (though not ocr_word) is actually in the hocr spec.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@728 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-27 23:58:09 +00:00 |
|
david.eger@gmail.com
|
d9d70919bb
|
Conform to the hocr spec: hocr doesn't have ocr_word, but instead has ocrx_word.
Tested with ExactImage's hocr2pdf.
$ tesseract phototest.tif phototest hocr
$ hocr2pdf -i phototest.tif -o ./phototest.pdf < ./phototest.hocr
$ evince phototest.pdf
See: https://docs.google.com/document/preview?id=1QQnIQtvdAC_8n92-LhwPcjtAUFwBlzE8EWnKAxlgVf0
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@726 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-25 17:36:25 +00:00 |
|
david.eger@gmail.com
|
eeeb4f513c
|
Provide better paragraph segmentation without having to run fully
automatic layout analysis.
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@725 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-05-10 00:03:34 +00:00 |
|
zdenop@gmail.com
|
aa14e8b212
|
fix Mingw shared build
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@718 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-04-02 12:14:37 +00:00 |
|
zdenop@gmail.com
|
cd8de9157c
|
change comments to doxygen block comments (api)
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@716 d0cd1f9f-072b-0410-8dd7-cf729c803f20
|
2012-03-30 21:24:12 +00:00 |
|