Stefan Weil
6ac5d0ba8f
dawg: Fix typos and file information in file header
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-08-30 21:48:06 +02:00
Ray Smith
2633fef0b6
Part 2 of separating out the unicharset from the LSTM model, fixing command line for training
2017-08-02 13:29:23 -07:00
Ray Smith
da03e4e910
Fixes from pull of cleanups: clang tidied, reviewed, fixed new bugs, undeleted needed code. Probably breaks the build, due to some inclusion of changes in utf8/32 conversion
2017-07-14 09:30:14 -07:00
Stefan Weil
121a7c6489
ccstruct: Fix non portable and wrong format strings
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-05-11 19:32:51 +02:00
Ray Smith
1cc511188d
Added extra Init that takes a memory buffer or a filereader function pointer to enable read of traineddata from memory or foreign file systems. Updated existing readers to use TFile API instead of FILE. This does not yet add big-endian capability to LSTM, but it is very easy from here.
2017-04-27 15:48:23 -07:00
zdenop
3c0c54f059
Merge pull request #353 from pnordhus/remove_dawgpositionvector_dtor
...
Remove redundant destructor
2016-12-08 13:04:58 +01:00
Stefan Weil
cefc420ddb
Remove extra semicolons after member function definitions
...
clang++ report:
api/baseapi.h:852:4: warning:
extra ';' after member function definition [-Wextra-semi]
[...]
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2016-12-04 14:54:52 +01:00
Stefan Weil
7e90200d26
Fix some compiler warnings (unused function parameters)
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2016-11-13 17:20:06 +01:00
Philipp Nordhus
c05ff3456e
Remove duplicate destructor
...
Destructor of base class GenericVector calls base class clear()
method, deallocating the memory.
2016-06-17 23:20:03 +02:00
Stefan Weil
97d47a406d
dict: Fix typos in comments and strings
...
All of them were found by codespell.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-09-14 22:16:42 +02:00
theraysmith@gmail.com
4d514d5a60
Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc.
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2013-09-23 15:26:50 +00:00
zdenop@gmail.com
6ccab83bd6
fixing issue 628 (replacing __MSW32__ with _WIN32) and issue 614 (reverting "class DLLSYM STRING" to "class CCUTIL_API STRING")
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@677 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-19 21:48:45 +00:00
theraysmith@gmail.com
fdd4ffe85e
Fixed endian bug in dawg reader, Added word bigram correction,
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@649 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2012-02-02 02:56:18 +00:00
zdenop@gmail.com
9b7375edd6
MinGW portability solved + some code cleanup (based on cpplint)
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@605 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-08-15 19:28:25 +00:00
theraysmith
664b84b3c8
Various fixes, including memory leak in fixspace, font labels on output, removed some annoying debug output, fixes to initialization of parameters, general cleanup, and added Hindi
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@571 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2011-03-21 21:46:35 +00:00
theraysmith
b98c922391
Fixed problem with empty dawgs
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@537 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-30 01:04:02 +00:00
zdenop@gmail.com
4523ce9f7d
3.01 code from http://github.com/jimregan/tesseract-ocr with addaptions related to Linux and Windows (VC2008) compile process
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@526 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-11-23 18:34:14 +00:00
joregan
a18816f839
partial merge of doxygen branch (stuff without conflicts, basically)
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@441 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2010-07-27 13:23:23 +00:00
theraysmith
3a13d80d24
Changes to dict for 3.00
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@293 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2009-07-11 02:20:33 +00:00
theraysmith
3adf29c25c
Increased max edges in squished dawg
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@194 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2008-11-14 04:33:18 +00:00
theraysmith
0aa4861116
Further fixes to dictionary generation that was losing words
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@184 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2008-08-20 17:47:05 +00:00
theraysmith
b950752818
Fixes to wordlist2dawg to create correct dawgs on windows
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@179 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2008-08-14 22:44:46 +00:00
theraysmith
520077bd41
Fixed name collision with jpeg library
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@164 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2008-04-22 00:42:51 +00:00
theraysmith
2a678305c6
Major internationalization improvements
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@133 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2008-02-01 00:21:49 +00:00
theraysmith
0d9fa6a040
Fixed portability problems with VC++ 6 and VC++ express.
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@83 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2007-07-18 01:01:50 +00:00
theraysmith
a59e5dc791
Preparations for unicodization
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@56 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2007-05-16 01:46:09 +00:00
tmbdev
425d593ebe
top-skimming import from sf.net
...
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk/trunk@2 d0cd1f9f-072b-0410-8dd7-cf729c803f20
2007-03-07 20:03:40 +00:00