tesseract

mirror of https://github.com/tesseract-ocr/tesseract.git synced 2024-12-23 15:07:49 +08:00

Author	SHA1	Message	Date
Robbert Klarenbeek	4919b276eb	Fix incompatibility with some C++11 implementations	2016-04-28 22:34:44 +02:00
Stefan Weil	fe11c19bf3	Add missing argument for tprintf The format string expects 3 int arguments. Signed-off-by: Stefan Weil <sw@weilnetz.de>	2016-03-17 06:23:30 +01:00
Zdenko Podobný	2c675dcc77	Revert "fix comment about default PSM" This reverts commit `b46af6da31`.	2016-03-10 09:42:01 +01:00
zdenop	b46af6da31	fix comment about default PSM	2016-03-09 19:19:45 +01:00
Zdenko Podobný	8bfaf84007	move new&delete histogramAllChannels inside the #ifdef USE_OPENCL; fixes #248	2016-03-04 14:35:08 +01:00
Stefan Weil	4fdf272ffa	Remove checks for this == NULL This fixes warnings from clang. Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-07 13:09:53 +01:00
Stefan Weil	83541d8ea0	Remove register attribute for local variables This fixes clang compiler warnings like this one: wordrec/gradechop.cpp:52:3: warning: 'register' storage class specifier is deprecated [-Wdeprecated-register] Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-06 06:45:19 +01:00
Stefan Weil	4a92ff5862	Fix compiler warnings for copy constructors gcc reports these warnings with -Wextra: ccstruct/pageres.h:330:3: warning: base class 'class ELIST_LINK' should be explicitly initialized in the copy constructor [-Wextra] ccstruct/ratngs.cpp:115:1: warning: base class 'class ELIST_LINK' should be explicitly initialized in the copy constructor [-Wextra] ccstruct/ratngs.h:291:3: warning: base class 'class ELIST_LINK' should be explicitly initialized in the copy constructor [-Wextra] ccutil/genericvector.h:435:3: warning: base class 'class GenericVector<WERD_RES*>' should be explicitly initialized in the copy constructor [-Wextra] Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-05 09:19:37 +01:00
Stefan Weil	70fd7cdf0a	ccstruct: Fix compiler warning (disable buggy code) gcc reports a potential bad array access: ccstruct/mod128.cpp:98:20: warning: array subscript has type 'char' [-Wchar-subscripts] dir is of type 'char'. Most compilers use signed char by default. Then the value of dir is in the range -128 ... 127 and cannot be used to access an array with 256 elements. Don't fix that but disable the buggy code. Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-05 06:39:35 +01:00
Stefan Weil	edf765b952	Remove unneeded const qualifiers This fixes compiler warnings like this one: api/baseapi.h:739:32: warning: type qualifiers ignored on function return type [-Wignored-qualifiers] Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-05 06:36:42 +01:00
Stefan Weil	38f3db8ca5	Fix more typos in comments (found by codespell) Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-11-04 21:58:42 +01:00
Stefan Weil	bef8cad38d	ccstruct: Fix typos in comments and strings Most of them were found by codespell. Signed-off-by: Stefan Weil <sw@weilnetz.de>	2015-09-14 22:02:00 +02:00
James R. Barlow	18ac7ae7ef	Get OpenCL to compile on OS X However, the output of the OpenCL build is garbage....	2015-08-26 02:03:07 -07:00
Zdenko Podobný	66a76a9477	Revert "temporary add config/*, configure and Makefile.in for release" This reverts commits `ec9581d8f2`, `1afe382c4e`, `4b2cfabcc1`	2015-07-31 21:44:43 +02:00
Zdenko Podobný	27b8a5cc89	fix GRAPHICS_DISABLED build	2015-07-23 23:14:53 +02:00
Jim O'Regan	524a61452d	Doxygen Squashed commit from https://github.com/tesseract-ocr/tesseract/tree/more-doxygen closes #14 Commits: `6317305` doxygen `9f42f69` doxygen `0fc4d52` doxygen `37b4b55` fix typo `bded8f1` some more doxy `020eb00` slight tweak `524666d` doxygenify `2a36a3e` doxygenify `229d218` doxygenify `7fd28ae` doxygenify `a8c64bc` doxygenify `f5d21b6` fix `5d8ede8` doxygenify `a58a4e0` language_model.cpp `fa85709` lm_pain_points.cpp lm_state.cpp `6418da3` merge `06190ba` Merge branch 'old_doxygen_merge' into more-doxygen `84acf08` Merge branch 'master' into more-doxygen `50fe1ff` pagewalk.cpp cube_reco_context.cpp `2982583` change to relative `192a24a` applybox.cpp, take one `8eeb053` delete docs for obsolete params `52e4c77` modernise classify/ocrfeatures.cpp `2a1cba6` modernise cutil/emalloc.cpp `773e006` silence doxygen warning `aeb1731` silence doxygen warning `f18387f` silence doxygen; new params are unused? `15ad6bd` doxygenify cutil/efio.cpp `c8b5dad` doxygenify cutil/danerror.cpp `784450f` the globals and exceptions parts are obsolete; remove `8bca324` doxygen classify/normfeat.cpp `9bcbe16` doxygen classify/normmatch.cpp `aa9a971` doxygen ccmain/cube_control.cpp `c083ff2` doxygen ccmain/cube_reco_context.cpp `f842850` params changed `5c94f12` doxygen ccmain/cubeclassifier.cpp `15ba750` case sensitive `f5c71d4` case sensitive `f85655b` doxygen classify/intproto.cpp `4bbc7aa` partial doxygen classify/mfx.cpp `dbb6041` partial doxygen classify/intproto.cpp `2aa72db` finish doxygen classify/intproto.cpp `0b8de99` doxygen training/mftraining.cpp `0b5b35c` partial doxygen ccstruct/coutln.cpp `b81c766` partial doxygen ccstruct/coutln.cpp `40fc415` finished? doxygen ccstruct/coutln.cpp `6e4165c` doxygen classify/clusttool.cpp `0267dec` doxygen classify/cutoffs.cpp `7f0c70c` doxygen classify/fpoint.cpp `512f3bd` ignore ~ files `5668a52` doxygen classify/intmatcher.cpp `84788d4` doxygen classify/kdtree.cpp `29f36ca` doxygen classify/mfoutline.cpp `40b94b1` silence doxygen warnings `6c511b9` doxygen classify/mfx.cpp `f9b4080` doxygen classify/outfeat.cpp `aa1df05` doxygen classify/picofeat.cpp `cc5f466` doxygen training/cntraining.cpp `cce044f` doxygen training/commontraining.cpp `167e216` missing param `9498383` renamed params `37eeac2` renamed param `d87b5dd` case `c8ee174` renamed params `b858db8` typo `4c2a838` h2 context? `81a2c0c` fix some param names; add some missing params, no docs `bcf8a4c` add some missing params, no docs `af77f86` add some missing params, no docs; fix some param names `01df24e` fix some params `6161056` fix some params `68508b6` fix some params `285aeb6` doxygen complains here no matter what `529bcfa` rm some missing params, typos `cd21226` rm some missing params, add some new ones `48a4bc2` fix params `c844628` missing param `312ce37` missing param; rename one `ec2fdec` missing param `05e15e0` missing params `d515858` change "<" to < to make doxygen happy `b476a28` wrong place	2015-07-20 18:48:00 +01:00
Zdenko Podobný	ec9581d8f2	temporary add configure and Makefile.in for release	2015-07-11 09:42:43 +02:00
Ray Smith	44122698d7	Removed debug messages, forward compatability of traineddata files, further bug fix.	2015-07-09 14:50:25 -07:00
Ray Smith	a303ab9d00	Misc fixes, mostly clang formatting, but some bug fixes in matrix, werd, and tesstrain_utils. Also updates unicharset to match traineddata files.	2015-07-09 14:28:20 -07:00
Jeff Breidenbach	935e72401a	Changes to get 'make-dist' to work This is what was required to get 'make-dist' to work. I left autogen alone since it works, albeit with an error message. My practice packages appear to work fine.	2015-06-29 08:11:26 +01:00
Zdenko Podobný	3a0da4e1b7	fix DISABLE_GRAPHICS build (google code issue 1490)	2015-06-21 22:50:14 +02:00
Ray Smith	d174c4fd33	Fixed occurrence of small rotated blocks in loosely spaced text part 2	2015-06-12 11:12:06 -07:00
Ray Smith	03f3c9dc88	Misc fixes missed from previous commits	2015-05-12 18:13:15 -07:00
Ray Smith	2924d3ae15	Changes missed from diacritic fix edit	2015-05-12 17:28:56 -07:00
Ray Smith	84920b92b3	Font and classifier output structure cleanup. Font recognition was poor, due to forcing a 1st and 2nd choice at a character level, when the total score for the correct font is often correct at the word level, so allowed the propagation of a full set of fonts and scores to the word recognizer, which can now decide word level fonts using the scores instead of simple votes. Change precipitated a cleanup of output data structures for classifier results, eliminating ScoredClass and INT_RESULT_STRUCT, with a few extra elements going in UnicharRating, and using that wherever possible. That added the extra complexity of 1-rating due to a flip between 0 is good and 0 is bad for the internal classifier scores before they are converted to rating and certainty.	2015-05-12 17:24:34 -07:00
Ray Smith	0e868ef377	Major change to improve layout analysis for heavily diacritic languages: Tha, Vie, Kan, Tel etc. There is a new overlap detector that detects when diacritics cause a big increase in textline overlap. In such cases, diacritics from overlap regions are kept separate from layout analysis completely, allowing textline formation to happen without them. The diacritics are then assigned to 0, 1 or 2 close words at the end of layout analysis, using and modifying an old noise detection data path. The stored diacritics are used or not during recognition according to the character classifier's liking for them.	2015-05-12 16:47:02 -07:00
Ray Smith	b6d0184806	Fixed problems with shifted baselines so recognition can recover from layout analysis errors.	2015-05-12 15:53:45 -07:00
Ray Smith	4a3caefd92	Add ability to build under android (without cube or scrollview).	2015-05-12 15:41:15 -07:00
Ray Smith	25d0968d09	Major refactor to improve speed on difficut images, especially when running a heap checker. SEAM and SPLIT have been begging for a refactor for a LONG time. This change does most of the work of turning them into proper classes: Moved relevant code into SEAM/SPLIT/TBLOB/EDGEPT etc from global helper functions. Made the splits full data members of SEAM in an array instead of 3 separate pointers. This greatly reduces the amount of new/delete happening in the chopper, which is the main goal. Deleted redundant files: olutil., makechop. Brought other code into SEAM in order to keep its data members private with only priority having accessors.	2015-05-12 14:59:14 -07:00
Ray Smith	2f197cd653	Fixed issues 899/1220/1246 (mixed eng+ara)	2014-09-17 18:27:49 -07:00
Ray Smith	736d327473	NOP changes from static analysis in issue 1205	2014-08-12 16:09:12 -07:00
theraysmith@gmail.com	dbf6197471	Major refactor of control.cpp to enable line recognition git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1147 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-08-11 23:23:06 +00:00
theraysmith@gmail.com	d52231cff3	Started TFile conversion to remove fmemopen git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1138 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-08-11 23:08:46 +00:00
zdenop	c3b6ac7f32	skip imagedata build to fix issue 1150 on Mac OS X git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1096 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-05-07 21:04:42 +00:00
theraysmith@gmail.com	0dc7926f24	Fixed issue 1122 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1077 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-04-24 01:02:45 +00:00
theraysmith@gmail.com	a9f483cffc	Applied patch to fix issue 1098 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1066 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-04-23 23:28:01 +00:00
theraysmith@gmail.com	3a5f699013	Applied patch to refix issue 331 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1064 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-04-23 23:12:53 +00:00
theraysmith@gmail.com	fec775400d	Added ImageData class git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1061 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-04-23 22:53:16 +00:00
theraysmith@gmail.com	8364f24f4b	Added ability for box files to store spaces and newlines git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1060 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-04-23 22:52:05 +00:00
theraysmith@gmail.com	7f5e5264d3	Fixed issues 1093-1097 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1048 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-02-04 23:36:24 +00:00
theraysmith@gmail.com	2fcea93846	Fixed issues 1081-1090 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1046 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-02-04 02:23:18 +00:00
theraysmith@gmail.com	d11dc049e3	Fixed a lot of compiler/clang warnings git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1015 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-25 02:28:51 +00:00
theraysmith@gmail.com	0d93bb7cfa	More code cleanup from patches and fixing warnings git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@1011 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-24 21:09:59 +00:00
zdenop@gmail.com	71ae509354	fix for mingw32/g++ 4.8.1 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@998 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-22 08:10:15 +00:00
theraysmith@gmail.com	5857bebdc8	Minor formatting changes git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@992 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-17 18:54:16 +00:00
zdenop	3d1e1cc23d	fix opencl build git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@986 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-13 22:41:52 +00:00
zdenop	aeba7a7ace	amend r:983 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@985 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-12 21:38:11 +00:00
zdenop	a6d23c63c5	remove empty file git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@984 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-12 20:55:00 +00:00
zdenop@gmail.com	94d08567e1	fix vs2010 (and maybe vs2008) build git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@983 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-12 20:13:55 +00:00
zdenop	9cf08ca8d3	fix build with -DGRAPHICS_DISABLED git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@981 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2014-01-11 23:08:54 +00:00

1 2 3

132 Commits