tesseract

mirror of https://github.com/tesseract-ocr/tesseract.git synced 2024-12-05 02:47:00 +08:00

Author	SHA1	Message	Date
Jim O'Regan	524a61452d	Doxygen Squashed commit from https://github.com/tesseract-ocr/tesseract/tree/more-doxygen closes #14 Commits: `6317305` doxygen `9f42f69` doxygen `0fc4d52` doxygen `37b4b55` fix typo `bded8f1` some more doxy `020eb00` slight tweak `524666d` doxygenify `2a36a3e` doxygenify `229d218` doxygenify `7fd28ae` doxygenify `a8c64bc` doxygenify `f5d21b6` fix `5d8ede8` doxygenify `a58a4e0` language_model.cpp `fa85709` lm_pain_points.cpp lm_state.cpp `6418da3` merge `06190ba` Merge branch 'old_doxygen_merge' into more-doxygen `84acf08` Merge branch 'master' into more-doxygen `50fe1ff` pagewalk.cpp cube_reco_context.cpp `2982583` change to relative `192a24a` applybox.cpp, take one `8eeb053` delete docs for obsolete params `52e4c77` modernise classify/ocrfeatures.cpp `2a1cba6` modernise cutil/emalloc.cpp `773e006` silence doxygen warning `aeb1731` silence doxygen warning `f18387f` silence doxygen; new params are unused? `15ad6bd` doxygenify cutil/efio.cpp `c8b5dad` doxygenify cutil/danerror.cpp `784450f` the globals and exceptions parts are obsolete; remove `8bca324` doxygen classify/normfeat.cpp `9bcbe16` doxygen classify/normmatch.cpp `aa9a971` doxygen ccmain/cube_control.cpp `c083ff2` doxygen ccmain/cube_reco_context.cpp `f842850` params changed `5c94f12` doxygen ccmain/cubeclassifier.cpp `15ba750` case sensitive `f5c71d4` case sensitive `f85655b` doxygen classify/intproto.cpp `4bbc7aa` partial doxygen classify/mfx.cpp `dbb6041` partial doxygen classify/intproto.cpp `2aa72db` finish doxygen classify/intproto.cpp `0b8de99` doxygen training/mftraining.cpp `0b5b35c` partial doxygen ccstruct/coutln.cpp `b81c766` partial doxygen ccstruct/coutln.cpp `40fc415` finished? doxygen ccstruct/coutln.cpp `6e4165c` doxygen classify/clusttool.cpp `0267dec` doxygen classify/cutoffs.cpp `7f0c70c` doxygen classify/fpoint.cpp `512f3bd` ignore ~ files `5668a52` doxygen classify/intmatcher.cpp `84788d4` doxygen classify/kdtree.cpp `29f36ca` doxygen classify/mfoutline.cpp `40b94b1` silence doxygen warnings `6c511b9` doxygen classify/mfx.cpp `f9b4080` doxygen classify/outfeat.cpp `aa1df05` doxygen classify/picofeat.cpp `cc5f466` doxygen training/cntraining.cpp `cce044f` doxygen training/commontraining.cpp `167e216` missing param `9498383` renamed params `37eeac2` renamed param `d87b5dd` case `c8ee174` renamed params `b858db8` typo `4c2a838` h2 context? `81a2c0c` fix some param names; add some missing params, no docs `bcf8a4c` add some missing params, no docs `af77f86` add some missing params, no docs; fix some param names `01df24e` fix some params `6161056` fix some params `68508b6` fix some params `285aeb6` doxygen complains here no matter what `529bcfa` rm some missing params, typos `cd21226` rm some missing params, add some new ones `48a4bc2` fix params `c844628` missing param `312ce37` missing param; rename one `ec2fdec` missing param `05e15e0` missing params `d515858` change "<" to < to make doxygen happy `b476a28` wrong place	2015-07-20 18:48:00 +01:00
Ray Smith	84920b92b3	Font and classifier output structure cleanup. Font recognition was poor, due to forcing a 1st and 2nd choice at a character level, when the total score for the correct font is often correct at the word level, so allowed the propagation of a full set of fonts and scores to the word recognizer, which can now decide word level fonts using the scores instead of simple votes. Change precipitated a cleanup of output data structures for classifier results, eliminating ScoredClass and INT_RESULT_STRUCT, with a few extra elements going in UnicharRating, and using that wherever possible. That added the extra complexity of 1-rating due to a flip between 0 is good and 0 is bad for the internal classifier scores before they are converted to rating and certainty.	2015-05-12 17:24:34 -07:00
Ray Smith	2f197cd653	Fixed issues 899/1220/1246 (mixed eng+ara)	2014-09-17 18:27:49 -07:00
theraysmith@gmail.com	7ec4fd7a56	Refactorerd control functions to enable parallel blob classification git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@904 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2013-11-08 20:30:56 +00:00
theraysmith@gmail.com	4d514d5a60	Major refactor of beam search, elimination of dead code, misc bug fixes, updates to Makefile.am, Changelog etc. git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@878 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2013-09-23 15:26:50 +00:00
david.eger@gmail.com	018f192fc2	Abolish populate_unichars(), fixing seg fault reported in Debian: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=658634 git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@675 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2012-02-15 01:37:00 +00:00
theraysmith@gmail.com	3a998fe7ac	Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic, Added paragraph detection in layout analysis/post OCR, Fixed inconsistent xheight during training and over-chopping, Added simultaneous multi-language capability, Refactored top-level word recognition module, Fixed problems with internally scaled images git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@651 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2012-02-02 02:59:49 +00:00
zdenop@gmail.com	da41b96f7f	removed check for libtiff - leptonica is required; cleanup #ifdef/#ifndef HAVE_LIBLEPT git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@624 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2011-08-30 06:34:41 +00:00
theraysmith	3e8c0bc228	Various fixes, including memory leak in fixspace, font labels on output, removed some annoying debug output, fixes to initialization of parameters, general cleanup, and added Hindi git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@567 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2011-03-21 21:44:05 +00:00
zdenop@gmail.com	4523ce9f7d	3.01 code from http://github.com/jimregan/tesseract-ocr with addaptions related to Linux and Windows (VC2008) compile process git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@526 d0cd1f9f-072b-0410-8dd7-cf729c803f20	2010-11-23 18:34:14 +00:00

10 Commits