Shreeshrii
fdc243b363
Change default width for images output by text2image
...
Fixes
Image too large to learn!! Size = 2594x48
Image not trainable
See https://github.com/tesseract-ocr/tesseract/issues/590#issuecomment-271244655
for related discussion
2018-06-11 09:34:07 +05:30
zdenop
680b172a0c
Merge pull request #1659 from stweil/api
...
Remove some header files from public API
2018-06-10 16:27:47 +02:00
Stefan Weil
fcdcba70f4
Remove some header files from public API
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-10 16:19:58 +02:00
Egor Pugin
e5d11b5297
Merge pull request #1658 from Shreeshrii/master
...
correct script for eng, remove new reports from distribution
2018-06-10 14:15:51 +03:00
Shree Devi Kumar
053698bb8b
update .gitignore
2018-06-10 04:16:23 +00:00
Shree Devi Kumar
4753d8cbb6
correct script for eng, remove new reports from distribution
2018-06-10 02:49:50 +00:00
Egor Pugin
a4241c9817
Merge pull request #1655 from stweil/scanedg
...
block_edges: Add assertions for block coordinates
2018-06-09 18:32:05 +03:00
Egor Pugin
7b62e5c7e0
Merge pull request #1656 from stweil/unittest
...
Fix symbolic link in unittest
2018-06-09 18:31:45 +03:00
Egor Pugin
37dadbe478
Merge pull request #1657 from Shreeshrii/master
...
update Spanish UNLV test, use spa.stopwords, iconv to UTF-8
2018-06-09 18:31:35 +03:00
Shreeshrii
d8bed41ec3
change filename to generic ~/
2018-06-09 20:17:51 +05:30
Shree Devi Kumar
a01d1604c3
update readme
2018-06-09 14:44:54 +00:00
Shree Devi Kumar
4290951fc1
add summary for Spanish UNLV test with 4.0.0-beta with --tessdata_best and --tessdata
2018-06-09 14:36:10 +00:00
Shree Devi Kumar
86700fd345
add summary for Spanish UNLV test with 4.0.0-beta with --tessdata_fast
2018-06-09 13:07:21 +00:00
Shree Devi Kumar
6559af0c9d
update Spanish UNLV test, use spa.stopwords, iconv to UTF-8
2018-06-09 12:47:09 +00:00
Stefan Weil
165622c745
Fix symbolic link in unittest
...
The unittest could not run when building out of source tree.
Fix the symbolic link and make sure that the directory for it exists.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-09 14:15:13 +02:00
Stefan Weil
5812972775
block_edges: Add assertions for block coordinates
...
Check whether the top right point of the block is inside of the
thresholded image t_pix. Otherwise the following code would make
illegal memory accesses.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-09 14:06:33 +02:00
Shree Devi Kumar
481d7775c6
Merge remote-tracking branch 'origin/master'
2018-06-09 08:15:31 +00:00
Egor Pugin
cd58a861d9
Merge pull request #1653 from stweil/typo
...
scanutils: Fix typos in comments
2018-06-09 11:00:22 +03:00
Egor Pugin
000c9864b7
Merge pull request #1654 from stweil/bool
...
capi: Fix regression caused by use of bool data type
2018-06-09 11:00:11 +03:00
Egor Pugin
3f725dd92e
Merge pull request #1650 from Shreeshrii/master
...
Add Spanish UNLV tests, use ocreval tools from /usr/local/bin
2018-06-09 10:59:50 +03:00
Stefan Weil
a709018e94
capi: Fix regression caused by use of bool data type
...
Commit 87d33b6c9e
added code which uses bool.
Therefore stdbool.h must be included for compilations with a C compiler.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-09 08:45:45 +02:00
Stefan Weil
02277bed34
scanutils: Fix typos in comments
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-09 07:53:20 +02:00
Shree Devi Kumar
e614066c59
Merge remote-tracking branch 'origin/master'
2018-06-08 17:16:09 +00:00
zdenop
e7c1e0739c
Merge pull request #1649 from stweil/locale
...
Test for correct locale settings
2018-06-08 19:02:38 +02:00
zdenop
89730e12a7
Merge pull request #1648 from stweil/fix
...
scanutils: Fix illegal memory access
2018-06-08 19:01:45 +02:00
Shreeshrii
477f57adf2
correct URL for 1995 report
2018-06-08 21:52:25 +05:30
Shreeshrii
7acebd0f52
reformat with markdown
2018-06-08 21:38:16 +05:30
Stefan Weil
3292484f67
Test for correct locale settings
...
Normal C++ programs like those which are built for tesseract automatically
set the locale "C".
There can be different locale settings if the tesseract library is used
in other software.
A wrong locale can cause wrong results from sscanf which is used at
different places in the tesseract code, so make sure that we have the
right locale settings and fail if that is not the case.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-08 17:40:10 +02:00
Shree Devi Kumar
ea7f4801ed
add option for UNLV tests for spa
2018-06-08 14:28:50 +00:00
Stefan Weil
280db06bbf
scanutils: Fix illegal memory access
...
Format strings which contain "%*s" show this error in Valgrind:
==32503== Conditional jump or move depends on uninitialised value(s)
==32503== at 0x2B8BB0: tvfscanf(_IO_FILE*, char const*, __va_list_tag*) (scanutils.cpp:486)
==32503== by 0x2B825A: tfscanf(_IO_FILE*, char const*, ...) (scanutils.cpp:234)
==32503== by 0x272B01: read_unlv_file(STRING, int, int, BLOCK_LIST*) (blread.cpp:54)
==32503== by 0x1753CD: tesseract::Tesseract::SegmentPage(STRING const*, BLOCK_LIST*, tesseract::Tesseract*, OSResults*) (pagesegmain.cpp:115)
==32503== by 0x1363CD: tesseract::TessBaseAPI::FindLines() (baseapi.cpp:2291)
==32503== by 0x130CF1: tesseract::TessBaseAPI::Recognize(ETEXT_DESC*) (baseapi.cpp:802)
==32503== by 0x1322D3: tesseract::TessBaseAPI::ProcessPage(Pix*, int, char const*, char const*, int, tesseract::TessResultRenderer*) (baseapi.cpp:1176)
==32503== by 0x131A84: tesseract::TessBaseAPI::ProcessPagesMultipageTiff(unsigned char const*, unsigned long, char const*, char const*, int, tesseract::TessResultRenderer*, int) (baseapi.cpp:1013)
==32503== by 0x132052: tesseract::TessBaseAPI::ProcessPagesInternal(char const*, char const*, int, tesseract::TessResultRenderer*) (baseapi.cpp:1129)
==32503== by 0x131B1E: tesseract::TessBaseAPI::ProcessPages(char const*, char const*, int, tesseract::TessResultRenderer*) (baseapi.cpp:1032)
==32503== by 0x12E00C: main (tesseractmain.cpp:537)
==32503== Uninitialised value was created by a stack allocation
==32503== at 0x272A60: read_unlv_file(STRING, int, int, BLOCK_LIST*) (blread.cpp:41)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-08 15:28:30 +02:00
zdenop
a6623065fe
Merge pull request #1645 from Shreeshrii/unlvtests
...
reformat EXTRA_DIST in makefile
2018-06-08 12:59:33 +02:00
zdenop
417893db71
Merge pull request #1646 from Shreeshrii/master
...
remove generated files committed by error
2018-06-08 12:58:31 +02:00
zdenop
a514765865
Merge pull request #1644 from Shreeshrii/gitignore
...
add /src/ to api, training and vs2010 in gitignore
2018-06-08 12:57:02 +02:00
Shreeshrii
68c6b42853
modify to avoid line continuations
2018-06-08 15:17:05 +05:30
Shreeshrii
df09d0db28
delete lines relating to vs2010
2018-06-08 15:12:27 +05:30
Shree Devi Kumar
1b6815364a
remove generated files commited by error
2018-06-08 08:59:20 +00:00
Shree Devi Kumar
7a9fef9685
reformat EXTRA_DIST in makefile
2018-06-08 08:32:47 +00:00
zdenop
29304a4173
Merge pull request #1642 from stweil/unlvtests
...
Remove some files which are generated by the UNLV test
2018-06-08 08:42:14 +02:00
zdenop
1309749a15
Merge pull request #1643 from Shreeshrii/unlv
...
move unlvtests ignored files to .gitignore in root dir
2018-06-08 08:40:40 +02:00
Shree Devi Kumar
f2caeb43b4
add /src/ to api, training and vs2010
2018-06-08 05:34:47 +00:00
Shree Devi Kumar
828d727135
move unlvtests ignored files to .gitignore in root dir
2018-06-08 05:29:44 +00:00
Stefan Weil
fcaf192ea3
Remove some files which are generated by the UNLV test
...
They don't contain useful information.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-07 20:22:08 +02:00
zdenop
51ebf8a21e
Merge pull request #1640 from stweil/unlvtests
...
Fix script for UNLV tests
2018-06-07 07:20:54 +02:00
Stefan Weil
bbb4658733
Fix log message of UNLV tests
...
We must filter unwanted output from tesseract.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-07 07:18:55 +02:00
Stefan Weil
ff3b263c5b
Fix script for UNLV tests
...
Commit 934e612a3e
added too many quotes.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-07 07:12:08 +02:00
zdenop
d47cebcdc8
Merge pull request #1641 from stweil/fix
...
training: Add missing linefeed to error message
2018-06-06 22:13:26 +02:00
Stefan Weil
0215d91f45
training: Add missing linefeed to error message
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-06-06 21:32:16 +02:00
zdenop
8b3501e54f
Merge pull request #1638 from Shreeshrii/master
...
remove testing and testdata, use from submodule test, add unlvtests
2018-06-06 14:51:59 +02:00
Shree Devi Kumar
deea045e4a
Merge branch 'master' of https://github.com/tesseract-ocr/tesseract
2018-06-06 12:26:20 +00:00
Shree Devi Kumar
2563380d51
move testing and testdata to test, add unlvtests
2018-06-06 12:20:14 +00:00