Commit Graph

37 Commits

Author SHA1 Message Date
Stefan Weil
1c7e00611b Add initial support for traineddata files in standard archive formats
This requires libarchive-dev.

Tesseract can now load traineddata files in any of the archive formats
which are supported by libarchive. Example of a zipped BagIt archive:

    $ unzip -l /usr/local/share/tessdata/zip.traineddata
    Archive:  /usr/local/share/tessdata/zip.traineddata
      Length      Date    Time    Name
    ---------  ---------- -----   ----
           55  2019-03-05 15:27   bagit.txt
            0  2019-03-05 15:25   data/
         1557  2019-03-05 15:28   manifest-sha256.txt
      1082890  2019-03-05 15:25   data/eng.word-dawg
      1487588  2019-03-05 15:25   data/eng.lstm
         7477  2019-03-05 15:25   data/eng.unicharset
        63346  2019-03-05 15:25   data/eng.shapetable
       976552  2019-03-05 15:25   data/eng.inttemp
        13408  2019-03-05 15:25   data/eng.normproto
         4322  2019-03-05 15:25   data/eng.punc-dawg
         4738  2019-03-05 15:25   data/eng.lstm-number-dawg
         1410  2019-03-05 15:25   data/eng.freq-dawg
          844  2019-03-05 15:25   data/eng.pffmtable
         6360  2019-03-05 15:25   data/eng.lstm-unicharset
         1012  2019-03-05 15:25   data/eng.lstm-recoder
         1047  2019-03-05 15:25   data/eng.unicharambigs
         4322  2019-03-05 15:25   data/eng.lstm-punc-dawg
     16109842  2019-03-05 15:25   data/eng.bigram-dawg
           80  2019-03-05 15:25   data/eng.version
         6426  2019-03-05 15:25   data/eng.number-dawg
      3694794  2019-03-05 15:25   data/eng.lstm-word-dawg
    ---------                     -------
     23468070                     21 files

`combine_tessdata -d` and `combine_tessdata -u` also work.

The traineddata files in the new format can be generated with
standard tools like zip or tar.

More work is needed for other training tools and big endian support.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-03-05 17:18:48 +01:00
Stefan Weil
4bd18c4c1b Travis: Enable continuous integration build for macOS
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-02-18 10:49:51 +01:00
Egor Pugin
db3ed5d656 Run travis builds without cppan. 2019-01-16 13:16:51 +03:00
Egor Pugin
23b8036737
Update .travis.yml 2019-01-15 19:36:05 +03:00
Egor Pugin
63e1862049 Disable travis os x builds as they do not work well on travis-side. 2017-09-09 21:13:14 +03:00
Stefan Weil
a2404ae735 Fix Travis CI for Leptonica 1.74.2
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-06-05 13:30:49 +02:00
Stefan Weil
44a5e3da40 Update from Leptonica 1.74.1 to 1.74.2
The newer version contains fixes for the pixUnsharpMaskingGray*
functions which are relevant for Tesseract (used in ImageData::PreScale
which calls pixScale).

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-06-05 10:31:53 +02:00
Stephen
b4b14061ff shouldn't rely on different install 2017-05-12 14:14:08 -07:00
Stephen
14baca38b1 test brew installs but allow failures 2017-05-12 14:02:39 -07:00
Stefan Weil
f5494a7535 Update and improve configuration for Travis CI
* Use container based Ubuntu trusty. This provides a newer gcc version
  by default. CI jobs should now start faster.

* Add caching. Leptonica is now only built once for each platform.

* Get Leptonica tar instead of zip file. It's smaller,
  and there is no need to store it on disk.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-05-05 22:30:08 +02:00
Stefan Weil
325de57d54 Update Leptonica version for Travis CI
Use the latest version which is available.
Remove also some unneeded whitespace.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-04-22 17:51:39 +02:00
Egor Pugin
362b68ecfd Update .travis.yml 2017-03-25 22:06:12 +03:00
Egor Pugin
ddd901b6e4 Update .travis.yml 2017-03-25 22:04:57 +03:00
Egor Pugin
b9dd8be8c7 Update .travis.yml 2017-03-25 22:01:27 +03:00
Egor Pugin
0488a0c827 Update .travis.yml 2017-03-25 21:56:58 +03:00
Amit D
b0b4c57ef3 Travis CI - Update Leptonica to 1.74.0 2016-12-24 13:39:20 +02:00
Amit D
c1f191cd5a Travis CI - Update Leptonica to 1.74.0 2016-12-24 13:17:59 +02:00
Amit D
7f14e17b6c Travis CI - Update Leptonica to 1.74 2016-12-24 13:04:56 +02:00
Egor Pugin
f9ce278cbb Turn off macos travis build as it fails during bootstrap. 2016-10-11 17:21:52 +03:00
Egor Pugin
4b6e7af2c5 Update .travis.yml 2016-10-07 10:11:43 +03:00
Egor Pugin
ef32ec9c68 Update .travis.yml 2016-01-26 14:15:17 +03:00
Egor Pugin
9bfa7643b4 Update .travis.yml 2016-01-26 13:42:59 +03:00
Egor Pugin
dac1bd4c9e Update .travis.yml 2016-01-26 12:44:36 +03:00
Egor Pugin
fceb3abc1f Update ci scripts. 2016-01-14 14:01:55 +03:00
Stefan Weil
a940180e9a Fix Travis build error (error when getting cmake)
This command fails currently:

wget http://www.cmake.org/files/v3.3/cmake-3.3.1-Linux-x86_64.sh

Obviously cmake.org now redirects to https connections, so we have to
fix the URL. Then there still remains a certificate problem. Adding
--no-check-certificate is a workaround which should be removed later.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2015-11-07 10:13:33 +01:00
Egor Pugin
dfb5aa5c34 Add Leptonica_BUILD_DIR option for CMake find_package in Config mode. Set correct .so name on Linux. 2015-10-09 18:12:02 +03:00
Egor Pugin
5e7ffe9e56 Remove unused option. 2015-09-07 12:57:02 +03:00
Egor Pugin
b375728012 Add BUILD_TRAINING CMake option. 2015-09-06 23:59:54 +03:00
egorpugin
25225117b6 Update .travis.yml 2015-09-06 22:12:45 +03:00
egorpugin
57947cd053 Update .travis.yml 2015-09-06 22:05:54 +03:00
egorpugin
a65f2da409 Update .travis.yml 2015-09-06 21:58:49 +03:00
egorpugin
c3198df368 Update .travis.yml 2015-09-06 21:52:42 +03:00
egorpugin
e5fad6432e Update .travis.yml 2015-09-06 21:36:47 +03:00
egorpugin
df2309727d Update .travis.yml 2015-09-06 21:21:53 +03:00
egorpugin
d7059e13fd Update .travis.yml 2015-09-06 21:15:04 +03:00
egorpugin
e67372c849 Update .travis.yml 2015-09-06 20:59:04 +03:00
Egor Pugin
0f651c187b Add .travis.yml. 2015-09-06 20:54:05 +03:00