./graphics/tesseract, Tesseract Open Source OCR Engine

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2016Q2, Version: 3.04.01nb1, Package name: tesseract-3.04.01nb1, Maintainer: pkgsrc-users

This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI. It can only process an image of a
single column and create text from it. It can detect fixed pitch
vs proportional text. Having said that, in 1995, this engine was
in the top 3 in terms of character accuracy, and it compiles and
runs on both Linux and Windows. Another current limitation is that
it only recognizes English and its character set is only US-ASCII.
Training code IS included in the open source release however, and
will be included in a future release.


Required to run:
[devel/pango] [graphics/cairo] [graphics/leptonica]

Required to build:
[x11/dri2proto] [x11/inputproto] [x11/xf86vidmodeproto] [x11/xcb-proto] [x11/fixesproto4] [x11/xextproto] [x11/renderproto] [x11/xf86driproto] [x11/xproto] [x11/damageproto] [x11/glproto]

Master sites:

SHA1: 359ffc1925f0270ca100a2b4c1d3b41f4b23701d
RMD160: 5e754411afa74cfc4e6b601fe2c770ba93a25f23
Filesize: 2215.923 KB

Version history: (Expand)