Path to this page:
./
graphics/tesseract,
Tesseract Open Source OCR Engine
Branch: pkgsrc-2017Q4,
Version: 3.05.01nb2,
Package name: tesseract-3.05.01nb2,
Maintainer: pkgsrc-usersThis code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI. It can only process an image of a
single column and create text from it. It can detect fixed pitch
vs proportional text. Having said that, in 1995, this engine was
in the top 3 in terms of character accuracy, and it compiles and
runs on both Linux and Windows. Another current limitation is that
it only recognizes English and its character set is only US-ASCII.
Training code IS included in the open source release however, and
will be included in a future release.
Required to run:[
graphics/leptonica] [
graphics/cairo] [
devel/pango] [
textproc/icu]
Required to build:[
x11/xproto] [
x11/xextproto] [
x11/damageproto] [
x11/dri2proto] [
x11/glproto] [
x11/renderproto] [
x11/xf86vidmodeproto] [
x11/fixesproto4] [
x11/inputproto] [
x11/xf86driproto] [
x11/xcb-proto] [
pkgtools/cwrappers] [
pkgtools/x11-links]
Master sites:
SHA1: a9a70bf84a597cb3c228d73c70a590e7b032b6ce
RMD160: 11fae540fdd0ec4f6f9388fae4bbde790b17ee4d
Filesize: 3491.025 KB
Version history: (Expand)
- (2018-01-02) Package added to pkgsrc.se, version tesseract-3.05.01nb2 (created)