./graphics/claraocr, Optical Character Recognition (OCR) program for books


Branch: pkgsrc-2014Q2, Version: 0.9.9nb4, Package name: claraocr-0.9.9nb4, Maintainer: pkgsrc-users

Clara OCR is a free (GPL) Optical Character Recognition (OCR) program
for systems that support the C library and the X Window System (e.g.
most flavours of Unix). The development platform of Clara OCR is
32-bit Intel running GNU/Linux.

Clara OCR is intended for large-scale digitization projects. It
features a powerful GUI and a web interface for cooperative
digitization of books. Clara OCR development started in 1999 and
is approaching production quality.


Features:
Converts pbm/pgm image files to text (ISO-8859)
Can process scans in batch for large documents
Can run from the command line
Is relatively easy to train
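
Because only pbm/pgm input is accepted, a batch job may want to filter a scan directory before recognition starts. The following is a minimal sketch, not part of Clara OCR: the helper names netpbm_kind and select_scans are hypothetical, and the check relies only on the standard Netpbm magic numbers (P1/P4 for PBM, P2/P5 for PGM).

```python
from pathlib import Path

# Netpbm magic numbers: P1/P4 are PBM (plain/raw), P2/P5 are PGM (plain/raw).
SUPPORTED_MAGIC = {b"P1": "pbm", b"P4": "pbm", b"P2": "pgm", b"P5": "pgm"}


def netpbm_kind(path):
    """Return 'pbm' or 'pgm' if the file starts with a matching magic number, else None."""
    with open(path, "rb") as f:
        return SUPPORTED_MAGIC.get(f.read(2))


def select_scans(directory):
    """Return the sorted paths in `directory` that look like PBM or PGM images.

    Hypothetical helper: it inspects file contents rather than trusting extensions.
    """
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and netpbm_kind(p) is not None
    )
```

Files in other formats would first need converting, for example with the netpbm tools listed under "Required to run".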


Limitations:
Is not "omnifont"; you must train it for each document
Does not scan the images
Does not support Unicode
Cannot read handwriting

Required to run:
[graphics/netpbm] [lang/perl5]

Required to build:


SHA1: 7d18088ad086d476cce2821497db16f4f444231e
RMD160: 6831d71a5ae383fd025d57050edbabe4a5359666
Filesize: 409.696 KB
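
The SHA1 digest above can be used to check a downloaded distfile by hand. A minimal sketch, assuming the archive has already been fetched to a local path of your choosing (the function names sha1_of and verify are hypothetical; pkgsrc itself performs this check as part of its normal build):

```python
import hashlib

# SHA1 digest as listed for this package.
EXPECTED_SHA1 = "7d18088ad086d476cce2821497db16f4f444231e"


def sha1_of(path, chunk_size=65536):
    """Return the hex SHA1 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA1):
    """Return True if the file's SHA1 digest matches `expected` (case-insensitive)."""
    return sha1_of(path) == expected.lower()
```

Calling verify() with the path to the downloaded archive returns True only when the digests match. The RMD160 value could be checked the same way where the local hashlib build provides a RIPEMD-160 implementation.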
