graphics/claraocr
Optical Character Recognition (OCR) program for books
Branch: pkgsrc-2021Q4
Version: 0.9.9nb10
Package name: claraocr-0.9.9nb10
Maintainer: pkgsrc-users
Clara OCR is a free (GPL) Optical Character Recognition (OCR) program
for systems that support the C library and the X Window System (e.g.
most flavours of Unix). The development platform of Clara OCR is
32-bit Intel running GNU/Linux.
Clara OCR is intended for large-scale digitization projects. It
features a powerful GUI and a web interface for cooperative
digitization of books. Clara OCR development started in 1999 and
is approaching production quality.
Features:
- Converts pbm/pgm image files to text (ISO-8859)
- Can process scans in batch for large documents
- Can run from the command line
- Is relatively easy to train
Non-features:
- Is not "omnifont"; you must train it for each document
- Does not scan the images
- Does not support Unicode
- Cannot read handwriting
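Because Clara OCR reads only pbm/pgm input and does not scan images itself, a batch job may benefit from checking the scanned files before they are handed to it. The following is a minimal sketch of such a check, not part of Clara OCR: it relies only on the standard Netpbm magic numbers (P1/P4 for PBM, P2/P5 for PGM), and the helper names `netpbm_kind` and `select_pages` are hypothetical.

```python
from pathlib import Path

# Netpbm magic numbers: P1/P4 are PBM (plain/raw), P2/P5 are PGM (plain/raw).
NETPBM_MAGIC = {b"P1": "pbm", b"P4": "pbm", b"P2": "pgm", b"P5": "pgm"}


def netpbm_kind(path):
    """Return 'pbm' or 'pgm' if the file starts with a matching magic number, else None."""
    with open(path, "rb") as f:
        return NETPBM_MAGIC.get(f.read(2))


def select_pages(directory):
    """Hypothetical helper: files in `directory` that look like PBM/PGM pages, sorted by name."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and netpbm_kind(p) is not None
    )
```

The check looks at file contents rather than extensions, so a mislabeled file (for example a PNG saved with a `.pbm` suffix) is left out of the batch. It does not validate the rest of the header or the pixel data.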
Filesize: 409.696 KB
Version history:
- (2022-01-05) Package added to pkgsrc.se, version claraocr-0.9.9nb10 (created)