Path to this page:
./
graphics/tesseract,
Open Source OCR Engine
Branch: CURRENT,
Version: 5.5.0,
Package name: tesseract-5.5.0,
Maintainer: pkgsrc-usersTesseract provides an OCR engine and a command line program. It
includes a new neural net (LSTM) based OCR engine which is focused on
line recognition, but also still provides a legacy OCR engine which
works by recognizing character patterns. Tesseract has Unicode (UTF-8)
support, and can recognize more than 100 languages "out of the box".
Tesseract can be trained to recognize other languages. It supports
various output formats: plain text, hOCR (HTML), PDF,
invisible-text-only PDF, and TSV.
Required to run:[
textproc/icu] [
graphics/cairo] [
devel/pango] [
graphics/leptonica]
Required to build:[
textproc/asciidoc] [
pkgtools/x11-links] [
x11/xcb-proto] [
pkgtools/cwrappers] [
x11/xorgproto]
Master sites:
Filesize: 1860.369 KB
Version history: (Expand)
- (2024-11-18) Updated to version: tesseract-5.5.0
- (2024-11-15) Updated to version: tesseract-5.4.1nb3
- (2024-11-01) Updated to version: tesseract-5.4.1nb2
- (2024-11-01) Updated to version: tesseract-5.4.1nb1
- (2024-06-11) Updated to version: tesseract-5.4.1
- (2024-06-11) Updated to version: tesseract-5.4.0
CVS history: (Expand)
2024-11-18 11:32:00 by Adam Ciarcinski | Files touched by this commit (2) | |
Log message:
tesseract: updated to 5.5.0
5.5.0
Fix TARGET_PDB_FILE error for static linking.
Make regular usage of CMAKE_INSTALL_LIBDIR and GNUInstallDirs
Ignore illegal TESSDATA_PREFIX (not existing filesystem entry)
Fix confidence output for the PAGE XML renderer
Set hOCR capabilities ocrp_dir and ocrp_lang unconditionally
Reduce clock syscalls
Calculate row bounding box in single-word mode
Replace access/_access by std::filesystem::exists
Modernize code for list of available models
Fix performance and other issues reported by Codacy
Remove unnecessary assignment and assertions
Update code for tprintf
Add C++ stream for log messages and use it in two debug messages
cmake: Correctly set the soversion based on SemVer properties
Replace deprecated runner macos-12 by macos-latest in GitHub actions
Modernize code for renderers and remove filename conversion for Windows
Fix some typos and grammer issues
Add GitHub action and Makefile target for Windows installer
Support symbolic values for --oem and --psm options
Replace some tprintf by tesserr stream (fixes Windows compiler warnings)
Add RISC-V V support
Fix and improve Windows installer
Remove Tensorflow support
Update submodule googletest to release v1.15.2
|
2024-11-14 23:22:33 by Thomas Klausner | Files touched by this commit (2429) |
Log message:
*: recursive bump for icu 76 shlib major version bump
|
2024-11-01 13:55:19 by Thomas Klausner | Files touched by this commit (2426) |
Log message:
*: revbump for icu downgrade
|
2024-11-01 01:54:33 by Thomas Klausner | Files touched by this commit (2427) |
Log message:
*: recursive bump for icu 76.1 shlib bump
|
2024-08-25 08:19:21 by Thomas Klausner | Files touched by this commit (575) |
Log message:
*: replace CMAKE_ARGS with CMAKE_CONFIGURE_ARGS
|
2024-06-11 21:35:53 by Adam Ciarcinski | Files touched by this commit (2) | |
Log message:
tesseract: updated to 5.4.1
5.4.1
This release fixes a regression with legacy or mixed models.
Avoid FP overflow in NormEvidenceOf
Update deprecated Node.js 16 GitHub actions
Fix code style issues which were reported by Codacy
Fix some issues which were reported by Codacy
Fix more Codacy issues
Several build fixes
|
2024-06-11 12:04:14 by Adam Ciarcinski | Files touched by this commit (3) | |
Log message:
tesseract: updated to 5.4.0
5.4.0
Update appveyor.yml - Url has changed
Fix grey result of indexed PNG in pdfrenderer.
Fix some typos
normstrngs: add more hyphens and quotes
Rename frk -> deu_latf (ISO 639-3, ISO 15924)
Fix some performance issues which were reported by Coverity Scan
Remove broken Dockerfile
PAGE XML renderer / export
Remove unsupported OpenCL code and related API functions
facilitate vectorization for generic build
Support training without lstmf files
Simplify GridSearch<...> variables using typedef equivalents
Use std::min and std::max for min & max operations in \
makerow.cpp::most_overlapping_row()
Fix a few typos in comments
Remove an unused variable in paragraphs.cpp::DetectParagraphs()
A few refactors in some files
Fix output and issues reported by Coverity Scan for PAGE XML renderer
Update documentation
Fix some issues which were reported by GitHub code scanning
Improve CCUtil::main_setup
Allow for text angle/gradient to be retrieved
Fix setup of datadir on installations with Conda
Fix FP exception in Wordrec::angle_change
Use AM_CPPFLAGS also for compilation of all sources
Fix some compiler warnings
Remove unused xmlns:xlink from ALTO renderer
Fix some compiler warnings
Fixes: remove unnecessary nullptr checks
Avoid redundant conversion from std::string to char * to std::string
Replace strcpy and strncpy by new inline helper function
Make function Network::spec pure virtual
|
2024-05-29 18:35:19 by Adam Ciarcinski | Files touched by this commit (1929) | |
Log message:
revbump after icu and protobuf updates
|