./textproc/uchardet, Encoding detector library ported from Mozilla

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 0.0.8, Package name: uchardet-0.0.8, Maintainer: pkgsrc-users

uchardet is an encoding detector library, which takes a sequence of bytes
in an unknown character encoding without any additional information, and
attempts to determine the encoding of the text. Returned encoding names are
iconv-compatible.

uchardet started as a C language binding of the original C++ implementation
of the universal charset detection library by Mozilla. It can now detect
more charsets, and more reliably than the original implementation.


Master sites:

Filesize: 217.43 KB

Version history: (Expand)


CVS history: (Expand)


   2024-07-24 11:35:59 by Jonathan Perkin | Files touched by this commit (1)
Log message:
uchardet: Unbreak previous change.
   2024-07-22 22:21:57 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
uchardet: updated to 0.0.8

Version 0.0.8 released.

New supports:

Norwegian: IBM865, ISO-8859-1, ISO-8859-15 and WINDOWS-1252.
Danish: IBM865.

Minimum CMake version bumped to 3.1 (requirement was 2.8.5 before) to
have CMake exported targets:

The executable uchardet::uchardet
The library uchardet::libuchardet
The static library uchardet::libuchardet_static

Fix build issues for UWP on Windows.
Add uchardet CLI tool building support for MSVC.
Various bug fixes and docs/README tweaks.
   2023-07-18 20:47:54 by Nia Alarie | Files touched by this commit (10)
Log message:
textproc: Adapt packages (where possible) to USE_(CC|CXX)_FEATURES
   2022-04-26 01:16:50 by Tobias Nygren | Files touched by this commit (3)
Log message:
uchardet: no -Wl,--version-script on SunOS
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
   2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)
Log message:
textproc: Remove SHA1 hashes for distfiles
   2020-06-09 15:53:00 by Nia Alarie | Files touched by this commit (5)
Log message:
textproc: Add uchardet

uchardet is an encoding detector library, which takes a sequence of bytes
in an unknown character encoding without any additional information, and
attempts to determine the encoding of the text. Returned encoding names are
iconv-compatible.

uchardet started as a C language binding of the original C++ implementation
of the universal charset detection library by Mozilla. It can now detect
more charsets, and more reliably than the original implementation.