./textproc/py-webencodings, Character encoding for the web

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 0.5.1nb2, Package name: py311-webencodings-0.5.1nb2, Maintainer: kleink

This is a Python implementation of the WHATWG Encoding standard.

In order to be compatible with legacy web content when interpreting
something like Content-Type: text/html; charset=latin1, tools need
to use a particular set of aliases for encoding labels as well as
some overriding rules. For example, US-ASCII and iso-8859-1 on the
web are actually aliases for windows-1252, and an UTF-8 or UTF-16
BOM takes precedence over any other encoding declaration. The
Encoding standard defines all such details so that implementations
do not have to reverse-engineer each other.

This module has encoding labels and BOM detection, but the actual
implementation for encoders and decoders is Python's.


Required to run:
[lang/python310]

Master sites:

Filesize: 9.493 KB

Version history: (Expand)


CVS history: (Expand)


   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
   2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)
Log message:
textproc: Remove SHA1 hashes for distfiles
   2017-07-26 12:58:03 by Adam Ciarcinski | Files touched by this commit (2)
Log message:
0.5.1:
Bug fixes.
   2016-10-26 09:50:24 by Klaus Klein | Files touched by this commit (4)
Log message:
Import webencodings 0.5 as textproc/py-webencodings.

This is a Python implementation of the WHATWG Encoding standard.

In order to be compatible with legacy web content when interpreting
something like Content-Type: text/html; charset=latin1, tools need
to use a particular set of aliases for encoding labels as well as
some overriding rules. For example, US-ASCII and iso-8859-1 on the
web are actually aliases for windows-1252, and an UTF-8 or UTF-16
BOM takes precedence over any other encoding declaration. The
Encoding standard defines all such details so that implementations
do not have to reverse-engineer each other.

This module has encoding labels and BOM detection, but the actual
implementation for encoders and decoders is Python's.