pkgsrc.se | The NetBSD package collection

./textproc/py-html5lib, HTML5 parser and tokenizer


Branch: CURRENT, Version: 1.1nb2, Package name: py312-html5lib-1.1nb2, Maintainer: pkgsrc-users

html5lib is a pure-Python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.

Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
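
As an illustration of the tree output described above, the following is a
minimal sketch that assumes the package is installed and uses the ElementTree
tree builder. The helper name parse_to_etree and the sample markup are
hypothetical; html5lib.parse with the treebuilder and namespaceHTMLElements
arguments is the library's documented entry point.

```python
import html5lib


def parse_to_etree(markup):
    """Parse markup and return the root <html> element as an ElementTree node.

    namespaceHTMLElements=False keeps tag names plain ("p") instead of
    prefixing them with the XHTML namespace.
    """
    return html5lib.parse(markup, treebuilder="etree",
                          namespaceHTMLElements=False)


# Deliberately invalid input: no <html>, <head>, or <body>, and unclosed tags.
root = parse_to_etree("<p>Hello <b>world")

# The parser supplies the missing structure, as a browser would.
bold = root.find(".//b")
print(root.tag, bold.text)
```

The unclosed elements are recovered rather than rejected, which is the
browser-compatible error handling the description refers to.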

Required to run:
[www/py-genshi] [textproc/py-lxml] [lang/py-six] [textproc/py-webencodings] [lang/python310]

Master sites:

https://files.pythonhosted.org/packages/source/h/html5lib/

Filesize: 265.835 KB

Version history:

(2024-11-11) Updated to version: py312-html5lib-1.1nb2
(2024-04-30) Updated to version: py311-html5lib-1.1nb2
(2024-01-14) Updated to version: py311-html5lib-1.1nb1
(2022-11-09) Updated to version: py310-html5lib-1.1nb1
(2022-01-05) Updated to version: py39-html5lib-1.1nb1
(2021-11-09) Updated to version: py39-html5lib-1.1

CVS history:

2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)

Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays

2024-04-30 11:01:14 by Thomas Klausner | Files touched by this commit (8)

Log message:
py-html5lib: convert to wheel.mk.

Add upstream patches to get test suite to progress a bit further.

Bump PKGREVISION.

2024-01-14 10:45:00 by Thomas Klausner | Files touched by this commit (9) | Package updated

Log message:
*: update for py27-lxml

2022-11-09 14:14:32 by Joerg Sonnenberger | Files touched by this commit (223)

Log message:
Reset MAINTAINER

2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)

Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS

2021-11-09 21:10:28 by Thomas Klausner | Files touched by this commit (3) | Package updated

Log message:
py-html5lib: update to 1.1.

Add some missing dependencies and test dependencies.

1.1
~~~

Breaking changes:

* Drop support for Python 3.3. (#358)
* Drop support for Python 3.4. (#421)

Deprecations:

* Deprecate the ``html5lib`` sanitizer (``html5lib.serialize(sanitize=True)`` and
  ``html5lib.filters.sanitizer``). We recommend users migrate to `Bleach
  <https://github.com/mozilla/bleach>`. Please let us know if Bleach doesn't
  suffice for your use. (#443)

Other changes:

* Try to import from ``collections.abc`` to remove DeprecationWarning and ensure
  ``html5lib`` keeps working in future Python versions. (#403)
* Drop optional ``datrie`` dependency. (#442)
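
The ``collections.abc`` change above follows a common compatibility pattern,
sketched here under the assumption that only ``Mapping`` is needed. This is an
illustration of the general technique, not html5lib's actual code, and the
helper ``is_mapping`` is hypothetical.

```python
try:
    # Preferred location of the abstract base classes on Python 3.
    from collections.abc import Mapping
except ImportError:
    # Fallback for old interpreters that only expose them in ``collections``.
    from collections import Mapping


def is_mapping(obj):
    """Return True if obj implements the read-only mapping interface."""
    return isinstance(obj, Mapping)


print(is_mapping({"a": 1}), is_mapping([1, 2]))
```

Trying the new location first avoids the DeprecationWarning on interpreters
that still offered the old alias, and keeps working once that alias is gone.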

2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)

Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip

2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)

Log message:
textproc: Remove SHA1 hashes for distfiles