Path to this page:
./
textproc/py-html5lib,
HTML5 parser and tokenizer
Branch: CURRENT,
Version: 1.1nb2,
Package name: py312-html5lib-1.1nb2,
Maintainer: pkgsrc-usershtml5lib is a pure-python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.
Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
Required to run:[
www/py-genshi] [
textproc/py-lxml] [
lang/py-six] [
textproc/py-webencodings] [
lang/python310]
Master sites:
Filesize: 265.835 KB
Version history: (Expand)
- (2024-11-11) Updated to version: py312-html5lib-1.1nb2
- (2024-04-30) Updated to version: py311-html5lib-1.1nb2
- (2024-01-14) Updated to version: py311-html5lib-1.1nb1
- (2022-11-09) Updated to version: py310-html5lib-1.1nb1
- (2022-01-05) Updated to version: py39-html5lib-1.1nb1
- (2021-11-09) Updated to version: py39-html5lib-1.1
CVS history: (Expand)
2024-01-14 10:45:00 by Thomas Klausner | Files touched by this commit (9) | |
Log message:
*: update for py27-lxml
|
2022-11-09 14:14:32 by Joerg Sonnenberger | Files touched by this commit (223) |
Log message:
Reset MAINTAINER
|
2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595) |
Log message:
*: bump PKGREVISION for egg.mk users
They now have a tool dependency on py-setuptools instead of a DEPENDS
|
2021-11-09 21:10:28 by Thomas Klausner | Files touched by this commit (3) | |
Log message:
py-html5lib: update to 1.1.
Add some missing dependencies and test dependencies.
1.1
~~~
Breaking changes:
* Drop support for Python 3.3. (#358)
* Drop support for Python 3.4. (#421)
Deprecations:
* Deprecate the ``html5lib`` sanitizer (``html5lib.serialize(sanitize=True)`` and
``html5lib.filters.sanitizer``). We recommend users migrate to `Bleach
<https://github.com/mozilla/bleach>`. Please let us know if Bleach \
doesn't suffice for your
use. (#443)
Other changes:
* Try to import from ``collections.abc`` to remove DeprecationWarning and ensure
``html5lib`` keeps working in future Python versions. (#403)
* Drop optional ``datrie`` dependency. (#442)
|
2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161) |
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums
All checksums have been double-checked against existing RMD160 and
SHA512 hashes
Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
|
2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162) |
Log message:
textproc: Remove SHA1 hashes for distfiles
|
2018-02-26 09:24:42 by Adam Ciarcinski | Files touched by this commit (3) | |
Log message:
py-html5lib: updated to 1.0.1
1.0.1
Breaking changes:
* Drop support for Python 2.6.
* Remove utils/spider.py
Features:
* Improve documentation.
* Add iframe seamless boolean attribute.
* Add itemscope as a boolean attribute.
* Support Python 3.6.
* Add CI support for Windows using AppVeyor.
* Improve testing and CI and add code coverage
* Semver-compliant version number.
Bug fixes:
* Add support for setuptools < 18.5 to support environment markers.
* Add explicit dependency for six >= 1.9.
* Fix regexes to work with Python 3.7 regex adjustments.
* Fix alphabeticalattributes filter namespace bug.
* Include license file in generated wheel package.
* Fix annotation-xml typo.
* Allow uppercase hex chararcters in CSS colour check.
|
2017-01-15 00:04:16 by Klaus Klein | Files touched by this commit (1) |
Log message:
Add dependency on py-webencodings (added the package in preparation,
but still managed not to add the dependency here).
Bump PKGREVISION.
|