textproc/py-html5lib
HTML5 parser and tokenizer
Branch: pkgsrc-2009Q2
Version: 0.11
Package name: py25-html5lib-0.11
Maintainer: joerg

html5lib is a pure-Python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.
Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
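
As a rough illustration of the description above, the following sketch parses a deliberately malformed fragment into a DOM tree. It assumes html5lib is installed and that the `HTMLParser` and `treebuilders.getTreeBuilder` interface is available; the exact API may differ between releases. The helper name `parse_to_dom` and the sample markup are hypothetical and are not part of the package.

```python
# Minimal sketch, not the package's official usage example.
try:
    import html5lib
    from html5lib import treebuilders
except ImportError:  # html5lib is a third-party package and may be absent
    html5lib = None


def parse_to_dom(markup):
    """Parse markup (valid or not) and return an xml.dom.minidom document.

    parse_to_dom is a hypothetical helper name used only for this sketch.
    """
    if html5lib is None:
        raise RuntimeError("html5lib is not installed")
    # Select the DOM tree builder; the description also lists ElementTree,
    # lxml and BeautifulSoup output formats.
    parser = html5lib.HTMLParser(tree=treebuilders.getTreeBuilder("dom"))
    return parser.parse(markup)


if __name__ == "__main__" and html5lib is not None:
    # Unclosed <p> and <b> tags: the parser still builds a complete tree.
    document = parse_to_dom("<p>Unclosed paragraph<b>bold")
    print(document.getElementsByTagName("body")[0].toxml())
```

The DOM builder is used here only because it needs nothing beyond the Python standard library; passing a different name to `getTreeBuilder` would select one of the other tree formats, assuming the corresponding library is installed.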
Master sites:
SHA1: cfacf8feed09bd0d53bc713965d70c8e9a416e92
RMD160: fd6e377fa4af43d008147ee3fa50a18533eaa19d
Filesize: 183.11 KB
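
The SHA1 value listed above can be compared against a downloaded distfile. The sketch below is one way to do that with Python's standard `hashlib` module; the helper names `sha1_of_stream` and `file_matches` are hypothetical, and the path to the distfile is left to the caller because the listing does not name the file.

```python
import hashlib
import io

# SHA1 digest copied from the listing above.
EXPECTED_SHA1 = "cfacf8feed09bd0d53bc713965d70c8e9a416e92"


def sha1_of_stream(stream, chunk_size=65536):
    """Return the hex SHA1 digest of a binary stream, read in chunks."""
    digest = hashlib.sha1()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches(path, expected=EXPECTED_SHA1):
    """Check a file on disk against the expected digest (hypothetical helper)."""
    with open(path, "rb") as handle:
        return sha1_of_stream(handle) == expected.lower()


if __name__ == "__main__":
    # In-memory demonstration only; this is not the real distfile.
    print(sha1_of_stream(io.BytesIO(b"example bytes")))
```

Reading in chunks keeps memory use constant regardless of file size. The RMD160 value is not checked here because RIPEMD-160 support in `hashlib` depends on the underlying OpenSSL build.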
Version history:
- (2009-07-09) Package has been reborn
- (2009-07-08) Package added to pkgsrc.se, version py25-html5lib-0.11 (created)