textproc/py-html5lib
HTML5 parser and tokenizer
Branch: pkgsrc-2010Q1
Version: 0.11.1
Package name: py26-html5lib-0.11.1
Maintainer: joerg

html5lib is a pure-Python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.
The parser produces a tree structure; the current release supports the
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
Required to run: lang/python26
Required to build: archivers/unzip
Master sites:
SHA1: 157506319e40f5d973c128e5e2b826cd1bee471e
RMD160: ac00975e5ea8b20606531e631274c1a8985110c9
Filesize: 367.082 KB
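
The SHA1 value listed above can be used to check a downloaded distfile before building. The following is a minimal sketch using Python's standard `hashlib` module, not part of the package itself. The helper names `sha1_of_file` and `matches_listing` are hypothetical, and the listing does not name the distfile, so the path has to be supplied by the caller.

```python
import hashlib

# SHA1 digest copied from the listing above.
EXPECTED_SHA1 = "157506319e40f5d973c128e5e2b826cd1bee471e"


def sha1_of_file(path, chunk_size=65536):
    """Return the hex SHA1 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_SHA1):
    """Return True if the file's SHA1 equals `expected` (hypothetical helper)."""
    return sha1_of_file(path) == expected.lower()
```

Calling `matches_listing` with the path of the downloaded archive should return `True` only if the file is byte-for-byte the one the digest was computed from. The RMD160 value is not checked here because RIPEMD-160 support in `hashlib` depends on the local OpenSSL build.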
Version history:
- (2010-04-09) Package added to pkgsrc.se, version py26-html5lib-0.11.1 (created)