./textproc/py-html5lib, HTML5 parser and tokenizer

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2012Q3, Version: 0.90, Package name: py27-html5lib-0.90, Maintainer: joerg

html5lib is a pure-python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.

Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.


Required to run:
[devel/py-setuptools]

Master sites:

SHA1: 37fdf4d853f53ebd170250f7f023f55a02659378
RMD160: ba01161f3b0d6a5dfb9e1ffedaf9c18a6b7d2a19
Filesize: 96.994 KB

Version history: (Expand)