textproc/py-html5lib
HTML5 parser and tokenizer
Branch: pkgsrc-2017Q3
Version: 0.999999999nb1
Package name: py27-html5lib-0.999999999nb1
Maintainer: joerg

html5lib is a pure-Python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.
The parser produces a tree structure; the current release supports output
to DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
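
As a minimal sketch of the behaviour described above, the following parses a
deliberately malformed fragment into an ElementTree structure. It assumes the
html5lib package is installed for the Python interpreter in use; the helper
name parse_to_etree is illustrative only and not part of the library.

```python
def parse_to_etree(markup):
    """Parse markup with html5lib and return the root <html> element.

    The import is done lazily so that this module can still be loaded
    on a system where html5lib has not been installed yet.
    """
    import html5lib

    # treebuilder="etree" selects the ElementTree output format.
    # namespaceHTMLElements=False keeps tag names plain ("p", not
    # "{http://www.w3.org/1999/xhtml}p"), which makes lookups shorter.
    return html5lib.parse(
        markup,
        treebuilder="etree",
        namespaceHTMLElements=False,
    )
```

Calling parse_to_etree("<p>Hello<b>world") should give a root element whose
tag is "html", with the missing head and body elements filled in and the
unclosed p and b elements nested under body, in line with the error
handling rules mentioned above.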
Required to run: [lang/python27] [lang/py-six] [textproc/py-webencodings]
Master sites:
SHA1: 3a38a57f6e255a59bc8f41cc8163d502a09cc7ee
RMD160: bc1528bd7dfa0813f5a48f9a005ff52687b8994f
Filesize: 139.994 KB
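
The SHA1 value listed above can be checked against a downloaded distfile. The
sketch below uses only the Python standard library; the function names are
illustrative, and the path to the distfile is left to the caller because the
file name is not given on this page.

```python
import hashlib

# SHA1 digest as listed for this package version.
EXPECTED_SHA1 = "3a38a57f6e255a59bc8f41cc8163d502a09cc7ee"


def sha1_of_file(path, chunk_size=65536):
    """Return the hexadecimal SHA1 digest of the file at path."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read in fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_sha1(path, expected=EXPECTED_SHA1):
    """Return True if the file's SHA1 digest equals the expected value."""
    return sha1_of_file(path) == expected.lower()
```

A result of False from matches_listed_sha1 suggests a corrupted or different
file. The RMD160 value is not checked here because RIPEMD-160 support in
hashlib depends on the underlying OpenSSL build.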
Version history:
- (2017-09-29) Package added to pkgsrc.se, version py27-html5lib-0.999999999nb1 (created)