./textproc/py-html-sanitizer, White-list based HTML sanitizer

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2019Q4, Version: 1.6.4, Package name: py37-html-sanitizer-1.6.4, Maintainer: joerg

html-sanitizer is a whitelist-based and very opinionated HTML sanitizer
that can be used both for untrusted and trusted sources. It attempts to
clean up the mess made by various rich text editors and or copy-pasting
to make styling of webpages simpler and more consistent. It builds on the
excellent HTML cleaner in lxml to make the result both valid and safe.

It goes further than pure tag filtering by transforming the HTML
fragments to normalize formatting and drop redundant or pointless tags.


Master sites:

SHA1: 2635f2c2a1f64f752bc8d7f496f9612d1e3828c9
RMD160: d08e91d1d752f88e2571b617ea60d092da8ece6a
Filesize: 13.559 KB

Version history: (Expand)