2017-02-13 22:19:51 by Adam Ciarcinski | Files touched by this commit (4)
Log message:
This is a Python library of web-related functions, such as:
* remove comments or tags from HTML snippets
* extract the base URL from HTML snippets
* translate entities in HTML strings
* convert raw HTTP headers to dicts and vice versa
* construct an HTTP auth header
* convert HTML pages to unicode
* sanitize URLs (like browsers do)
* extract arguments from URLs
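
To give a feel for the kind of helper listed in the log message, here is a minimal sketch of three of them (auth header construction, raw header parsing, URL argument extraction) using only the Python standard library. It is not the packaged library's API: the function names `basic_auth_header`, `raw_headers_to_dict` and `url_argument` are hypothetical and chosen for illustration only.

```python
import base64
from urllib.parse import parse_qs, urlsplit


def basic_auth_header(username, password):
    """Return the value of an HTTP Basic Authorization header (hypothetical helper)."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8"))
    return "Basic " + token.decode("ascii")


def raw_headers_to_dict(raw):
    """Parse a raw header block into {name: [values]} (hypothetical helper).

    Lines without a colon (blank lines, status lines) are skipped.
    """
    headers = {}
    for line in raw.splitlines():
        if ":" not in line:
            continue
        name, value = line.split(":", 1)
        headers.setdefault(name.strip(), []).append(value.strip())
    return headers


def url_argument(url, name, default=None):
    """Return the first value of a query-string argument (hypothetical helper)."""
    values = parse_qs(urlsplit(url).query).get(name)
    return values[0] if values else default


if __name__ == "__main__":
    print(basic_auth_header("user", "pass"))
    print(raw_headers_to_dict("Content-Type: text/html\r\nAccept: gzip\r\n"))
    print(url_argument("http://example.com/?id=200&foo=bar", "id"))
```

Header values are kept in lists because the same header name may appear more than once in a raw block.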