./www/py-w3lib, Python library of web-related functions

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.21.0, Package name: py37-w3lib-1.21.0, Maintainer: pkgsrc-users

This is a Python library of web-related functions, such as:
* remove comments, or tags from HTML snippets
* extract base url from HTML snippets
* translate entites on HTML strings
* convert raw HTTP headers to dicts and vice-versa
* construct HTTP auth header
* converting HTML pages to unicode
* sanitize urls (like browsers do)
* extract arguments from urls


Required to run:
[devel/py-setuptools] [lang/py-six] [lang/python37]

Required to build:
[pkgtools/cwrappers]

Master sites:

SHA1: a6ea609a660722ebac52012a09631c8889308481
RMD160: e9807d80e3d07caf226257d96f017b6325c1fd6b
Filesize: 37.259 KB

Version history: (Expand)


CVS history: (Expand)


   2019-08-12 22:03:01 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-w3lib: updated to 1.21.0

1.21.0:
- Add the encoding and path_encoding parameters to
  :func:w3lib.url.safe_download_url
- :func:w3lib.url.safe_url_string now also removes tabs and new lines
- :func:w3lib.html.remove_comments now also removes truncated comments
- :func:w3lib.html.remove_tags_with_content no longer removes tags which
  start with the same text as one of the specified tags
- Recommend pytest instead of nose to run tests
   2019-01-16 00:05:37 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-w3lib: updated to 1.20.0

1.20.0:
- Fix url_query_cleaner to do not append "?" to urls without a query string
- Add support for Python 3.7 and drop Python 3.3
- Add w3lib.url.add_or_replace_parameters helper
- Documentation fixes
   2018-01-26 09:06:07 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-w3lib: updated to 1.19.0

1.19.0:
- Add a workaround for CPython segfault (https://bugs.python.org/issue32583)
  which affect w3lib.encoding functions. This is technically **backwards
  incompatible** because it changes the way non-decodable bytes are replaced
  (in some cases instead of two ``\ufffd`` chars you can get one).
  As a side effect, the fix speeds up decoding in Python 3.4+.
- Add 'encoding' parameter for w3lib.http.basic_auth_header.
- Fix pypy testing setup, add pypy3 to CI.
   2017-09-30 15:27:14 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-w3lib: update to 1.18.0

1.18.0:
- Include additional assets used for distribution packages in the source tarball
- Consider ``[`` and ``]`` as safe characters in path and query components
  of URLs, i.e. they are not escaped anymore
- Disable codecov project coverage check
   2017-07-31 00:32:28 by Thomas Klausner | Files touched by this commit (229)
Log message:
Switch github HOMEPAGEs to https.
   2017-02-13 22:19:51 by Adam Ciarcinski | Files touched by this commit (4)
Log message:
This is a Python library of web-related functions, such as:
* remove comments, or tags from HTML snippets
* extract base url from HTML snippets
* translate entites on HTML strings
* convert raw HTTP headers to dicts and vice-versa
* construct HTTP auth header
* converting HTML pages to unicode
* sanitize urls (like browsers do)
* extract arguments from urls