./www/py-parsel, Library to extract data from HTML and XML using XPath and CSS

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.5.2, Package name: py37-parsel-1.5.2, Maintainer: pkgsrc-users

Parsel is a library to extract data from HTML and XML using XPath and CSS
selectors.

Features:
* Extract text using CSS or XPath selectors
* Regular expression helper methods


Required to run:
[devel/py-setuptools] [textproc/py-lxml] [textproc/py-cssselect] [lang/py-six] [www/py-w3lib] [lang/python37]

Required to build:
[pkgtools/cwrappers] [devel/py-test-runner]

Master sites:

SHA1: 415b615392502ec5b506ef839f71e7cc10df69f6
RMD160: 5f3ca0605494490c30670b49d9fe0ab0230f2b61
Filesize: 40.17 KB

Version history: (Expand)


CVS history: (Expand)


   2019-08-12 22:04:22 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-parsel: updated to 1.5.2

1.5.2:
* Selector.remove_namespaces received a significant performance improvement
* The value of data within the printable representation of a selector
  (repr(selector)) now ends in ... when truncated, to make the
  truncation obvious.
* Minor documentation improvements.
   2018-11-15 10:53:33 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-parsel: updated to 1.5.1

1.5.1:
* has-class XPath function handles newlines and other separators
  in class names properly;
* fixed parsing of HTML documents with null bytes;
* documentation improvements;
* Python 3.7 tests are run on CI; other test improvements.
   2018-07-09 08:17:14 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-parsel: updated to 1.5.0

1.5.0:
* New Selector.attrib and SelectorList.attrib properties which make
  it easier to get attributes of HTML elements.
* CSS selectors became faster: compilation results are cached
  (LRU cache is used for css2xpath), so there is
  less overhead when the same CSS expression is used several times.
* .get() and .getall() selector methods are documented and recommended
  over .extract_first() and .extract().
* Various documentation tweaks and improvements.
   2018-02-26 09:32:32 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-parsel: updated to 1.4.0

1.4.0:
* Selector and SelectorList can't be pickled because
  pickling/unpickling doesn't work for lxml.html.HtmlElement;
  parsel now raises TypeError explicitly instead of allowing pickle to
  silently produce wrong output. This is technically backwards-incompatible
  if you're using Python < 3.6.
   2017-07-31 00:32:28 by Thomas Klausner | Files touched by this commit (229)
Log message:
Switch github HOMEPAGEs to https.
   2017-05-20 07:40:35 by Adam Ciarcinski | Files touched by this commit (2)
Log message:
Changes 1.2.0:
* Add :meth:`~parsel.selector.SelectorList.get` and \ 
:meth:`~parsel.selector.SelectorList.getall`
  methods as aliases for :meth:`~parsel.selector.SelectorList.extract_first`
  and :meth:`~parsel.selector.SelectorList.extract` respectively
* Add default value parameter to :meth:`~parsel.selector.SelectorList.re_first` \ 
method
* Add :meth:`~parsel.selector.Selector.re_first` method to \ 
:class:`parsel.selector.Selector` class
* Bug fix: detect ``None`` result from lxml parsing and fallback with an empty \ 
document
* Rearrange XML/HTML examples in the selectors usage docs
   2017-02-16 00:22:33 by Joerg Sonnenberger | Files touched by this commit (1)
Log message:
Needs py-test-runner.
   2017-02-13 22:21:09 by Adam Ciarcinski | Files touched by this commit (4)
Log message:
Added www/py-parsel version 1.1.0

Parsel is a library to extract data from HTML and XML using XPath and CSS
selectors.

Features:
* Extract text using CSS or XPath selectors
* Regular expression helper methods