./www/py-parsel, Library to extract data from HTML and XML using XPath and CSS



Branch: CURRENT, Version: 1.5.1, Package name: py27-parsel-1.5.1, Maintainer: pkgsrc-users

Parsel is a library to extract data from HTML and XML using XPath and CSS
selectors.

Features:
* Extract text using CSS or XPath selectors
* Regular expression helper methods


Required to run:
[devel/py-setuptools] [textproc/py-lxml] [lang/python27] [textproc/py-cssselect] [lang/py-six] [www/py-w3lib]

Required to build:
[pkgtools/cwrappers] [devel/py-test-runner]

Master sites:

SHA1: 1eeb15923381ddf8e8e3e6ccbc4c33a0ac4d080e
RMD160: ba21b4d5c1127dd0bd04da97e512e4862d5bc5be
Filesize: 39.661 KB


CVS history:


   2018-11-15 10:53:33 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-parsel: updated to 1.5.1

1.5.1:
* has-class XPath function handles newlines and other separators
  in class names properly;
* fixed parsing of HTML documents with null bytes;
* documentation improvements;
* Python 3.7 tests are run on CI; other test improvements.
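
The has-class item above concerns class names separated by newlines or other whitespace. As a rough illustration of the matching rule (not parsel's actual implementation), the following stdlib-only sketch splits a class attribute on the whitespace characters HTML allows as separators. The helper name has_class and its signature are hypothetical.

```python
import re

# Space, tab, line feed, form feed and carriage return are the separators
# HTML permits between class names.
HTML_SPACE = re.compile(r"[ \t\n\f\r]+")

def has_class(class_attr, *wanted):
    """Return True if every name in `wanted` is a token of `class_attr`."""
    if class_attr is None:
        return False
    tokens = {tok for tok in HTML_SPACE.split(class_attr) if tok}
    return all(name in tokens for name in wanted)
```

With this rule, has_class("foo\nbar", "bar") matches even though the names are separated by a newline, while has_class("foo-bar", "foo") does not, because "foo-bar" is a single token.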
   2018-07-09 08:17:14 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-parsel: updated to 1.5.0

1.5.0:
* New Selector.attrib and SelectorList.attrib properties which make
  it easier to get attributes of HTML elements.
* CSS selectors became faster: compilation results are cached
  (LRU cache is used for css2xpath), so there is
  less overhead when the same CSS expression is used several times.
* .get() and .getall() selector methods are documented and recommended
  over .extract_first() and .extract().
* Various documentation tweaks and improvements.
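
The caching item in the 1.5.0 notes can be sketched with Python 3's functools.lru_cache. This is only an illustration of the idea under stated assumptions: css_to_xpath is a hypothetical toy translator that handles just "tag" and "tag.class", and the cache size is arbitrary; it is not parsel's or cssselect's code.

```python
from functools import lru_cache

calls = []  # records each uncached translation, for demonstration only

@lru_cache(maxsize=256)
def css_to_xpath(css):
    """Toy translator: handles only 'tag' and 'tag.class' selectors."""
    calls.append(css)
    tag, _, cls = css.partition(".")
    if not cls:
        return "descendant-or-self::%s" % tag
    predicate = "contains(concat(' ', normalize-space(@class), ' '), ' %s ')" % cls
    return "descendant-or-self::%s[%s]" % (tag, predicate)

first = css_to_xpath("div.item")
second = css_to_xpath("div.item")  # same argument: served from the cache
```

The second call does not run the function body again, which is where the reduced overhead for repeated CSS expressions comes from.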
   2018-02-26 09:32:32 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-parsel: updated to 1.4.0

1.4.0:
* Selector and SelectorList can't be pickled because
  pickling/unpickling doesn't work for lxml.html.HtmlElement;
  parsel now raises TypeError explicitly instead of allowing pickle to
  silently produce wrong output. This is technically backwards-incompatible
  if you're using Python < 3.6.
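
One common way to make a class refuse pickling explicitly is to raise TypeError from __getstate__. The sketch below shows that pattern with a hypothetical stand-in class named Node; it is written for Python 3 and is not a copy of parsel's code.

```python
import pickle

class Node:
    """Stand-in for an object wrapping a parsed tree that cannot be pickled."""

    def __init__(self, text):
        self.text = text

    def __getstate__(self):
        # Fail loudly instead of letting pickle serialize a broken state
        raise TypeError("can't pickle %s objects" % type(self).__name__)

def try_pickle(obj):
    """Return 'pickled' on success or a description of the TypeError."""
    try:
        pickle.dumps(obj)
    except TypeError as exc:
        return "TypeError: %s" % exc
    return "pickled"

result = try_pickle(Node("<p>hi</p>"))
```

Callers that previously pickled such objects without noticing the bad output now get an exception at the point of serialization.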
   2017-07-31 00:32:28 by Thomas Klausner | Files touched by this commit (229)
Log message:
Switch github HOMEPAGEs to https.
   2017-05-20 07:40:35 by Adam Ciarcinski | Files touched by this commit (2)
Log message:
Changes 1.2.0:
* Add SelectorList.get() and SelectorList.getall() methods as aliases
  for SelectorList.extract_first() and SelectorList.extract() respectively
* Add default value parameter to the SelectorList.re_first() method
* Add a Selector.re_first() method to the parsel.selector.Selector class
* Bug fix: detect a None result from lxml parsing and fall back to an empty
  document
* Rearrange XML/HTML examples in the selectors usage docs
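
The behaviour of a "first match with a default" helper, as described in the 1.2.0 notes, can be sketched with the standard re module. The function below is a hypothetical standalone version operating on a list of strings; it only approximates the idea and is not the library's implementation.

```python
import re

def re_first(texts, pattern, default=None):
    """Return the first regex match found across `texts`, else `default`."""
    regex = re.compile(pattern)
    for text in texts:
        match = regex.search(text)
        if match:
            # Use the first capture group when the pattern defines one,
            # otherwise the whole match
            return match.group(1) if regex.groups else match.group(0)
    return default
```

For example, re_first(["no digits", "id: 42"], r"id: (\d+)") yields "42", while re_first(["abc"], r"\d+", default="n/a") yields "n/a" instead of None.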
   2017-02-16 00:22:33 by Joerg Sonnenberger | Files touched by this commit (1)
Log message:
Needs py-test-runner.
   2017-02-13 22:21:09 by Adam Ciarcinski | Files touched by this commit (4)
Log message:
Added www/py-parsel version 1.1.0

Parsel is a library to extract data from HTML and XML using XPath and CSS
selectors.

Features:
* Extract text using CSS or XPath selectors
* Regular expression helper methods