./textproc/py-itemloaders, Python library that helps to collect data from HTML and XML sources



Branch: CURRENT, Version: 1.3.2, Package name: py312-itemloaders-1.3.2, Maintainer: pkgsrc-users

itemloaders is a library that helps you collect data from HTML and XML sources.

It comes in handy for extracting data from web pages, as it supports data
extraction using CSS and XPath selectors.

It's especially useful when you need to standardize the data from many sources.
For example, it allows you to have all your casting and parsing rules in a
single place.


Required to run:
[www/py-parsel] [www/py-w3lib] [textproc/py-jmespath] [textproc/py-itemadapter] [lang/python310]

Filesize: 19.245 KB

CVS history:


   2024-04-27 07:05:33 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-itemloaders: updated to 1.2.0

1.2.0

Added official support for Python 3.12 and PyPy 3.10
Removed official support for Python 3.7
Improved performance of itemloaders.utils.arg_to_iter
Fixed test expectations on recent Python versions
Improved CI
   2023-04-27 11:07:55 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-itemloaders: updated to 1.1.0

1.1.0

Added JMESPath support (ItemLoader.add_jmes etc.), requiring Parsel 1.8.1+
Added official support for Python 3.11
Removed official support for Python 3.6
Internal code cleanup
Added pre-commit support and applied changes from black and flake8
Improved CI
   2022-12-12 15:18:55 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-itemloaders: updated to 1.0.6

1.0.6
Fixes a regression introduced in 1.0.5 that would cause the re parameter of
ItemLoader.add_xpath and similar methods to be passed to lxml, which would
trigger an exception when the value of re was a compiled pattern and not a
string

1.0.5
Allow additional args to be passed when calling ItemLoader.add_xpath
Fixed missing space in an exception message
Updated company name in author and copyright sections
Added official support for Python 3.9 and improved PyPy compatibility
Added official support for Python 3.10
   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
   2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)
Log message:
textproc: Remove SHA1 hashes for distfiles
   2021-03-22 07:14:02 by Adam Ciarcinski | Files touched by this commit (1)
Log message:
py-itemloaders: fix typos
   2021-03-21 22:13:00 by Frédéric Fauberteau | Files touched by this commit (4)
Log message:
py-itemloaders: Added version 1.0.4

textproc/py-itemloaders is a dependency of www/py-scrapy.