./textproc/py-itemloaders, Python library that helps to collect data from HTML and XML sources

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.3.2, Package name: py312-itemloaders-1.3.2, Maintainer: pkgsrc-users

itemloaders is a library that helps you collect data from HTML and XML sources.

It comes in handy to extract data from web pages, as it supports data
extraction using CSS and XPath Selectors.

It's specially useful when you need to standardize the data from many sources.
For example, it allows you to have all your casting and parsing rules in a
single place.


Required to run:
[www/py-parsel] [www/py-w3lib] [textproc/py-jmespath] [textproc/py-itemadapter] [lang/python310]

Master sites:

Filesize: 19.245 KB

Version history: (Expand)


CVS history: (Expand)


   2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)
Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays
   2024-10-01 09:55:14 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-itemloaders: updated to 1.3.2

1.3.2

Added official support for the upcoming Python 3.13
Removed w3lib from direct dependencies (it's not used directly but is an \ 
indirect dependency via parsel)
Improved CI
   2024-06-04 12:35:29 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-itemloaders: updated to 1.3.1

1.3.1

Fixed an error when using nested loaders with empty matches that was introduced \ 
in 1.3.0

1.3.0

Added support for method chaining to the add_* and replace_* methods
Added type hints and py.typed
Made the docs builds reproducible
   2024-04-27 07:05:33 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-itemloaders: updated to 1.2.0

1.2.0

Added official support for Python 3.12 and PyPy 3.10
Removed official support for Python 3.7
Improved performance of itemloaders.utils.arg_to_iter
Fixed test expectations on recent Python versions
Improved CI
   2023-04-27 11:07:55 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-itemloaders: updated to 1.1.0

1.1.0

Added JMESPath support (ItemLoader.add_jmes etc.), requiring Parsel 1.8.1+
Added official support for Python 3.11
Removed official support for Python 3.6
Internal code cleanup
Added pre-commit support and applied changes from black and flake8
Improved CI
   2022-12-12 15:18:55 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-itemloaders: updated to 1.0.6

1.0.6
Fixes a regression introduced in 1.0.5 that would cause the re parameter of \ 
ItemLoader.add_xpath and similar methods to be passed to lxml, which would \ 
trigger an exception when the value of re was a compiled pattern and not a \ 
string

1.0.5
Allow additional args to be passed when calling ItemLoader.add_xpath
Fixed missing space in an exception message
Updated company name in author and copyright sections
Added official support for Python 3.9 and improved PyPy compatibility
Added official support for Python 3.10
   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip