./devel/py-pooch, Manages Python librarys sample data files

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.8.2, Package name: py311-pooch-1.8.2, Maintainer: pkgsrc-users

Does your Python package include sample datasets? Are you shipping
them with the code? Are they getting too big?

Pooch is here to help! It will manage a data registry by downloading
your data files from a server only when needed and storing them
locally in a data cache (a folder on your computer).

Here are Pooch's main features:

* Pure Python and minimal dependencies.
* Download a file only if necessary (it's not in the data cache
or needs to be updated).
* Verify download integrity through SHA256 hashes (also used to
check if a file needs to be updated).
* Designed to be extended: plug in custom download (FTP, scp,
etc) and post-processing (unzip, decompress, rename) functions.
* Includes utilities to unzip/decompress the data upon download
to save loading time.
* Can handle basic HTTP authentication (for servers that require
a login) and printing download progress bars.
* Easily set up an environment variable to overwrite the data
cache location.


Master sites:

Filesize: 57.962 KB

Version history: (Expand)


CVS history: (Expand)


   2024-02-21 13:42:04 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pooch: updated to 1.8.1

v1.8.1

Bug fixes:

Use the ID instead of persistentID for Dataverse downloads since some \ 
repositories don't issue persistentIDs but all issue normal IDs
Ensure all archive members are unpacked in subsequent uses of Untar/Unzip if the \ 
first call only asked for a few members

Documentation:

Move "Projects using Pooch" further up the README
Update the versions of sphinx and its plugins

Maintenance:

Remove many deprecated pylint options
Use Dependabot to manage GitHub Actions
Simplify the test GitHub Actions workflow
Update format for Black 24.1.1
   2023-11-27 18:20:29 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pooch: updated to 1.8.0

v1.8.0

Bug fixes:

Fix bug: add support for old and new Zenodo APIs

New features:

Only create local data directories if necessary
Speed up import time by lazy loading requests

Maintenance:

Add support for Python 3.11
Only run CI cron job for the upstream repository

Documentation:

Add GemGIS to list of projects using Pooch
Fix spelling of Dataverse
Fix grammar on retrieve documentation
   2023-10-28 21:57:26 by Thomas Klausner | Files touched by this commit (516) | Package updated
Log message:
python/wheel.mk: simplify a lot, and switch to 'installer' for installation

This follows the recommended bootstrap method (flit_core, build, installer).

However, installer installs different files than pip, so update PLISTs
for all packages using wheel.mk and bump their PKGREVISIONs.
   2023-03-14 15:10:53 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-pooch: updated to 1.7.0

v1.7.0

Bug fixes:

Make archive extraction always take members into account
Figshare downloaders fetch the correct version, instead of always the latest one.

New features:

Allow spaces in filenames in registry files
Refactor Pooch.is_available to use downloaders
Add support for downloading files from Dataverse DOIs
Add a new Pooch.load_registry_from_doi method that populates the Pooch registry \ 
using DOI-based data repositories
Support urls for Zenodo repositories created through the GitHub integration \ 
service, which include slashes in the filename of the main zip files
Automatically add a trailing slash to base_url on pooch.create

Maintenance:

Drop support for Python 3.6
Port from deprecated appdirs to platformdirs
Update version of Codecov's Action to v3

Documentation:

Update sphinx, theme, and sphinx-panels
Add CITATION.cff for the JOSS article
Use Markdown for the README
Improve docstring of known_hash in retrieve function
Replace link to Pooch's citation with a BibTeX code snippet
   2022-01-31 18:31:23 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-pooch: updated to 1.6.0

v1.6.0

Important:

This is the last release with support for Python 3.6. The next major/minor \ 
release of Pooch will require Python >= 3.7.
Pooch now specifies version bounds for our required dependencies and a plan for \ 
dropping support for older versions. Please revise it if you depend on Pooch.

Enhancements:

Add option to disable updates on hash mismatch
Allow enabling progress bars with an argument in Pooch.fetch and retrieve

Documentation:

Use real data URLs in the README example code
Tell users to import from the top-level namespace
Update the contact link to fatiando.org/contact
Refer the community guides to fatiando/community
Mention in docs that figshare collections aren't supported

Maintenance:

Replace Google Analytics for Plausible to make our docs more privacy-friendly
Use Dependente to capture dependencies on CI
Use build instead of setup.py
Run the tests weekly on GitHub Actions
Set minimum required version of dependencies
Rename "master" to "main" throughout the project
Remove trailing slash from GitHub handle in AUTHORS.md
   2022-01-12 22:35:39 by Thomas Klausner | Files touched by this commit (4) | Package updated
Log message:
devel/py-pooch: import py-pooch-1.5.2

Does your Python package include sample datasets? Are you shipping
them with the code? Are they getting too big?

Pooch is here to help! It will manage a data registry by downloading
your data files from a server only when needed and storing them
locally in a data cache (a folder on your computer).

Here are Pooch's main features:

* Pure Python and minimal dependencies.
* Download a file only if necessary (it's not in the data cache
  or needs to be updated).
* Verify download integrity through SHA256 hashes (also used to
  check if a file needs to be updated).
* Designed to be extended: plug in custom download (FTP, scp,
  etc) and post-processing (unzip, decompress, rename) functions.
* Includes utilities to unzip/decompress the data upon download
  to save loading time.
* Can handle basic HTTP authentication (for servers that require
  a login) and printing download progress bars.
* Easily set up an environment variable to overwrite the data
  cache location.