./python/py-dateparser, Date parsing library designed to parse dates from HTML pages



Branch: CURRENT, Version: 1.2.0, Package name: py311-dateparser-1.2.0, Maintainer: pkgsrc-users

dateparser provides modules to easily parse localized dates in almost any
string format commonly found on web pages.


Required to run:
[time/py-dateutil] [time/py-pytz] [time/py-tzlocal] [textproc/py-regex] [lang/python310]

Filesize: 300.059 KB

CVS history:


   2023-11-17 21:48:06 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-dateparser: updated to 1.2.0

1.2.0 (2023-11-17)
------------------

New features:

- New ``PREFER_MONTH_OF_YEAR`` setting

Fixes:

- Absolute years in Russian are no longer being treated as a number of years in
  the past

Cleanups and internal improvements:

- Removed the use of ``datetime.utcnow``, deprecated on Python 3.12
- Applied Black formatting to the code base
- Initial integration with OSSFuzz
- Extended test cases
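
The log does not say which replacement dateparser adopted for
``datetime.utcnow``. As a general sketch, the standard-library pattern usually
recommended instead is a timezone-aware call to ``datetime.now``:

```python
from datetime import datetime, timezone

# Timezone-aware alternative to the deprecated datetime.utcnow().
aware_now = datetime.now(timezone.utc)

# If a naive UTC value is still needed (which is what utcnow() returned),
# drop the tzinfo explicitly.
naive_now = aware_now.replace(tzinfo=None)

print(aware_now.isoformat())
```
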
   2023-03-29 14:57:16 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-dateparser: updated to 1.1.8

1.1.8 (2023-03-22)
------------------

Improvements:
- Improved date parsing for Chinese
- Improved date parsing for Czech
- Reorder languages by popularity
- Fix memory leak in cache
- Add support for "\d units later"
- Move modification in CLDR data to yaml
- Add support for using the timezone from settings when computing the PREFER_DATES_FROM result
   2023-01-01 22:39:23 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-dateparser: updated to 1.1.5

1.1.5 (2022-12-29)
------------------

Improvements:

- Parse short versions of day, month, and year
- Add a test for “in 1d”
- Update languages_info
- Add a workaround for zipimporter not having exec_module before Python 3.10
- Stabilize tests at midnight
- Add a test case for French

Cleanups:

- Remove the requirements-build file
   2022-11-22 11:16:01 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-dateparser: updated to 1.1.4

1.1.4 (2022-11-21)
------------------

Improvements:

- Improved support for languages such as Slovak, Indonesian, Hindi, German and
  Japanese
- Recursively create a model home
- Replace regex sub with simple string replace
- Add Python 3.10, 3.11 support
- Drop support for Python 3.5, 3.6 versions
   2022-11-21 11:00:48 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-dateparser: updated to 1.1.3

1.1.3 (2022-11-03)
------------------

New features:

- Add support for fractional units

Improvements:

- Fix the returned datetime skipping a day with time+timezone input and
  PREFER_DATES_FROM = 'future'
- Fix input translation breaking keep_formatting
- English: support "till date"
- English: support “after” and “before” in relative dates

Cleanups:

- Reorganize internal data
- CI updates
   2022-10-25 11:02:38 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-dateparser: updated to 1.1.2

1.1.2 (2022-10-20)
------------------

Improvements:

- Added support for negative timestamp
- Fixed PytzUsageWarning for Python versions >= 3.6
- Added support for dates with dots and spaces
- Improved support for Ukrainian, Croatian and Russian
- Added support for parsing Unix timestamps consistently regardless of timezones
- Improved tests
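
To illustrate what a negative timestamp means and why converting in UTC gives
the same result in any local timezone, here is a standard-library sketch. It is
not dateparser's implementation; ``from_unix_timestamp`` is a hypothetical
helper introduced only for this example.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def from_unix_timestamp(seconds: float) -> datetime:
    """Convert a Unix timestamp (possibly negative) to an aware UTC datetime.

    Adding a timedelta to the epoch avoids depending on the local timezone
    and on platform limits for pre-1970 values.
    """
    return EPOCH + timedelta(seconds=seconds)


# One day before the epoch.
print(from_unix_timestamp(-86400))
```
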
   2022-09-17 23:26:31 by Thomas Klausner | Files touched by this commit (3)
Log message:
py-dateparser: unrestrict py-regex dependency

Bump PKGREVISION.
   2022-07-11 15:31:46 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-dateparser: updated to 1.1.1

1.1.1 (2022-03-17)
------------------

Improvements:

- Fixed issue with regex library by pinning dependencies to an earlier version.
- Extended support for Russian language dates starting with lowercase.
- Allowed ``use_given_order`` to be used for languages too.
- Fixed link to settings section.
- Defined UTF-8 encoding for Windows.
- Fixed directories creation error in CLI utils.

1.1.0 (2021-10-04)
------------------

New features:

* Support language detection based on ``langdetect``, ``fastText``, or a
  custom implementation
* Add support for 'by <time>'
* Sort default language list by internet usage

Improvements:

* Improved support of Chinese, Czech
* Improvements in ``search_dates``
* Make order of previous locales deterministic
* Fix parsing with trailing space
* Consider ``RETURN_TIME_AS_PERIOD`` for timestamp times
* Exclude failing regex version
* Ongoing work on multithreading support
* Add demo URL

QA:

* Migrate pipelines from Travis CI to GitHub Actions
* Use versioned CLDR data
* Add a script to update table of supported languages and locales
* Sort 'skip' keys in yaml files
* Improve test coverage
* Code cleanup

1.0.0 (2020-10-29)
------------------

Breaking changes:

* Drop support for Python 2.7 and pypy
* ``DateDataParser.get_date_data()`` now returns a ``DateData`` object instead
  of a ``dict``
* Invalid ``settings`` are no longer silenced and now raise
  ``SettingValidationError``
* ``dateparser.parse()`` is now deterministic and doesn't try previous locales.
  Also, ``DateDataParser.get_date_data()`` doesn't try the previous locales by
  default
* Remove the ``'base-formats'`` parser
* Extract the ``'no-spaces-time'`` parser from the ``'absolute-time'`` parser
  and make it an optional parser
* Remove ``numeral_translation_data``
* Remove the undocumented ``SKIP_TOKENS_PARSER`` and ``FUZZY`` settings
* Remove support for using strings in ``date_formats``
* The undocumented ``ExactLanguageSearch`` class has been moved to the private
  scope and some internal methods have changed
* Changes in ``dateparser.utils``: ``normalize_unicode()`` doesn't accept
  ``bytes`` as input and ``convert_to_unicode`` has been deprecated

New features:

* Add Python 3.9 support
* Detect hours separated with a period/dot
* Add support for "decade"
* Add support for the hijri calendar in Python ≥ 3.6

Improvements:

* New logo!
* Improve the README and docs
* Fix the "calendars" extra
* Fix leap years when ``PREFER_DATES_FROM`` is set
* Fix ``STRICT_PARSING`` setting in ``no-spaces-time`` parser
* Consider ``RETURN_TIME_AS_PERIOD`` setting for ``relative-time`` parser
* Parse the 24hr time format with meridian info
* Other small improvements