./converters/py-chardet, Character encoding auto-detection in Python

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]

Branch: CURRENT, Version: 3.0.3, Package name: py27-chardet-3.0.3, Maintainer: bartosz.kuzma

Character encoding auto-detection in Python.

Required to run:
[devel/py-setuptools] [lang/python27]

Required to build:
[devel/py-py] [devel/py-test] [pkgtools/cwrappers] [devel/py-test-runner] [devel/py-hypothesis]

Master sites:

SHA1: e8e6fb92fbd01471c14a96d4eb75de41fa98e1b8
RMD160: 311de14616c2e9d6ce475d79990b57ceb13ea479
Filesize: 1823.249 KB

Version history: (Expand)

CVS history: (Expand)

   2017-05-17 09:09:53 by Adam Ciarcinski | Files touched by this commit (2)
Log message:
Changes 3.0.3:
This release fixes a crash when debugging logging was enabled.
   2017-04-23 18:08:02 by Thomas Klausner | Files touched by this commit (1)
Log message:
Add missing unused test dependency.

See also https://github.com/chardet/chardet/issues/120
   2017-04-19 19:24:16 by Thomas Klausner | Files touched by this commit (3) | Package updated
Log message:
Updated py-chardet to 3.0.2.

chardet 3.0.2

Fixes an issue where detect would sometimes return None instead of a dict with \ 
the keys encoding, language, and confidence (Issue #113, PR #114).

chardet 3.0.1

This bugfix release fixes a crash in the EUC-TW prober when it encountered \ 
certain strings (Issue #67).

chardet 3.0.0

This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:

    Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
    Commented out large unused sections of Big5 and EUC-KR tables to save memory \ 
    Removed Python 3.2 from testing, but add 3.4 - 3.6
    Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks \ 
    Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
    Made naming conventions more Pythonic throughout (no more \ 
mTypicalPositiveRatio, and instead typical_positive_ratio)
    Modernized test scripts and infrastructure so we've got Travis testing and \ 
all that stuff
    Rename filter_without_english_words to filter_international_words and make \ 
it match current Mozilla implementation (PR #44, thanks @rsnair2)
    Updated filter_english_letters to match C implementation (c665459)
    Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because \ 
it is very inaccurate (da6c0a0)
    Allow CLI sub-package to be importable (PR #55)
    Add a hypotheis-based test (PR #66, thanks @DRMacIver)
    Strip endianness from UTF with BOM predictions so that the encoding can be \ 
passed directly to bytes.decode() (PR #73, thanks @snoack)
    Fixed broken links in docs (PR #90, thanks @roskakori)
    Added early exit to chardetect when encoding is detected instead of looping \ 
through entire file (PR #103, thanks @jpz)
    Use bytearray objects internally instead of wrap_ord calls, which provides a \ 
nice performance boost across the board (PR #106)
    Add language property to probers and UniversalDetector results (PR #180)
    Mark the 5 known test failures as such so we can have more useful Travis \ 
build results in the meantime (d588407)
   2017-01-03 14:23:05 by Jonathan Perkin | Files touched by this commit (52)
Log message:
Use "${MV} || ${TRUE}" and "${RM} -f" consistently in \ 
post-install targets.
   2016-08-28 17:48:37 by Thomas Klausner | Files touched by this commit (112)
Log message:
Remove unnecessary PLIST_SUBST and FILES_SUBST that are now provided
by the infrastructure.

Mark a couple more packages as not ready for python-3.x.
   2016-06-08 19:43:49 by Thomas Klausner | Files touched by this commit (356)
Log message:
   2016-02-05 13:40:56 by Thomas Klausner | Files touched by this commit (3) | Package updated
Log message:
Fix self-conflict between different python versions of this package.

   2015-11-03 02:43:56 by Alistair G. Crooks | Files touched by this commit (120)
Log message:
Add SHA512 digests for distfiles for converters category

Problems found with existing distfile:
No changes made to the libiconv distinfo file.

Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden).  All existing
SHA1 digests retained for now as an audit trail.