./textproc/py-Unidecode, ASCII transliterations of Unicode text

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2020Q3, Version: 1.1.1, Package name: py37-Unidecode-1.1.1, Maintainer: pkgsrc-users

It often happens that you have text data in Unicode, but you need
to represent it in ASCII. For example when integrating with legacy
code that doesn't support Unicode, or for ease of entry of non-Roman
names on a US keyboard, or when constructing ASCII machine identifiers
from human-readable Unicode strings that should still be somewhat
intelligeble (a popular example of this is when making an URL slug
from an article title).

Note that this module generally produces better results than simply
stripping accents from characters (which can be done in Python with
built-in functions). It is based on hand-tuned character mappings
that for example also contain ASCII approximations for symbols and
non-Latin alphabets.

This is a Python port of Text::Unidecode Perl module by Sean M.
Burke.


Master sites:

SHA1: 20eea40de171ef9853174ce799dc06e624fb4996
RMD160: 26193ceec061fe94a79d4dcaf8aeb55ce90b53b0
Filesize: 207.179 KB

Version history: (Expand)