./textproc/icu, Robust and full-featured Unicode services

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 60.2, Package name: icu-60.2, Maintainer: pkgsrc-users

The International Components for Unicode(ICU) is a C and C++ library that
provides robust and full-featured Unicode support on a wide variety of
platforms. The library provides:

- Calendar support
- Character set conversions
- Collation (language-sensitive)
- Date & time formatting
- Locales (140+ supported)
- Message catalogs (resources)
- Message formatting
- Normalization
- Number & currency formatting
- Time zones
- Transliteration
- Word, line & sentence breaks


Required to build:
[pkgtools/cwrappers]

Master sites:

SHA1: 7a099523fcb708b427e94f9b3c372fec3ebd1563
RMD160: e9ef04181123e82d2a846b23caff072e12982b39
Filesize: 22771.323 KB

Version history: (Expand)


CVS history: (Expand)


   2018-01-16 14:28:12 by Jonathan Perkin | Files touched by this commit (2)
Log message:
icu: Revert previous clang patch, clang was changed instead.
   2018-01-12 13:43:08 by Jonathan Perkin | Files touched by this commit (2)
Log message:
icu: Don't perform SunOS _STDC_C99 workaround with clang.
   2017-12-14 22:28:32 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
icu: updated to 60.2

60.2:
New API for direct-UTF-8 normalization.
- It also optionally records changes, for source-to-result index mapping and \ 
tracking of text metadata.
More convenient case mapping API (StringPiece→ByteSink).
ICU now handles ill-formed UTF-8 byte sequences as specified in the W3C Encoding \ 
Standard.

Bug fixes:
CLDR 32.0.1
- Change of some German AM/PM to English strings reverted; will be revisited.
- BGN transliterations restored.
The Script_Extensions property value for 5 CJK characters is wrong.
ICU4J DecimalFormat
- getGroupingSize() returns -1 instead of 0 in ICU60 if grouping is disabled
- setPositivePrefix also changes negative prefix
- unsets maxFrac when minFrac is set on a currency instance
DateFormat
- Urdu Islamic calendar eras
- Narrow format of noon time is used for abbreviated day period pattern letter \ 
'b' and 'bb'
Conversion buffer overflow
Calendar buffer overrun
Windows C++: The header file "stringoptions.h" is not included in the \ 
the pre-built binary .zip file download.
Fix various typos and spelling mistakes.
   2017-11-30 18:54:18 by Maya Rashish | Files touched by this commit (1)
Log message:
icu: bump minimum requirement for library major bump
   2017-11-30 17:03:18 by Adam Ciarcinski | Files touched by this commit (5) | Package updated
Log message:
icu: updated to 60.1

Changes 60.1:
* Unicode 10.0: 8,518 new characters, including four new scripts, 7,494 new Han \ 
characters, and 56 new emoji characters.
  - Properties newly supported in ICU: Emoji_Component, Regional_Indicator, \ 
Prepended_Concatenation_Mark
* CLDR 32:
  - Data for several (mostly Asian) new languages, date formatting patterns \ 
using colloquial day period formats ("h:mm B" → “1:30 in the \ 
afternoon”), and many other data improvements.
  - See the CLDR download page for other CLDR features and migration issues in \ 
CLDR 32.
* NumberFormatter, a new number formatting API: A long-overdue refresh of number \ 
formatting in ICU with a focus on usability, robustness, and performance.  The \ 
30+ settings in DecimalFormat are reduced to 8 in NumberFormatter; all \ 
NumberFormatter objects are thread-safe and immutable; and the code is efficient \ 
in both the client-side (constant locale) and server-side (variable locale) use \ 
cases.
  - New users are encouraged to use the new API for number formatting.  However, \ 
preexisting code can continue using the old API, which has been partially made \ 
into a wrapper over the new API.
  - Documentation: in Java, see com.ibm.icu.number.NumberFormatter, and in C++, \ 
see i18n/unicode/numberformatter.h.
* New options for titlecasing:
  - Sentence titlecasing and whole-string titlecasing without custom \ 
BreakIterator instances.
  - The default index adjustment has been changed from "find first cased \ 
character" to "find first letter, number, or symbol"; a new \ 
option is available for selecting the previous adjustment behavior.
* Smaller data files for BreakIterator.
  - Reverse rules no longer used: Easier updates, easier to conform to Unicode \ 
Standard.
  - Old source rule files continue to work, reverse rules are ignored.
  - Rule-based data files: 1.2MB→0.8MB.

ICU4C Specific Changes
* New API for direct-UTF-8 normalization.
  - It also optionally records changes, for source-to-result index mapping and \ 
tracking of text metadata.
* More convenient case mapping API (StringPiece→ByteSink).
* ICU now handles ill-formed UTF-8 byte sequences as specified in the W3C \ 
Encoding Standard.
   2017-11-16 10:58:26 by Havard Eidnes | Files touched by this commit (3) | Package updated
Log message:
Apply a fix for CVE-2017-14952 from
http://bugs.icu-project.org/trac/change … nemeta.cpp
Bump PKGREVISION.
   2017-10-06 17:20:42 by Maya Rashish | Files touched by this commit (2)
Log message:
icu: include xlocale on all non-netbsd non-linux systems.
netbsd&linux do not have it (glibc had it, but removed in 2.26, and
was satisfied by locale.h always, if their release notes is to be believed)

this should cover BSDs other than netbsd, etc.
   2017-10-06 07:42:01 by Adam Ciarcinski | Files touched by this commit (2)
Log message:
Fix building on Darwin