Path to this page:
Subject: CVS commit: pkgsrc/textproc/icu
From: Marc Recht
Date: 2007-03-23 13:51:13
Message id: 20070323125113.C2E8A2150A@cvs.netbsd.org
Log Message:
update to ICU 3.6
Major changes in ICU 3.6 include the following:
- Unicode: ICU uses and supports Unicode 5.0, which is the latest major release \
of Unicode. Unicode 5.0 will be used in many operating systems and applications, \
and this version of ICU is important maintain interoperability with these new \
operating systems and applications. More information about Unicode 5.0 can be \
found in the Unicode press release.
- Locale Data: ICU uses and supports data from Common Locale Data Repository \
(CLDR) 1.4, which includes many improvements in quality and quantity of data. \
There is 25% more CLDR locale data in 245 locales in ICU.
- ICU4C Specific Changes
- Charset Detection: A charset detection framework was added, which provides \
heuristics for detecting the charset for unlabeled sequences of bytes.
- Layout: The font layout engine has support added for Tibetan, Sinhala and Old \
Hangul.
- BiDi: The BiDi algorithm was enhanced to be more flexible and efficient
- ICU Data Management: The new icupkg tool provides an easier way to manage \
ICU's data library. This tool allows you to add, update or remove data from \
ICU's data archive.
- Time Zones The time zone data is modularized to allow easier building and \
updating of the data.
- Word Boundaries: The Thai word break iteration was improved to be more \
accurate. Also dictionary based detection of Thai word boundaries is now active \
for all locales.
- UText
- The BreakIterator uses UText for abstract text processing.
- 64-bit indexing is now used to allow access to larger chunks of text.
- API for read-only locking for security and robustness was added.
- Performance
- The u_sprintf/u_sscanf performance from the icuio library has been improved \
for number formatting/parsing.
- Constructing a DateFormat is significantly faster for many locales.
- Opening and closing a charset converter is significantly faster.
- The UTF-8 transformation functions and macros are faster.
- The UText API was improved for performance.
- The collation open and close functions have a small performance improvement.
Files: