- Apr 04, 2019
Frank Tang authored
What's new in ICU 64.1:
- Unicode 12: 554 new characters, including 4 new scripts and 61 new
  emoji characters.
- CLDR 35 locale data
  http://blog.unicode.org/2019/03/unicode-cldr-version-35-languagelocale.html
- ICU 64 now uses "rearguard" TZ data. (Recent versions have used
  "vanguard" data with certain overrides.) (ICU-20398)
- ICU data filtering: The ICU4C build accepts an optional filter script
  that specifies a subset of the data to be built, with whitelists and
  blacklists for locales and for resource bundle paths.
  (ICU-10923, design doc)
- MessageFormat has new pattern syntax for specifying the style of a
  date/time argument via a locale-independent skeleton rather than a
  locale-specific pattern. (ICU-9622)
  * Date/time skeletons use the same "::" prefix as number skeletons.
  * Example MessageFormat pattern string:
    "We close on {closing,date,::MMMMd} at {closing,time,::jm}."
- Many formatting APIs can now output a new type of result object which
  is-a FormattedValue (Java & C++), or convertible to a
  UFormattedValue (C).
  * These combine the result strings with easy iteration over
    FieldPosition metadata.
- New C++ class LocaleBuilder for building a Locale from subtags,
  keywords, and extensions. (ICU-20328)
  Parallel to the existing ICU4J ULocale.Builder class.
- For C++ MeasureUnit instances, there are now additional factory
  methods that return units by value, not by pointer-with-ownership.
  (ICU-20337)
- Various Out-Of-Memory (OOM) issues have been fixed. (ticket query)
- See http://site.icu-project.org/download/64 for more details.

The update steps are recorded in:
https://chromium.googlesource.com/chromium/deps/icu/+log/20690c6..6d422ff

- Update update.sh to point to ICU's new repo location
- Import the pristine copy of ICU 64.1 and update BUILD files with
  update.sh
- Update and apply locale data patches

1. patches/locale_google.patch:
   * Google's internal ICU locale changes
   * Simpler region names for Hong Kong and Macau in all locales
   * Currency signs in ru and uk locales (do not include 'tr' locale
     changes)
   * AM/PM, midnight, noon formatting for a few Indian locales
   * Timezone name changes in Korean and Chinese locales
   * Default digit for Arabic locale is European digits.
   - patches/locale1.patch: Minor fixes for Korean

2. Breakiterator patches
   - patches/wordbrk.patch for word.txt
     a. Move full stops (U+002E, U+FF0E) from MidNumLet to MidNum so
        that FQDN labels can be split at '.'
     b. Move fullwidth digits (U+FF10 - U+FF19) from Ideographic to
        Numeric.
        See http://unicode.org/cldr/trac/ticket/6555
   - patches/khmer-dictbe.patch
     Adjust parameters to use a smaller Khmer dictionary
     (khmerdict.txt).
     https://unicode-org.atlassian.net/browse/ICU-9451
   - Add several common Chinese words that were dropped previously to
     source/data/cjdict/brkitr/cjdict.txt
     patch: patches/cjdict.patch
     upstream bug: https://unicode-org.atlassian.net/browse/ICU-10888

3. Build-related changes
   - patches/configure.patch:
     * Remove a section of configure that will cause breakage while
       running runConfigureICU.
   - patches/wpo.patch (only needed when icudata dll is used).
     upstream bugs:
       https://unicode-org.atlassian.net/browse/ICU-8043
       https://unicode-org.atlassian.net/browse/ICU-5701
   - patches/data_symb.patch:
     Put ICU_DATA_ENTRY_POINT(icudtXX_dat) in common when we use the
     icu data file or icudt.dll
   - patches/staticmutex.patch:
     Change the static UMutex code to avoid static_initializers error.
     upstream bug: https://unicode-org.atlassian.net/browse/ICU-20520
   - patches/buildtool.patch:
     Fix the build tool which omitted res_index.res */res_index.res
     files
     upstream bug: https://unicode-org.atlassian.net/browse/ICU-20529
     upstream PR: https://github.com/unicode-org/icu/pull/571/

4. Double conversion library build failure
   - patches/double_conversion.patch
   - upstream bugs:
       https://unicode-org.atlassian.net/browse/ICU-13750
       https://github.com/google/double-conversion/issues/66

5. ISO-2022-JP encoding (fromUnicode) change per WHATWG encoding spec.
   - patches/iso2022jp.patch
   - upstream bug: https://unicode-org.atlassian.net/browse/ICU-20251

- ICU data files are rebuilt
  Up to 67kB increase. Since we also save 43kB in
  https://chromium-review.googlesource.com/c/v8/v8/+/1478710 ,
  the net increase is only 24kB.

** ICU Data Size Change **
Data Size   ICU63      ICU64-1    DIFF
chromeos    10326064   10378624    52560
common      10326064   10394816    68752
cast         5126144    5101616   -24528
android      6355520    6406256    50736
ios          6315248    6372016    56768
flutter       880928     894752    13824

Created by:
  git rev-list --reverse 20690c62..6d422ffa | \
    xargs git cherry-pick --strategy=recursive -X theirs

Bug: chromium:943348
Change-Id: Ia7f86abfa8625dd24aae2f71456abd679fda3dae
Reviewed-on: https://chromium-review.googlesource.com/c/chromium/deps/icu/+/1552155
Reviewed-by:
Jungshik Shin <jshin@chromium.org>
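
The effect of item (a) of patches/wordbrk.patch in the ICU 64.1 entry above can be illustrated with a toy segmenter. This is only a minimal sketch, assuming a heavily simplified subset of the UAX #29 word-boundary rules (WB5 to WB12, with no Extend/Format handling); `char_class` and `split_words` are hypothetical helper names, and the character classes are approximations rather than the contents of ICU's word.txt.

```python
# Toy word segmenter: a simplified subset of the UAX #29 word rules.
# All names and class assignments here are illustrative assumptions.

LETTER_JOINERS = {"MidLetter", "MidNumLet"}  # may sit between two letters
DIGIT_JOINERS = {"MidNum", "MidNumLet"}      # may sit between two digits


def char_class(ch, full_stop_class):
    """Approximate word-break class of a single character."""
    if ch == ".":
        return full_stop_class  # "MidNumLet" (default) or "MidNum" (patched)
    if ch.isdigit():
        return "Numeric"
    if ch.isalpha():
        return "ALetter"
    return "Other"


def split_words(text, full_stop_class="MidNumLet"):
    """Split text into segments using the simplified rules."""
    cls = [char_class(c, full_stop_class) for c in text]

    def at(i):
        return cls[i] if 0 <= i < len(cls) else None

    def joined(i):
        """True if there is no boundary between text[i-1] and text[i]."""
        a, b = at(i - 1), at(i)
        if a in ("ALetter", "Numeric") and b in ("ALetter", "Numeric"):
            return True  # WB5, WB8, WB9, WB10
        if a == "ALetter" and b in LETTER_JOINERS and at(i + 1) == "ALetter":
            return True  # WB6
        if a in LETTER_JOINERS and b == "ALetter" and at(i - 2) == "ALetter":
            return True  # WB7
        if a == "Numeric" and b in DIGIT_JOINERS and at(i + 1) == "Numeric":
            return True  # WB12
        if a in DIGIT_JOINERS and b == "Numeric" and at(i - 2) == "Numeric":
            return True  # WB11
        return False

    words, start = [], 0
    for i in range(1, len(text)):
        if not joined(i):
            words.append(text[start:i])
            start = i
    if text:
        words.append(text[start:])
    return words


print(split_words("www.example.com", "MidNumLet"))
print(split_words("www.example.com", "MidNum"))
print(split_words("3.14", "MidNum"))
```

With the full stop classed as MidNumLet the host name stays in one segment; classed as MidNum it splits at each '.', while a decimal number such as "3.14" still stays together.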
-
- May 08, 2018
Jungshik Shin authored
What's new in ICU 61.1:
- CLDR 33 (locale data; http://cldr.unicode.org/index/downloads/cldr-33 )
- A new number formatting API
- Rewrote U8_NEXT
- See http://site.icu-project.org/download/61 for more details.

The update steps are recorded in
https://chromium.googlesource.com/chromium/deps/icu/+log/f7ab784..b0e0b84

On top of the above, IANA timezone db was updated once more to the
latest 2018e (released on May 4th).

The local changes are:

* Update the currency list. For desktop, include all the national
  currencies (see https://bugs.chromium.org/p/v8/issues/detail?id=7472 )
* Clean up disabled warning list in BUILD.gn/icu.gyp
* Fix clang on Windows and MSVC build failures
  https://github.com/google/double-conversion/issues/66
  http://bugs.icu-project.org/trac/ticket/13752
  http://bugs.icu-project.org/trac/ticket/13750
* Rebuild the data files
  Size is reduced by ~100kB for non-desktop.
  Desktop (common): increase by 90kB
     6,386,176 android/icudtl.dat (6,502,048)
     4,929,248 cast/icudtl.dat (5,032,368)
    10,264,208 common/icudtl.dat (10,171,920)
     6,372,480 ios/icudtl.dat (6,488,384)
* Update IANA tz DB to 2018e
* Update README.chromium
* Drop patches that became unnecessary.
* Apply the remaining patches
  Isvalidenum.patch, simplify_smallintformatter.patch, word_long.patch,
  locid_map.patch
* Apply data-build related patches
  {wpo,vscomp,data.build,data.build.win,data_symb}.patch
* Apply BreakIterator related patches + locale1 patch
* Apply Google's patch for locale data
* ICU 61.1 update step 1
  Check in ICU 61.1 to the tree (61staging branch) by running
  scripts/update.sh. Build files are updated by the script.
* Update prep

TBR=thakis@chromium.org
Bug: 838383
Test: None
Change-Id: Ib83ed624fe764abdb7fbfd567ae5e982497089bd
Reviewed-on: https://chromium-review.googlesource.com/1041578
Reviewed-by:
Jungshik Shin <jshin@chromium.org>
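
The size figures quoted in the ICU 61.1 entry above can be checked with a few lines of arithmetic. This is a small sketch, assuming the value in parentheses after each file is the size before the rebuild (that reading matches the "~100kB reduced" and "90kB increase" statements); `SIZES` and `size_deltas` are hypothetical names introduced only for this check.

```python
# icudtl.dat sizes in bytes, copied from the ICU 61.1 commit message.
# Assumption: each pair is (new size, previous size in parentheses).
SIZES = {
    "android": (6_386_176, 6_502_048),
    "cast": (4_929_248, 5_032_368),
    "common": (10_264_208, 10_171_920),
    "ios": (6_372_480, 6_488_384),
}


def size_deltas(sizes):
    """Return new-minus-old byte deltas per platform."""
    return {name: new - old for name, (new, old) in sizes.items()}


deltas = size_deltas(SIZES)
for name, delta in deltas.items():
    # Show the delta in bytes and in units of 1024 bytes.
    print(f"{name:8s} {delta:+10,d} bytes ({delta / 1024:+.1f} KiB)")
```

Under that assumption the non-desktop files shrink by roughly 100 to 113 KiB each and common grows by about 90 KiB, in line with the figures in the message.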
-
- Nov 07, 2017
Jungshik Shin authored
What's new in ICU 60.1:
- Unicode 10.0 (including Emoji 5.0)
- CLDR 32 (locale data; http://cldr.unicode.org/index/downloads/cldr-32 )
- Smaller rule file size for BreakIterator (no more reverse rules)
- A new number formatting API
- Malformed UTF-8 sequences are handled per W3C Encoding Standard
- See http://site.icu-project.org/download/60 for more details.

In addition, the following changes are made:

* Add more locale variants for en,fr,de,it,pt,nl,ru,ar
* Add the minimum locale data for Osage and Chakma
* Apply locale data patches (Google and Chromium)
* Apply build-related and break iterator patches:
    wpo.patch
    vscomp.patch
    data.build.win.patch
    data_symb.patch
    cjdict.patch
    khmer-dictbe.patch
    wordbrk.patch
    patches/data.build.patch
* Update BUILD.gn and gyp files.
* Data file size (25 ~ 130kB smaller than the latest 59.1)
    android/icudtl.dat   6499904
    common/icudtl.dat   10170896
    ios/icudtl.dat       6486240

The update steps are recorded in
https://chromium.googlesource.com/chromium/deps/icu/+log/f88520f..1034c7e5 .

Bug: 766816
Test: None (DEPS roll CL will have test specified).
Change-Id: I9144bb37464eec1f282f40f9f2d28b6306511083
Reviewed-on: https://chromium-review.googlesource.com/756237
Reviewed-by:
Mark Mentovai <mark@chromium.org>
-
- May 14, 2017
Jungshik Shin authored
* Highlights:
  - Emoji 5.0 data (partial; Emoji_Component property not included)
  - CLDR 31.0.1
    (http://blog.unicode.org/2017/03/cldr-version-31-released.html ;
    UTC and GMT are treated as distinct)
  - New case mapping API for styled text
  - C++ 11 is required
  - char16_t for UChar (UTF-16)
  - Source code is in UTF-8

* Size changes
    common:  10,130,560 => 10,175,056
    android:  6,573,872 =>  6,616,864
    iOS:      6,562,352 =>  6,605,152

On top of ICU 59.1 from the upstream, the following changes were
applied. See
https://chromium.googlesource.com/chromium/deps/icu/+log/chromium/59staging

- Fix C++ 11 string literal assignment issue (upstream bug: 13192)
- Fix C4229 warning by MSVC
- Apply utf32.patch and include unistr.h in fuzzer_util
- Update ICU data files
- Fix wpo.patch
- Apply Google locale patch and locale1.patch
- update readme
- Apply breakiterator related patches
- Apply and update wpo.patch
- Drop unused patch, apply data.build.win.patch, update README.chromium
- Add /utf-8 flag for Windows/Visual Studio
- Update BUILD.gn for UChar, stubdata and apply data_sym.patch
- use stubdata.cpp instead of stubdata.c in icu.gyp
- Update icu.gyp* files for v8
- Update BUILD.gn, apply data.build.patch and vscomp.patch
- Add new files in ICU 59.1
- Get a fresh copy of ICU 59.1 from the upstream
- Update update.sh script

TBR=drott@chromium.org, yangguo@chromium.org
Bug: 699469
TEST: layout tests, all unittests, browser tests
Change-Id: Ie1e77323aa0c7f872153680c4deca6471a771a5c
Reviewed-on: https://chromium-review.googlesource.com/505173
Reviewed-by:
Jungshik Shin <jshin@chromium.org>
-
- Oct 23, 2016
Jungshik Shin authored
* Note that this CL will be followed by CLs with local changes.
  Until then, ICU should not be rolled in DEPS. See READ_THIS_FIRST
  for details.
* Adjust scripts/update.sh and scripts/data_files_to_preserve.txt
  - CLDR/ICU added ckb/ast locale data. Drop them from the list to
    preserve.
  - source/layout does not exist in 58.1 any more.
* Update the tree to ICU 58.1 from the upstream by running
  scripts/update.sh
* Update README.chromium and add READ_THIS_FIRST to warn about the
  status of the tree.

BUG=637001
TEST=None
-
- Feb 08, 2016
Jungshik Shin authored
Update the copyright years in LICENSE (taken from license.html) and
IANA timezone DB version in README.chromium.

Incorporate the format fixes made by
Mostyn Bramley-Moore <mostynb@opera.com> in
https://codereview.chromium.org/1530343003/

BUG=575007
TEST=None
TBR=mostynb@opera.com

Review URL: https://codereview.chromium.org/1678243002 .
-
- Jan 08, 2015
Jungshik Shin (jungshik at google) authored
1. Replace 'src/' with a fresh copy of ICU 54.1 from the upstream.
2. Remove unnecessary files/directories
3. Update the source/header file lists in icu.gypi and icu.gyp
4. Update README.chromium to warn that the ICU head is not usable
   for building Chromium/Blink/v8 yet.

BUG=428145
TEST=None

Review URL: https://codereview.chromium.org/845603002
-
- Apr 18, 2014
jshin@chromium.org authored
1. {big5,gb2312}han collation data is not used by anybody because it
   is useless as a sorting order. Add a function to trim_data.sh to
   remove it from zh.txt
2. Remove remove_unihan.sh and add back unihan rules to
   coll/{zh,ja,ko}.txt. In ICU 52, tools/genrb does NOT include unihan
   collation by default so that we don't have to bother to remove it
   from the rule files.
3. Remove obsolete patch files (locale[23].patch)
4. Add LICENSE file (converted from license.html)
5. Update README.chromium accordingly.
6. Check in the updated data file/assembly files.

The net saving in icudtl.dat is ~ 220kB.

BUG=132145
TEST=icudtl.dat is 10576480
TBR=mark

Review URL: https://codereview.chromium.org/243763002

git-svn-id: http://src.chromium.org/svn/trunk/deps/third_party/icu52@264857 4ff67af0-8c30-449e-8e8b-ad334ec8d88c
-