Friday, October 25, 2024
ICU 76 Released
ICU 76 updates to Unicode 16 (blog), including new characters and scripts, emoji, collation & IDNA changes, and corresponding APIs and implementations. It also updates to CLDR 46 (beta blog) locale data with new locales, significant updates to existing locales, and various additions and corrections. For example, the CLDR and Unicode default sort orders are now very nearly the same.
Most of the java.time (Temporal) types can now be formatted directly using the existing ICU4J date/time formatting classes.
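For context, here is a minimal sketch of the conversion step that callers have commonly written by hand when a formatter only accepts the legacy java.util.Date type. It uses only the standard Java library. The class name TemporalToLegacyDate, its helper methods, and the start-of-day policy for LocalDate are assumptions made for illustration. ICU 76's own entry points are not shown, since the announcement does not name them.

```java
import java.time.LocalDate;
import java.time.ZoneId;
import java.time.ZonedDateTime;
import java.util.Date;

// Hypothetical helper, not part of ICU4J: bridges java.time values to java.util.Date.
public class TemporalToLegacyDate {

    // A ZonedDateTime identifies an exact instant, so the conversion is lossless
    // down to millisecond precision (the zone itself is not kept by Date).
    static Date toLegacyDate(ZonedDateTime zdt) {
        return Date.from(zdt.toInstant());
    }

    // A LocalDate has no time or zone. Assumed policy for this sketch:
    // use the start of that day in the zone supplied by the caller.
    static Date toLegacyDate(LocalDate date, ZoneId zone) {
        return Date.from(date.atStartOfDay(zone).toInstant());
    }

    public static void main(String[] args) {
        ZoneId utc = ZoneId.of("UTC");
        ZonedDateTime release = ZonedDateTime.of(2024, 10, 25, 12, 0, 0, 0, utc);

        // Both values could then be handed to any formatter that takes java.util.Date.
        System.out.println(toLegacyDate(release).getTime());
        System.out.println(toLegacyDate(LocalDate.of(2024, 10, 25), utc).getTime());
    }
}
```

According to the announcement, ICU 76 makes this kind of manual bridging unnecessary for most java.time types.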
There are some new APIs to make ICU easier to use with modern C++ and Java patterns. Most of the C/C++ APIs added for this purpose are implemented as C++ header-only APIs, and usable on top of binary stable C APIs, which is a first for ICU.
The Java and C++ technology preview implementations of the (also in tech preview) CLDR MessageFormat 2.0 specification have been updated to match recent changes.
ICU 76 and CLDR 46 are major releases, including a new version of Unicode and major locale data improvements.
For details, please see
https://unicode-org.github.io/icu/download/76.html.
Adopt a Character and Support Unicode’s Mission
Looking to give that special someone a special something?
Or maybe something to treat yourself?
Adopt a character or emoji to give it the attention it deserves, while also supporting Unicode’s mission to ensure everyone can communicate in their own languages across all devices.
Each adoption includes a digital badge and certificate that you can proudly display!
Have fun and support a good cause
You can also donate funds or gift stock
As Unicode, Inc. is a US-based open source, open standards, non-profit, 501(c)(3) organization, your contribution may be eligible for a tax deduction. Please consult with a tax advisor for details.
Unicode CLDR 46 available
- Updated to Unicode 16.0 (including major changes to collation)
- Substantial additions and modifications of Emoji search keyword data
- ‘Upleveling’ the locale coverage (see below)
- Updates to Message Format in tech preview
- Updates to conformance
- New tech preview section on semantic skeletons
New / Upleveled Locales
New level and locales:
- Modern: Nigerian Pidgin, Tigrinya
- Moderate: Akan, Baluchi (Latin), Kangri, Tajik, Tatar, Wolof
- Basic: Ewe, Ga, Kinyarwanda, Konkani (Latin), Northern Sotho, Oromo, Sichuan Yi, Southern Sotho, Tswana
- Basic*: Chuvash, Anii
For more information
Monday, May 20, 2024
Unicode CLDR Version 46 Submission Open
Via the online Survey Tool, contributors supply data for their languages — data that is widely used to support much of the world’s software. This data is also a factor in determining which languages are supported on mobile phones and computer operating systems.
Version 46 is focusing on:
- Unicode 16 additions: new emoji, script names, collation data (Chinese & Japanese), …
- Emoji search keywords: Expanding keyword coverage to make it easier for users to find the right emoji
- New Languages targeting Basic:
  - Ewe (ee)
  - Ga (gaa)
  - Kinyarwanda (rw)
  - Northern Sotho (nso)
  - Oromo (om)
  - Sesotho (st)
  - Setswana (tn)
- Up-leveling: Akan (ak)
Each new locale starts with a small set of Core data, such as a list of characters used in the language. Submitters of those locales need to bring the coverage up to Basic level (very basic dates, times, numbers, and endonyms) during the next submission cycle.
Once a language reaches Basic coverage, it has the minimum support for use in language selection, such as on mobile devices. In the next submission cycle, the name for that language is also added for translation for all languages at Modern coverage.
If you would like to contribute missing data for your language, see Survey Tool Accounts. For more information on contributing to CLDR, see the CLDR Information Hub.