Showing posts with label UAX #44. Show all posts
Friday, April 13, 2018
Last Call on Unicode 11.0 Review
The beta review period for Unicode 11.0 and related technical standards will close
on April 23, 2018. This is the last opportunity for technical comments before
version 11.0 is released in Q2 2018. Implementers and interested parties are
encouraged to download data files, review proposed updates, and submit comments.
Unicode 11.0 adds seven new scripts, including Hanifi Rohingya, and 66 additional emoji characters, including four new components for hair color (for a total of 157 emoji sequences). The set of Georgian Mtavruli capital letters has been added to support modern casing practices.
- For more information about testing the 11.0 beta, see unicode.org/versions/beta-11.0.0.html
- For the current draft summary of Unicode 11.0, see unicode.org/versions/Unicode11.0.0
UAX #14, Unicode Line Breaking Algorithm
- Uses Extended_Pictographic property for future-proofing
- New support for Indic virama handling
- Uses Extended_Pictographic property for future-proofing
- A new table of formal regex definitions
- Refines the use of ZWJ in identifiers
- Broadens the definition of hashtag identifiers
- Five new fields and improved regular expressions
- Document extension of Unihan properties to non-Unihan
- New property Equivalent_Unified_Ideograph
- New regular expressions for Bidi_Paired_Bracket & Equivalent_Unified_Ideograph
- More discussion of emoji variation sequences
- Clarification of values allowed for the Age property
- Updates data to Unicode 11.0
- Clarification of search tailoring in visual-order scripts
- Updates data to Unicode 11.0
- Enhances discussions of joining controls & combining sequences
- Updates data to Unicode 11.0
- Changes the format of the test file for arbitrary input settings
- Updates input setting for Transitional_Processing
- Supplies Extended_Pictographic property for future-proofing
- Simplifies emoji sequence definitions
- EBNF and Regex expressions for loose matches
- More proposed guidelines: gender-neutral emoji, skin-tone modifiers, ZWJ visible fallbacks, hair-style components
- Mechanism for changing the “facing” direction for emoji
Friday, December 13, 2013
Unicode 7.0 Annexes Available for Early Review
As technical work gets underway to prepare the publication of Unicode 7.0 (tentatively scheduled for June 2014), the Unicode Technical Committee has posted proposed updates for several important specifications:
PRI #260, Proposed Update UTS #10, Unicode Collation Algorithm
PRI #261, Proposed Update UAX #15, Unicode Normalization Forms
PRI #262, Proposed Update UAX #44, Unicode Character Database
In UTS #10, collation weights are discussed more generically, with fewer references to the 16-bit weights used in the DUCET. Section 6.3.2, Large Values for Secondary or Tertiary Weights, was merged into Section 6.2, Large Weight Values. In UAX #44, the derivation of the Alphabetic property has been updated, and the discussion of @missing in Section 4.2.10, @missing Conventions, has been simplified to reflect the revised conventions in the UCD data files, which eliminated special edge cases.
Review periods for these new public review issues close January 27, 2014. For details about reviewing and commenting, please see the Public Review Issues page.
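For readers unfamiliar with the @missing convention mentioned above for UAX #44: UCD data files use specially formatted comment lines to state the default property value for code points that are not listed explicitly. The following is a minimal Python sketch of how such lines might be read. It assumes only the two common shapes (range plus value, and range plus property name plus value). The names `MissingDefault` and `parse_missing` are hypothetical, and the sample lines are written in the style of the UCD files rather than copied from a particular version.

```python
import re
from typing import NamedTuple, Optional


class MissingDefault(NamedTuple):
    """One parsed @missing line (hypothetical helper type)."""
    first: int            # first code point of the range, inclusive
    last: int             # last code point of the range, inclusive
    prop: Optional[str]   # property name, if the line names one
    value: str            # default value for code points not listed


# Matches "# @missing: 0000..10FFFF; <rest>"; the range end is optional.
_MISSING = re.compile(
    r"^#\s*@missing:\s*"
    r"([0-9A-Fa-f]{4,6})(?:\.\.([0-9A-Fa-f]{4,6}))?"
    r"\s*;\s*(.*)$"
)


def parse_missing(line: str) -> Optional[MissingDefault]:
    """Return the default described by an @missing line, or None otherwise."""
    m = _MISSING.match(line.strip())
    if m is None:
        return None
    first = int(m.group(1), 16)
    last = int(m.group(2), 16) if m.group(2) else first
    fields = [f.strip() for f in m.group(3).split(";")]
    if len(fields) == 1:
        # Shape: range; default value
        return MissingDefault(first, last, None, fields[0])
    # Shape: range; property name; default value
    return MissingDefault(first, last, fields[0], fields[1])


if __name__ == "__main__":
    samples = [
        "# @missing: 0000..10FFFF; No_Block",
        "# @missing: 0000..10FFFF; NFD_QC; Yes",
        "0000..007F; Basic Latin",  # ordinary data line, not an @missing line
    ]
    for s in samples:
        print(s, "->", parse_missing(s))
```

A real parser would also need to follow whatever additional rules the current text of Section 4.2.10 specifies; this sketch ignores anything beyond the two shapes shown.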
http://unicode-inc.blogspot.com/2013/12/unicode-70-annexes-available-for-early.html