Tuesday, January 22, 2013
Making UTC Document Register Public
The Unicode Technical Committee (UTC) is making its
document register freely available for public access, starting on April
15, 2013. This decision has been taken in the interest of increasing
public involvement in the ongoing deliberations
of the UTC regarding the development of the Unicode Standard and the
other standards and reports that it maintains. Open access to the
document register will also make it easier to search the documents, both
current and historical, for topics of interest,
using widely available search engines. The UTC document register
contains online documents dating back to 1997 and online registers for
paper document distributions dating back to 1991.
The date for opening up access has been set to April 15 to provide sufficient time for anyone who might have issues concerning this change to raise their concerns to the Unicode Consortium. In particular, any author of a document which was submitted to the UTC under the old rules, with the assumption that the document would be available only to current members of the Consortium for review, who has concerns about that document being made publicly accessible, is encouraged to contact the Unicode Consortium. Please identify precisely the document of concern and the reasons why you might not wish for it to be included in the publicly accessible set. Please note that the change to make the document register publicly accessible does not change anything with regard to copyright status of existing documents – these documents are not being put in the public domain; rather, the UTC is simply removing the requirement for password access to view them.
The date for opening up access has been set to April 15 to provide sufficient time for anyone who might have issues concerning this change to raise their concerns to the Unicode Consortium. In particular, any author of a document which was submitted to the UTC under the old rules, with the assumption that the document would be available only to current members of the Consortium for review, who has concerns about that document being made publicly accessible, is encouraged to contact the Unicode Consortium. Please identify precisely the document of concern and the reasons why you might not wish for it to be included in the publicly accessible set. Please note that the change to make the document register publicly accessible does not change anything with regard to copyright status of existing documents – these documents are not being put in the public domain; rather, the UTC is simply removing the requirement for password access to view them.
Thursday, January 3, 2013
Major changes to Unicode for Arabic & Hebrew
UAX #9, Unicode Bidirectional Algorithm, will be updated for Unicode 6.3. The
Unicode BIDI algorithm is used for displaying all Arabic and Hebrew text
on the web and in application programs, so any changes require careful
review.
This proposed update involves a substantial extension of the Unicode Bidirectional Algorithm to allow for the implementation of isolate runs. It also introduces new Bidi_Class property values and formatting characters in support of that extension.
There are also changes to Section 3.3.4 Resolving Neutral Types to resolve paired punctuation marks as a unit. This adds a new rule N0.
See the modifications section of the proposed update for information on specific changes to sections in the document.
The proposed update is available here: http://www.unicode.org/reports/tr9/tr9-28.html
This proposed update involves a substantial extension of the Unicode Bidirectional Algorithm to allow for the implementation of isolate runs. It also introduces new Bidi_Class property values and formatting characters in support of that extension.
There are also changes to Section 3.3.4 Resolving Neutral Types to resolve paired punctuation marks as a unit. This adds a new rule N0.
See the modifications section of the proposed update for information on specific changes to sections in the document.
The proposed update is available here: http://www.unicode.org/reports/tr9/tr9-28.html
Tuesday, December 18, 2012
Unicode 6.2 Paperback Available
Unicode 6.2, Core Specification is now available as paperback book.
Responding to requests, the editorial committee has created a modestly-priced print-on-demand volume that contains the complete text of the core specification of Version 6.2 of the Unicode Standard. This 692-page volume may be purchased from Lulu.com for 17ドル.24, plus shipping.
Note that this volume does not include the Version 6.2 code charts, nor does it include the Version 6.2 Standard Annexes and Unicode Character Database, all of which are available only on the Unicode website, http://www.unicode.org/versions/Unicode6.2.0/ .
Purchase The Unicode Standard, Version 6.2 - Core Specification.
Responding to requests, the editorial committee has created a modestly-priced print-on-demand volume that contains the complete text of the core specification of Version 6.2 of the Unicode Standard. This 692-page volume may be purchased from Lulu.com for 17ドル.24, plus shipping.
Note that this volume does not include the Version 6.2 code charts, nor does it include the Version 6.2 Standard Annexes and Unicode Character Database, all of which are available only on the Unicode website, http://www.unicode.org/versions/Unicode6.2.0/ .
Purchase The Unicode Standard, Version 6.2 - Core Specification.
Friday, December 14, 2012
Unicode Stability Policies Updated
The Unicode Character Encoding Stability Policies ensure that developers know
what they can depend on between successive releases of the Unicode Standard.
Recent changes to these policies include new guarantees:
Recent changes to these policies include new guarantees:
- Property aliases will not be reused later for different properties.
- Property value aliases will not be reused later for different property values.
- Characters with the General_Category of Number are guaranteed to have a corresponding Numeric_Type value.
- No new General_Category property values will ever be added.
- New Bidi_Class property values can only be added for a tightly constrained class of new character additions.
Wednesday, December 12, 2012
Feedback requested for Unicode 6.3
Unicode 6.3 is slated to be released in 2013Q3. Now is your opportunity to comment on the contents of this release.
The text of the Unicode Standard Annexes (segmentation, normalization, identifiers, etc.) is open for comments and feedback, with proposed update versions posted at UAX Proposed Updates. Initially, the contents of these documents are unchanged: the one exception is UAX #9 (BIDI), which has major revisions in PRI232. Changes to the text will be rolled in over the next few months, with more significant changes being announced. Feedback is especially useful on the changes in the proposed updates, and should be submitted by mid-January for consideration at the Unicode Technical Committee meeting at the end of January.
A later announcement will be sent when the beta versions of the Unicode character properties for 6.3 are available for comment. The only characters planned for this release are a small number of bidi control characters connected with the changes to UAX #9.
Monday, December 10, 2012
Unicode Collation Proposed Update
The Unicode Collation Algorithm (UCA) data is being modified to make all digits with the same numeric value sort the same, whether they are European (ASCII), Arabic, Devanagari, or others. In addition, the format of the main data table has changed to omit the (unused) 4th level weight, and some data tables are moved to the Unicode CLDR project.
These and other changes are in the new proposed update: see PRI 235. For the exact list of modifications, see Modifications.
These and other changes are in the new proposed update: see PRI 235. For the exact list of modifications, see Modifications.
Friday, November 16, 2012
Unicode 6.2 core specification now available
The Unicode 6.2 core
specification is now available. The text has been updated to align it with
changes to Unicode algorithms and properties that were
released in September, including the addition of the newly adopted Turkish
lira sign. The release of the core
specification completes the definitive documentation of the Unicode Standard,
Version 6.2.
For more details, see http://www.unicode.org/versions/Unicode6.2.0.
For more details, see http://www.unicode.org/versions/Unicode6.2.0.
Friday, October 26, 2012
CLDR Version 22.1 Released
Oct 26, 2012 — Unicode CLDR 22.1 has been released, providing an update to the key building blocks for software supporting the world's languages.
Unicode CLDR 22.1 contains data for 215 languages and 227 territories—654 locales in all. Version 22.1 is an update release, with several important fixes to CLDR 22.0, such as addition of the new Turkish currency symbol, and simpler patterns for fallback timezone formatting (“Los Angeles Time” instead of “United States Time (Los Angeles)”). For details, see CLDR-22.1.
CLDR is by far the largest and most extensive standard repository of locale data, used by a wide spectrum of companies for their software internationalization and localization. It is widely deployed via International Components for Unicode (ICU), and also accessed directly by companies such as Apple, Google, IBM, Twitter, and many others. CLDR is part of the Unicode locale data project, together with the Unicode Locale Data Markup Language (LDML)—an XML format used for general interchange of locale data, such as in Microsoft's .NET.
See the Charts pages for views of the CLDR data, organized in various ways. For more information about the Unicode CLDR project see cldr.unicode.org.
About the Unicode Consortium
The Unicode Consortium is a non-profit organization founded to develop, extend and promote use of the Unicode Standard and related globalization standards. The membership of the consortium represents a broad spectrum of corporations and organizations in the computer and information processing industry. Members are: Adobe Systems, Apple, Google, Government of Andhra Pradesh, Government of Bangladesh, Government of India, IBM, Microsoft, Monotype Imaging, Oracle, SAP, Tamil Virtual University, The Society for Natural Language Technology Research, The University of California (Berkeley), Yahoo!, plus well over a hundred Associate, Liaison, and Individual members. For more information, please contact the Unicode Consortium http://www.unicode.org/contacts.html.
Unicode CLDR 22.1 contains data for 215 languages and 227 territories—654 locales in all. Version 22.1 is an update release, with several important fixes to CLDR 22.0, such as addition of the new Turkish currency symbol, and simpler patterns for fallback timezone formatting (“Los Angeles Time” instead of “United States Time (Los Angeles)”). For details, see CLDR-22.1.
CLDR is by far the largest and most extensive standard repository of locale data, used by a wide spectrum of companies for their software internationalization and localization. It is widely deployed via International Components for Unicode (ICU), and also accessed directly by companies such as Apple, Google, IBM, Twitter, and many others. CLDR is part of the Unicode locale data project, together with the Unicode Locale Data Markup Language (LDML)—an XML format used for general interchange of locale data, such as in Microsoft's .NET.
See the Charts pages for views of the CLDR data, organized in various ways. For more information about the Unicode CLDR project see cldr.unicode.org.
About the Unicode Consortium
The Unicode Consortium is a non-profit organization founded to develop, extend and promote use of the Unicode Standard and related globalization standards. The membership of the consortium represents a broad spectrum of corporations and organizations in the computer and information processing industry. Members are: Adobe Systems, Apple, Google, Government of Andhra Pradesh, Government of Bangladesh, Government of India, IBM, Microsoft, Monotype Imaging, Oracle, SAP, Tamil Virtual University, The Society for Natural Language Technology Research, The University of California (Berkeley), Yahoo!, plus well over a hundred Associate, Liaison, and Individual members. For more information, please contact the Unicode Consortium http://www.unicode.org/contacts.html.
Subscribe to:
Comments (Atom)