The Unicode Blog

Thursday, April 29, 2010

[Unicode Announcement] Unicode Releases Common Locale Data Repository, Version 1.8.1

The Unicode CLDR 1.8.1 maintenance release is now available. See
http://cldr.unicode.org/index/downloads/cldr-1-8-1 for details.

The next major release is CLDR 1.9, scheduled for the end of October.
Two milestone releases are planned for 1.9 as well. The 1.9 release is
focused on tooling and structural changes, while the CLDR 2.0 release
will involve general data submission. For the tentative schedule, see
http://cldr.unicode.org.

----
All of the Unicode Consortium lists are strictly opt-in lists for members
or interested users of our standards. We make every effort to remove
users who do not wish to receive e-mail from us. To see why you are getting
this mail and how to remove yourself from our lists if you want, please
see http://www.unicode.org/consortium/distlist.html#announcements

Posted by Unicode, Inc. at 8:58 PM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Tuesday, April 27, 2010

[Unicode Announcement] Call for Participation: IUC 34, Oct 18-20, 2010

Mountain View, CA, USA – April 26, 2010 – The Unicode® Consortium today
announced a call for participation in The Thirty-fourth
Internationalization & Unicode® Conference (IUC 34), taking place in
Santa Clara, Calif., USA; October 18-20, 2010. The conference is
produced by OMG™.

The Internationalization & Unicode Conference is the premier annual
technical conference for topics on the design and global deployment of
multilingual applications and web sites. Internationalization and
Unicode experts, implementers, clients, teachers, students, and vendors
are invited to attend this unique conference. The interactive format
makes the Internationalization & Unicode Conference a great place to
meet and exchange ideas with leading experts, find out about the needs
of potential clients, and get information about Unicode-enabled products.

To be considered as a presenter for the conference, please submit a
brief abstract before Wednesday, May 26. Topics should be related to
internationalization and localization; presentations structured as
tutorials are also welcome. Suitable topics include, but are not limited
to:

Best Practices and New Approaches

• New technologies, algorithms and methodologies
• Using internationalization libraries and programming environments
• Handling bidirectional or other complex scripts
• Data formats and evolving standards, e.g. XML, JSON, HTML5, DITA,
• Project management for global development teams
• Localization technologies, Crowd Sourcing, Machine Translation, et al
• Development, test, and deployment techniques and experiences
• Improving globalization capabilities within organizations
• Migrating legacy applications to global markets
• Unicode, Emoji, and character encodings

Application Areas

• Social networks
• Search engines, SEO, discovery and navigation best practices
• Websites, Cloud Computing, SAAS, and Web services
• Libraries and education
• Mobile applications, including iPhone, Android, iPad, Kindle, etc.
• Publishing and broadcasting for a global audience
• Internationalized Domain Names and other identifiers
• Security concerns and practices

Language and Locale Support

• African, Asian, Middle Eastern, and support for other languages
• Unicode Common Locale Data Repository (CLDR)
• Font development

Details of the call for participation are available at:
http://www.unicodeconference.org/iuc34call

Interested individuals or organizations are invited to submit a brief
(up to 600 word) abstract of their proposed conference presentation by
Wednesday, May 26 using this web form:
http://www.unicodeconference.org/abstracts

The Program Committee will notify authors by Wednesday, June 9. Final
presentation materials will be required from selected presenters by
Tuesday August 31. The conference agenda will be available by Tuesday,
June 15 at: http://www.unicodeconference.org/

Sponsorships and exhibit space are available; for more information on
sponsoring contact Ken Berk at kenberk@omg.org, +1-781-444 0404. For
exhibiting questions email event_marketing@omg.org . For all other
questions email: info@unicodeconference.org

###

About the Unicode Consortium

The Unicode Consortium is a non-profit organization founded to develop,
extend and promote use of the Unicode Standard and related globalization
standards.

The membership of the consortium represents a broad spectrum of
corporations and organizations in the computer and information
processing industry. Members are: Adobe Systems, Apple, DENIC eG,
Google, Government of India, Government of Tamil Nadu, IBM, Microsoft,
Monotype Imaging, Oracle, The Society for Natural Language Technology
Research, SAP, Sybase, The University of California (Berkeley), The
University of California (Santa Cruz), Yahoo!, plus well over a hundred
Associate, Liaison, and Individual members.

For more information, please contact the Unicode Consortium
http://www.unicode.org/contacts.html. For more information, please
contact the Unicode Consortium http://www.unicode.org.

About the Event Producer

OMG™ is the Event Producer for the Internationalization & Unicode
Conferences. OMG is an open membership, not-for-profit consortium that
produces and maintains computer industry specifications for
interoperable enterprise applications. Our specifications include MDA®,
UML®, CORBA®, MOF™, XMI® and CWM™. OMG's specifications are all
available for download by everyone without charge.

For more information about OMG, visit us online at http://www.omg.org.

Note to editors: Unicode Standard, Unicode and the Unicode Logo are
trademarks of Unicode, Inc. Unicode Consortium is a registered trademark
of Unicode, Inc. OMG and Object Management Group are trademarks of
Object Management Group. All other trademarks are the property of their
respective owners.

Posted by Unicode, Inc. at 11:46 AM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Thursday, April 15, 2010

[Unicode Announcement] New Public Review Issue: Two New Provisional Properties for Characters in Indic Scripts

The Unicode Technical Committee has posted a new issue for public review
and comment. Details are on the following web page:

http://www.unicode.org/review/

Review periods for the new items close on May 3, 2010.

Please see the page for links to discussion and relevant documents.
Briefly, the new issue is:

Issue #168: Two New Provisional Properties for Characters in Indic Scripts
http://www.unicode.org/review/#pri168

The UTC is considering the addition of two new, enumerated provisional
character properties for Indic scripts: Indic_Syllabic_Category and
Matra_Placement. These are to assist in the analysis and processing of
syllables for various Brahmi-derived scripts, providing classificatory
information that is not easy to extract or derive for all of the Indic
scripts in the standard. Feedback is welcome on the construction of the
proposed properties, the details of the proposed assignment of values
for characters, and on the question of the usefulness of defining such
properties.

If you have comments for official UTC consideration, please post them by
submitting your comments through our feedback & reporting page:

http://www.unicode.org/reporting.html

If you wish to discuss issues on the Unicode mail list, then please
use the following link to subscribe (if necessary). Please be aware
that discussion comments on the Unicode mail list are not automatically
recorded as input to the UTC. You must use the reporting link above
to generate comments for UTC consideration.

http://www.unicode.org/consortium/distlist.html

Posted by Unicode, Inc. at 1:06 PM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Saturday, April 10, 2010

[Unicode Announcement] Tracking proposed updates to Unicode technical publications

To make it easier to find and track proposed updates to Unicode
technical publications, the editorial committee has made several
improvements:

* Proposed updates can be found by a predictable, stable URL
* This URL is always accessible from the latest approved version of
the document via a header field titled "Latest Proposed Update".

For example, if you look at the approved 5.2.0 version of UAX
#15, Unicode Normalization Forms
(http://www.unicode.org/reports/tr15/), you'll find at the top
of the document under "Latest Proposed Update" a link to
http://www.unicode.org/reports/tr15/proposed.html. That URL
points to the latest proposed update for UAX #15.

* If there is currently no proposed update for a document, the URL
will point to a stub document indicating that there is no current
proposed update.

In addition, a predictable, stable URL is used for the modifications
section within each proposed update. That section summarizes the changes
that have been made from the previous version. These URLs follow the
format http://www.unicode.org/reports/tr15/proposed.html#Modifications .

Posted by Unicode, Inc. at 12:38 PM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Thursday, April 8, 2010

[Unicode Announcement] W3C India Conference

Here is an announcement for a conference in India which should be of interest to Unicode members.

---------------

The W3C India Office is organizing an International Conference "World Wide Web: Technology, Standards and Internationalization - 2010" in New Delhi on May 6-7, 2010.

The focus of the conference is to promote and proliferate W3C Standards in India to enable seamless Web access in Indian languages. One of the major aspects to be covered in the conference is Internationalization, especially in light of the complexity of implementing Indian Languages.

Core Technology Tracks in the Conference include:

1. W3C and Web Technologies
2. Internationalization Aspects in W3C
3. Web Access through mobile and hand-held devices
4. CSS and Styling issues
5. Web Architecture and Semantic web
6. Human Machine Interface for the Web
7. Web Content Accessibility in Indian Languages
8. W3C and E-Governance

Kindly visit the W3C India Website http://www.w3cindia.in and the Conference Website http://www.w3cindia.in/conf-site/conference-index.htm for more details. The Conference will also attempt to evolve a Roadmap for proliferation and specific requirements for Indian Languages in W3C and associated standards. We look forward to your active cooperation and participation in the Conference.

Posted by Unicode, Inc. at 11:39 AM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Tuesday, March 30, 2010

[Unicode Announcement] Public Review Issue: IVD Submission, PRI #167

The Unicode Consortium has posted a new issue for public review and
comment. Details are on the following web page:

http://www.unicode.org/review/

Review periods for the new items close on June 26, 2010.

Please see the page for links to discussion and relevant documents.
Briefly, the new issue is:

PRI #167 Ideographic Variation Database Submission

The Ideographic Variation Database provides a registry for collections
of unique variation sequences containing unified ideographs, allowing
for standardized interchange according to UTS #37, Ideographic Variation
Database. A submission to the Ideographic Variation Database has been
received for: "Combined registration of the Hanyo-Densi collection and
of sequences in that collection". Details are in the background document.

http://www.unicode.org/ivd/pri/pri167/index.html

Reviewers are encouraged to comment on any aspect of the submissions,
but more particularly on:

* whether the intent of a proposed collection is appropriately described
* whether the glyphic subset corresponding to a proposed sequence is
indeed a glyphic subset of the base character for the sequence
* whether the proposed sequences are congruent with the scope of their
collection, or whether a new collection may be more appropriate

If you have comments for official consideration, please post them by
submitting your comments through our feedback & reporting page:

http://www.unicode.org/reporting.html

If you wish to discuss issues on the Unicode mail list, then please use
the following link to subscribe (if necessary). Please be aware that
discussion comments on the Unicode mail list are not automatically
recorded as input to the IVD registrar. You must use the reporting link
above to generate comments for official consideration.

http://www.unicode.org/consortium/distlist.html

Posted by Unicode, Inc. at 4:02 PM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Wednesday, March 17, 2010

[Unicode Announcement] The Unicode Consortium Releases CLDR, Version 1.8

Mountain View, CA, March 17, 2009 - The Unicode Consortium announced
today the release of the new version of the Unicode Common Locale Data
Repository (Unicode CLDR 1.8), providing key building blocks for
software to support the world's languages.

CLDR 1.8 contains data for 186 languages and 159 territories: 501
locales in all. Version 1.8 of the repository contains over 22% more
locale data than the previous release, with over 42,000 new or modified
data items from over 300 different contributors.

For this release, the Unicode Consortium partnered with ANLoc, the
African Network for Localization, a project sponsored by Canada's
International Development Research Centre (IDRC), to help extend modern
computing on the African continent. ANLoc's vision is to empower
Africans to participate in the digital age by enabling their languages
in computers. A sub-project of ANLoc, called Afrigen, focuses on
creating African locales.

The Afrigen-ANLoc project's mission is to create viable locale data for
at least 100 of the over 2000 languages spoken in Africa, and
incorporate the data into Unicode's CLDR project and OpenOffice.org.
Implementation of fundamental locale data within CLDR is a critical step
for providing computer applications that can be localized into these
African languages, thus reaching populations that have never before been
able to use their native languages on computers and mobile phones.

The Afrigen-ANLoc project selected approximately 200 candidate
languages, including all official languages recognized by a national
government and all languages with at least 500,000 native speakers.
Additional languages were incorporated when volunteers stepped forward.
Data was collected through the Afrigen-ANLoc project by native-speaking
volunteers around the world, entered via a web-based utility designed
specifically for this purpose, and then merged into the CLDR repository.
In all, over 150 volunteers gathered locale data for 72 African
languages, with data for 54 of those incorporated into the CLDR 1.8
release. 41 of these languages are completely new to the Unicode CLDR
project while 13 others existed in earlier versions of CLDR and were
enhanced with additional data. These languages are spoken in 26
countries across the entire African continent.

"The partnership with Afrigen has been a huge benefit for us," says John
Emmons, vice-chair of the Unicode CLDR technical committee and lead CLDR
engineer for IBM. "The Afrigen effort has allowed us to bring many new
languages on board that we wouldn't be able to do through our normal
process, while still maintaining the level of quality and consistency
that we require for every language."

For more information about Unicode CLDR 1.8, see
http://cldr.unicode.org/index/downloads/cldr-1-8

The Afrigen-ANLoc data collection tool was developed by Louise
Berthilson of IT46 (http://www.it46.se), and the project is managed by
Martin Benjamin, director of Kamusi Project International
(http://kamusi.org). For more information about the African Network for
Localization, see http://www.africanlocalisation.net. For more
information about the Afrigen-ANLoc project, see
http://www.it46.se/afrigen. For more information about IDRC, see
http://www.idrc.ca.

About the Unicode Consortium

The Unicode Consortium is a non-profit organization founded to develop,
extend and promote use of the Unicode Standard and related globalization
standards.

For more information, please contact the Unicode Consortium.
http://www.unicode.org/contacts.html

Posted by Unicode, Inc. at 8:59 AM

Email This BlogThis! Share to X Share to Facebook Share to Pinterest

Friday, February 26, 2010

[Unicode Announcement] New Public Review Issues for Unicode 6.0 UAXes

The Unicode Consortium is revising the Unicode Standard Annexes for
eventual release of Unicode 6.0. A standard part of our development
process is to open all of the annexes for public review. All the annexes
are currently available on the PRI page. We encourage all interested
parties to participate in this review. To look at the annexes and make
suggestions for additions and improvements to their content, please see
the Public Review Issues page:

http://www.unicode.org/review/

Comments should be submitted by May 3, 2010.

If you have comments for official UTC consideration, please post them by
submitting your comments through our feedback & reporting page:

Thursday, April 29, 2010

[Unicode Announcement] Unicode Releases Common Locale Data Repository, Version 1.8.1

Tuesday, April 27, 2010

[Unicode Announcement] Call for Participation: IUC 34, Oct 18-20, 2010

Thursday, April 15, 2010

[Unicode Announcement] New Public Review Issue: Two New Provisional Properties for Characters in Indic Scripts

Saturday, April 10, 2010

[Unicode Announcement] Tracking proposed updates to Unicode technical publications

Thursday, April 8, 2010

[Unicode Announcement] W3C India Conference

Tuesday, March 30, 2010

[Unicode Announcement] Public Review Issue: IVD Submission, PRI #167

Wednesday, March 17, 2010

[Unicode Announcement] The Unicode Consortium Releases CLDR, Version 1.8

Friday, February 26, 2010

[Unicode Announcement] New Public Review Issues for Unicode 6.0 UAXes

Links of Interest

Blog Archive

Labels

Followers

Subscribe to this blog