Braille Patterns
Braille is an international writing system for the blind. It was invented in 1821 in Paris by Louis Braille, who was himself blind. The script is used today worldwide. Braille uses a system of six or eight raised dots that are arranged in two vertical rows of three or four dots.
Buginese
The Buginese script, also known as Lontara, is used to write the Buginese (Bugis) language on the island of Sulawesi in Indonesia, primarily in southwest Sulawesi, but the language is also spoken in other areas. The script is a descendant of Brahmi, and may be related to Javanese. It shows some affinity to the Tagalog script. Buginese has been in use since the 14C. It has also been used to write the Makasar, Bima, and Mandar languages.
Buhid
Buhid is a living minority script used in Mindoro in the Philippines used to write the Buhid language. Buhid is a Brahmi-derived script, distantly related to the South Indian scripts. Buhid is closely related to the Hanunóo and Tagbanwa scripts of the Philippines. All three scripts are related to Tagalog, but may not be directly descended from it. The ancestor of these Philippine scripts (including Tagalog) may have been transported to the Philippines via palaeographic scripts of western Java between the 10 and 14C CE.
Byzantine Musical Symbols
Byzantine Musical Symbols are musical symbols used to write the religious music and hymns of the Christian Orthodox Church and some folk music manuscripts. These symbols first appeared in the 7C to 8C CE. In 1881, the Orthodox Patriarchy Musical Committee established the New Analytical Byzantine Musical Notation System, which is used today. Most of the manuscripts are in Greek, although a few are in in Russian, Bulgarian, Romanian, and Arabic.
CJK
"CJK" refers to to the unified Han characters used to write to the Chinese, Japanese, and Korean languages. Technically, CJK can also be used for the Vietnamese language, since early Vietnamese writing systems were based on Han. There are several blocks of CJK characters, including multiple blocks of unified ideographs, and blocks of compatibilty ideographs, symbols and punctuation marks, strokes, and radicals. A description of the CJK blocks and the unification principles appears in section 12.1 of The Unicode Standard, and a history of the encoding appears in Appendix E of the Standard.
Carian
Carian is a partly undeciphered script, which has some relationship to the Greek alphabet. It is used to write the Carian language. Carian dates to the first millennium BCE. A few inscriptions have been found in Caria, on the western portion of present-day Turkey, and a fragmentary bilingual has also been found in Athens. However, the bulk of extant texts have been found in Egypt, and were left by Carian mercenaries. In 1996 an extensive Carian-Greek bilingual was discovered at Kaunos in southwestern Turkey. The bilingual clearly demonstrated Carian was a member of the Anatolian branch of Indo-European, though details of the language still remain unclear.
Chakma
The Chakma script is used to write the Chakma language, which is spoken in Bangladesh and India, and is being adapted for the Tanchangya language in Bangladesh. The script has been used for liturgical purposes, but also is used in teaching materials today. The Chakma language is also written in the Latin and Bengali scripts.
Cham
The Cham script, also known as Akhar Thrah, is a Brahmi-derived script used write the Cham language, an Austronesian language. There are two main varieties of the Cham language: Western Cham, which spoken in Cambodia (and to a lesser extent in Vietnam and Thailand), and Eastern Cham, spoken in Vietnam. Speakers of the former tend to use the Arabic script while some speakers of the latter still use the Cham script.
Cherokee
The Cherokee script is used to write the indigenous Cherokee language. Cherokee is the native tongue to about 20,000 people, though most speakers today use it as a second language. It was originally invented by Sequoyah, a Cherokee silversmith, in the early 19C, between 1815 and 1821. Sequoyah devised a system of numbers for Cherokee, but today Latin numbers are used instead. The script is still taught today.
Combining Diacritical Marks
Diacritical marks are ancillary marks that are added to a base character, and can be used to indicate how the character is to be pronounced or stressed. The Combining Diacritical Marks block includes characters intended for general use with any script. Diacritical marks that are specific to a particular script are encoded with that script. The Combining Diacritical Marks Supplement comprises a set of lesser-used combining diacritical marks.
Combining Diacritical Marks for Symbols
This set of Combining Diacritical Marks for Symbols are, in general, to be applied to mathematical or technical symbols, and serve to extend the set of such symbols. A number of compatibility enclosing marks are also included in this block, which can enclose the base character in different ways.
Combining Half Marks
The Combining Half Marks set consists of a number of combining mark pieces that can be used to visually encode certain combining marks that extend over multiple base letterforms. They are included to facilitate the support of such marks in legacy implementations. However, double diacritics, such as U+0360 and U+0361, are to be preferred. The block also includes macron marks that are recommended for representing a style of supralineation in Coptic.
Common Indic Number Forms
The Common Indic Number Forms are characters that are used to represent fractional values in various scripts of North India, Pakistan and Nepal. These signs were used to write currency, weight, measure, time, and other units. They have been used since the 16C and are still employed today in a limited capacity.
Control Pictures
Control pictures are conventional representations of nongraphic characters for use when it is necessary to show the position of a control code within a data stream. Three characters are included in this block to visibly represent ASCII space.
Coptic
The Coptic script represents the final stage in the development of the Egyptian writing system and was used for writing the Coptic language. Coptic was based on the Greek uncial alphabets but several letters were added that were unique to Coptic. Although the language died out in the 14C, it is still maintained as a liturgical language by Coptic Christians. Before Unicode 4.1, Coptic was considered a stylistic variant of Greek, so 14 Coptic characters appear in the "Greek and Coptic" block. Coptic is now considered to be disunified from Greek, so one can use the 14 Coptic characters in the "Greek and Coptic" block as well as additional letters that appear in the new "Coptic" block (which also includes characters for Old Coptic and Nubian).
Counting Rod Numerals
Chinese counting-rod numerals were used to represent and manipulate numbers in pre-modern East Asian mathematical texts. The rods consisted of a set of small sticks that were several centimeters in length; these were arranged in patterns on a gridded counting board. The glyph shapes represent the conventions of the Song dynasty (960 - 1279), when traditional Chinese mathematics was at its height. The symbols go back to the Warring States Period in China, ca. 4C or 5C BCE.
Cuneiform
Sumero-Akkadian Cuneiform is a logosyllabary that was used from the end of the third millennium until the 1C CE. It spread beyond Mesopotamia to Elam, Assyria, eastern Syria, southern Anatolia, and Egypt, and was used for many languages outside of Sumerian and Akkadian. The script developed from Proto-Cuneiform and Early Dynastic cuneiform. Cuneiform is one of the world's oldest writing systems.
Currency Symbols
The Currency Symbols block includes customary symbols used to indicate certain currencies in general text. The signs may vary in shape and are often used for more than one currency. Not all currencies are represented by a currency symbol; some use sequences of multiple-letters, while the abbreviations for currencies can vary by language. Some contemporary or historic currency symbols, not found in the Currency Symbols block, may be found in other blocks.
Cypriot Syllabary
The Cypriot syllabary is an historical script used to write the Cypriot dialect of Greek and a non-Indo-European language, "Eteo-Cypriot." The script was used from the middle of the 11C to 3C BCE and appears at be descended from one of the Cypro-Minoan scripts of Cyprus. The Cypriot syllabary shares some orthographic conventions with Linear B, but the script is written right to left.
Cyrillic
The Cyrillic script has traditionally been used for writing various Slavic languages, including Russian. The script dates to the 9C or 10C CE, and is named after St. Cyril, a Byzantine missionary. It is one of several scripts that were ultimately derived from the Greek script. Cyrillic has been extended to write non-Slavic languages, particularly the minority languages of Russia and surrounding countries. The Cyrillic Extended-A block is made up of Old Church Slavonic superscripted combining letters, while Cyrillic Extended-B comprises various historic characters, such as those used in Old Cyrillic and Old Abkhasian. The Cyrillic Supplement block is made up of additional letters needed to write various non-Slavic languages.