Unicode 9.0 Web Bookmarks
About this page
This page contains hyperlinks to The Unicode Standard, Version 9.0. The Unicode 9.0.0 page lists the contents with links to each PDF file.
- Preface
- 1 Introduction
- 2 General Structure
- 2.1 Architectural Context
- 2.2 Unicode Design Principles
- 2.3 Compatibility Characters
- 2.4 Code Points and Characters
- 2.5 Encoding Forms
- 2.6 Encoding Schemes
- 2.7 Unicode Strings
- 2.8 Unicode Allocation
- 2.9 Details of Allocation
- Figure 2-13. Unicode Allocation
- Plane 0 (BMP)
- Figure 2-14. Allocation on the BMP
- ASCII and Latin-1 Compatibility Area
- General Scripts Area
- Punctuation and Symbols Area
- Supplementary General Scripts Area
- CJK Miscellaneous Area
- CJKV Ideographs Area
- General Scripts Area (Asia and Africa)
- Hangul Area
- Surrogates Area
- Private Use Area
- Compatibility and Specials Area
- Plane 1 (SMP)
- Plane 2 (SIP)
- Other Planes
- 2.10 Writing Direction
- 2.11 Combining Characters
- Combining Characters
- Diacritics
- Symbol Diacritics
- Enclosing Combining Marks
- Figure 2-17. Combining Enclosing Marks for Symbols
- Script-Specific Combining Characters
- Sequence of Base Characters and Diacritics
- Multiple Combining Characters
- Ligated Multiple Base Characters
- Exhibiting Nonspacing Marks in Isolation
- "Characters" and Grapheme Clusters
- 2.12 Equivalent Sequences
- 2.13 Special Characters
- 2.14 Conforming to the Unicode Standard
- 3 Conformance
- 3.1 Versions of the Unicode Standard
- 3.2 Conformance Requirements
- 3.3 Semantics
- 3.4 Characters and Encoding
- 3.5 Properties
- 3.6 Combination
- 3.7 Decomposition
- 3.8 Surrogates
- 3.9 Unicode Encoding Forms
- 3.10 Unicode Encoding Schemes
- 3.11 Normalization Forms
- 3.12 Conjoining Jamo Behavior
- 3.13 Default Case Algorithms
- 4 Character Properties
- Status and Attributes
- Consistency of Properties
- 4.1 Unicode Character Database
- 4.2 Case
- 4.3 Combining Classes
- Figure 4-1. Positions of Common Combining Marks
- Reordrant, Split, and Subjoined Combining Marks
- Reordrant Class Zero Combining Marks
- Table 4-4. Class Zero Combining Marks—Reordrant
- Table 4-5. Thai, Lao, and Other Logical Order Exceptions
- Split Class Zero Combining Marks
- Table 4-6. Class Zero Combining Marks—Split
- Subjoined Class Zero Combining Marks
- Table 4-7. Class Zero Combining Marks—Subjoined
- Strikethrough Class Zero Combining Marks
- Table 4-8. Class Zero Combining Marks—Strikethrough
- 4.4 Directionality
- 4.5 General Category
- 4.6 Numeric Value
- 4.7 Bidi Mirrored
- 4.8 Name
- 4.9 Unicode 1.0 Names
- 4.10 Letters, Alphabetic, and Ideographic
- 4.11 Properties Related to Text Boundaries
- 4.12 Characters with Unusual Properties
- 5 Implementation Guidelines
- 5.1 Data Structures for Character Conversion
- 5.2 Programming Languages and Data Types
- 5.3 Unknown and Missing Characters
- 5.4 Handling Surrogate Pairs in UTF-16
- 5.5 Handling Numbers
- 5.6 Normalization
- 5.7 Compression
- 5.8 Newline Guidelines
- 5.9 Regular Expressions
- 5.10 Language Information in Plain Text
- 5.11 Editing and Selection
- 5.12 Strategies for Handling Nonspacing Marks
- 5.13 Rendering Nonspacing Marks
- 5.14 Locating Text Element Boundaries
- 5.15 Identifiers
- 5.16 Sorting and Searching
- 5.17 Binary Order
- 5.18 Case Mappings
- 5.19 Mapping Compatibility Variants
- 5.20 Unicode Security
- 5.21 Ignoring Characters in Processing
- 5.22 Best Practice for U+FFFD Substitution
- 6 Writing Systems and Punctuation
- Scripts and Blocks
- Scripts and Writing Systems
- Punctuation
- 6.1 Writing Systems
- 6.2 General Punctuation
- Use and Interpretation
- Rendering
- Writing Direction
- Figure 6-2. Forms of CJK Punctuation
- Layout Controls
- Encoding Characters with Multiple Semantic Values
- Blocks Devoted to Punctuation
- Format Control Characters
- Space Characters
- Dashes and Hyphens
- Paired Punctuation
- Language-Based Usage of Quotation Marks
- European Usage
- Figure 6-3. European Quotation Marks
- Glyph Variation in Curly Quotes
- Table 6-4. Models of Visual Relationship between Quote Glyphs
- East Asian Usage
- Table 6-5. East Asian Quotation Marks
- Glyph Variation in East Asian Usage.
- Figure 6-4. Asian Quotation Marks
- Table 6-6. Opening and Closing Forms
- Overloaded Character Codes
- Consequences for Semantics
- Apostrophes
- Other Punctuation
- Hyphenation Point
- Word Separator Middle Dot
- Fraction Slash
- Spacing Overscores and Underscores
- Doubled Punctuation
- Period or Full Stop
- Ellipsis
- Vertical Ellipsis
- Leader Dots
- Other Basic Latin Punctuation Marks
- Canonical Equivalence Issues for Greek Punctuation
- Bullets
- Paragraph Marks
- Numeric Separators.
- Obelus
- Commercial Minus
- At Sign
- Table 6-7. Names for the @
- Archaic Punctuation and Editorial Marks
- Indic Punctuation
- CJK Punctuation
- Unknown or Unavailable Ideographs
- CJK Compatibility Forms
- 7 Europe-I
- 7.1 Latin
- Languages
- Diacritical Marks.
- Alternative Glyphs.
- Figure 7-1. Alternative Glyphs in Latin
- Variations in Diacritical Marks
- Table 7-1. Preferred Rendering of Cedilla versus Comma Below
- Latvian Cedilla
- Cedilla and Comma Below in Turkish and Romanian
- Exceptional Case Pairs
- Diacritics on i and j
- Figure 7-2. Diacritics on i and j
- Vietnamese
- Figure 7-3. Vietnamese Letters and Tone Marks
- Standards.
- Related Characters
- Letters of Basic Latin: U+0041–U+007A
- Letters of the Latin-1 Supplement: U+00C0–U+00FF
- Latin Extended-A: U+0100–U+017F
- Latin Extended-B: U+0180–U+024F
- IPA Extensions: U+0250–U+02AF
- Phonetic Extensions: U+1D00–U+1DBF
- Latin Extended Additional: U+1E00–U+1EFF
- Latin Extended-C: U+2C60–U+2C7F
- Latin Extended-D: U+A720–U+A7FF
- Latin Extended-E: U+AB30–U+AB6F
- Latin Ligatures: U+FB00–U+FB06
- 7.2 Greek
- Greek: U+0370–U+03FF
- Standards
- Polytonic Greek
- Nonspacing Marks
- Table 7-2. Nonspacing Marks Used with Greek
- Iota
- Variant Letterforms
- Figure 7-4. Variations in Greek Capital Letter Upsilon
- Representative Glyphs for Greek Phi
- Greek Letters as Symbols
- Symbols Versus Numbers
- Compatibility Punctuation
- Historic Letters
- Coptic-Unique Letters
- Related Characters
- Greek Extended: U+1F00–U+1FFF
- Ancient Greek Numbers: U+10140–U+1018F
- 7.3 Coptic
- 7.4 Cyrillic
- 7.5 Glagolitic
- 7.6 Armenian
- 7.7 Georgian
- 7.8 Modifier Letters
- 7.9 Combining Marks
- Sequence of Base Letters and Combining Marks
- Multiple Semantics
- Glyphic Variation
- Overlaid Diacritics
- Marks as Spacing Characters
- Spacing Clones of Diacritical Marks
- Relationship to ISO/IEC 8859-1
- Diacritics Positioned Over Two Base Characters
- Figure 7-9. Double Diacritics
- Figure 7-10. Positioning of Double Diacritics
- Figure 7-11. Use of CGJ with Double Diacritics
- Diacritics Positioned Over Three or More Base Characters
- Subtending Marks
- Combining Marks with Ligatures
- Figure 7-12. Interaction of Combining Marks with Ligatures
- Combining Diacritical Marks: U+0300–U+036F
- Combining Diacritical Marks Extended: U+1AB0–U+1AFF
- Combining Diacritical Marks Supplement: U+1DC0–U+1DFF
- Combining Marks for Symbols: U+20D0–U+20FF
- Combining Half Marks: U+FE20–U+FE2F
- Combining Marks in Other Blocks
- 8 Europe-II
- 9 Middle East-I
- 9.1 Hebrew
- 9.2 Arabic
- Arabic: U+0600–U+06FF
- Figure 9-1. Directionality and Cursive Connection
- Directionality
- Standards
- Encoding Principles
- Punctuation
- The Non-joiner and the Joiner
- Figure 9-2. Using a Joiner
- Figure 9-3. Using a Non-joiner
- Figure 9-4. Combinations of Joiners and Non-joiners
- Tashkil Nonspacing Marks
- Figure 9-5. Placement of Harakat
- Arabic-Indic Digits
- Table 9-1. Arabic Digit Names
- Table 9-2. Glyph Variation in Eastern Arabic-Indic Digits
- Extended Arabic Letters
- Koranic Annotation Signs
- Additional Vowel Marks
- Honorifics
- Arabic Mathematical Symbols
- Date Separator
- Full Stop
- Currency Symbols
- Signs Spanning Numbers
- Figure 9-6. Arabic Year Sign
- Poetic Verse Sign
- Arabic Cursive Joining
- Arabic Ligatures
- Arabic Joining Groups
- Arabic Supplement: U+0750–U+077F
- Arabic Extended-A: U+08A0–U+08FF
- Arabic Presentation Forms-A: U+FB50–U+FDFF
- Arabic Presentation Forms-B: U+FE70–U+FEFF
- 9.3 Syriac
- Syriac: U+0700–U+074F
- Syriac Language
- Languages Using the Syriac Script.
- Shaping
- Directionality
- Syriac Type Styles
- Character Names
- Syriac Abbreviation Mark
- Figure 9-7. Syriac Abbreviation
- Figure 9-8. Use of SAM
- Ligatures and Combining Characters
- Diacritical Marks and Vowels
- Punctuation
- Digits
- Harklean Marks
- Dalath and Rish
- Semkath
- Vowel Marks
- Miscellaneous Diacritics.
- Table 9-12. Miscellaneous Syriac Diacritic Use
- Use of Characters of the Arabic Block
- Syriac Shaping
- 9.4 Samaritan
- 9.5 Mandaic
- 10 Middle East-II
- 11 Cuneiform and Hieroglyphs
- 12 South and Central Asia-I
- 12.1 Devanagari
- Devanagari: U+0900–U+097F
- Principles of the Devanagari Script
- Rendering Devanagari Characters
- Consonant Letters
- Independent Vowel Letters
- Dependent Vowel Signs (Matras)
- Vowel Letters
- Table 12-1. Devanagari Vowel Letters
- Virama (Halant)
- Figure 12-1. Dead Consonants in Devanagari
- Consonant Conjuncts
- Figure 12-2. Conjunct Formations in Devanagari
- Explicit Virama (Halant)
- Figure 12-3. Preventing Conjunct Forms in Devanagari
- Explicit Half-Consonants
- Figure 12-4. Half-Consonants in Devanagari
- Figure 12-5. Independent Half-Forms in Devanagari
- Figure 12-6. Half-Consonants in Oriya
- Consonant Forms
- Figure 12-7. Consonant Forms in Devanagari and Oriya
- Rendering Devanagari
- Rules for Rendering
- Notation
- Dead Consonant Rule
- Consonant RA Rules
- Modifier Mark Rules
- Ligature Rules
- Memory Representation and Rendering Order
- Figure 12-8. Rendering Order in Devanagari
- Sample Half-Forms
- Table 12-2. Sample Devanagari Half-Forms
- Sample Ligatures
- Table 12-3. Sample Devanagari Ligatures
- Ligature Forms for Ra + Vocalic Liquids
- Table 12-4. RA + Vocalic Letter Ligature Forms
- Sample Half-Ligature Forms
- Table 12-5. Sample Devanagari Half-Ligature Forms
- Language-Specific Allographs
- Table 12-6. Marathi and Nepali Allographs
- Combining Marks
- Devanagari Digits, Punctuation, and Symbols
- Extensions in the Main Devanagari Block
- Sindhi Letters
- Konkani
- Bodo, Dogri, and Maithili
- Figure 12-9. Use of Apostrophe in Bodo, Dogri and Maithili
- Figure 12-10. Use of Avagraha in Dogri
- Kashmiri Letters
- Letters for Bihari Languages
- Table 12-7. Devanagari Vowels Used in Bihari Languages
- Prishthamatra Orthography
- Table 12-8. Prishthamatra Orthography
- Devanagari Extended: U+A8E0–U+A8FF
- Vedic Extensions: U+1CD0–U+1CFF
- 12.2 Bengali (Bangla)
- Virama (Hasant)
- Vowel Letters
- Table 12-9. Bengali Vowel Letters
- Table 12-10. Diphthong Vowel Letters in Kokborok
- Two-Part Vowel Signs
- Special Characters
- Historic Characters
- Characters for Assamese
- Table 12-11. Assamese Consonant-Vowel Combinations
- Rendering Behavior
- Consonant-Vowel Ligatures
- Table 12-12. Bengali Consonant-Vowel Combinations
- Figure 12-11. Requesting Bengali Consonant-Vowel Ligature
- Figure 12-12. Blocking Bengali Consonant-Vowel Ligature
- Khiya
- Khanda Ta.
- Figure 12-13. Bengali Syllable tta
- Ya-phalaa
- Interaction of Repha and Ya-phalaa
- Punctuation
- Truncation
- Table 12-13. Use of Apostrophe in Bangla
- 12.3 Gurmukhi
- 12.4 Gujarati
- 12.5 Oriya (Odia)
- 12.6 Tamil
- Tamil: U+0B80–U+0BFF
- Tamil Vowels
- Tamil Ligatures
- Ligatures with Vowel i
- Figure 12-19. Tamil Ligatures with i
- Ligatures with Vowel u
- Table 12-24. Tamil Ligatures with u
- Figure 12-20. Spacing Forms of Tamil u
- Ligatures with ra
- Figure 12-21. Tamil Ligatures with ra
- Ligatures with aa in Traditional Tamil Orthography
- Figure 12-22. Traditional Tamil Ligatures with aa
- Figure 12-23. Traditional Tamil Ligatures with o
- Ligatures with ai in Traditional Tamil Orthography
- Figure 12-24. Traditional Tamil Ligatures with ai
- Figure 12-25. Vowel ai in Modern Tamil
- Tamil aytham
- Punctuation
- Numbers
- Use of Nukta
- Tamil Named Character Sequences
- 12.7 Telugu
- 12.8 Kannada
- 12.9 Malayalam
- Vowel Letters
- Table 12-30. Malayalam Vowel Letters
- Two-Part Vowels
- Historic Characters
- Malayalam Orthographic Reform
- Rendering Malayalam
- Candrakkala
- Table 12-32. Malayalam Conjuncts
- Table 12-33. Candrakkala Examples
- Explicit Candrakkala
- Requesting Traditional Ligatures
- Requesting Open Forms of Conjuncts
- Table 12-34. Use of Joiners in Malayalam
- Anusvara
- Dot Reph
- Chillu Forms
- Special Cases Involving rra
- Table 12-35. Malayalam /rara/ and /uua/
- Table 12-36. Malayalam /nr/ and /nt/
- Legacy Chillu Sequences
- Table 12-37. Atomic Encoding of Malayalam Chillus
- Malayalam Numbers and Punctuation
- 13 South and Central Asia-II
- 13.1 Thaana
- 13.2 Sinhala
- 13.3 Newa
- 13.4 Tibetan
- General Principles of the Tibetan Script
- Figure 13-1. Tibetan Syllable Structure
- Consonants
- Vowels
- Coding Order
- Allographical Considerations
- Head Position "ra"
- Full-Form "ra" in Head Position.
- Subjoined Position "wa", "ya", and "ra"
- Halanta (Srog-Med).
- Line Breaking Considerations
- Tibetan Punctuation
- Svasti Signs
- Other Characters
- Tibetan Half-Numbers
- Tibetan Transliteration and Transcription of Other Languages
- Other Signs
- Traditional Text Formatting and Line Justification
- Figure 13-2. Justifying Tibetan Tseks
- Tibetan Shorthand Abbreviations (bskungs-yig) and Limitations of the Encoding
- 13.5 Mongolian
- History
- Directionality
- Encoding Principles
- Figure 13-3. Mongolian Glyph Convergence
- Cursive Joining
- Figure 13-4. Mongolian Consonant Ligation
- Figure 13-5. Mongolian Positional Forms
- Free Variation Selectors
- Figure 13-6. Mongolian Free Variation Selector
- Representative Glyphs
- Vowel Harmony
- Figure 13-7. Mongolian Gender Forms
- Narrow No-Break Space
- Mongolian Vowel Separator
- Figure 13-8. Mongolian Vowel Separator
- Baluda
- Numbers
- Punctuation
- Nirugu
- Syllable Boundary Marker
- 13.6 Limbu
- 13.7 Meetei Mayek
- 13.8 Mro
- 13.9 Warang Citi
- 13.10 Ol Chiki
- 13.11 Chakma
- 13.12 Lepcha
- 13.13 Saurashtra
- 14 South and Central Asia-III
- 14.1 Brahmi
- 14.2 Kharoshthi
- 14.3 Bhaiksuki
- 14.4 Phags-pa
- History
- Basic Structure
- Syllable Division
- Candrabindu
- Figure 14-5. Phags-pa Syllable Om
- Alternate Letters
- Numbers
- Punctuation
- Positional Variants
- Table 14-7. Phags-pa Positional Forms of I, U, E, and O
- Mirrored Variants
- Table 14-8. Contextual Glyph Mirroring in Phags-pa
- Table 14-9. Phags-pa Standardized Variants
- Figure 14-6. Phags-pa Reversed Shaping
- Cursive Joining
- 14.5 Marchen
- 14.6 Old Turkic
- 15 South and Central Asia-IV
- 16 Southeast Asia
- 16.1 Thai
- 16.2 Lao
- 16.3 Myanmar
- 16.4 Khmer
- Khmer: U+1780–U+17FF
- Principles of the Khmer Script
- Glottal Consonant
- Table 16-5. Independent Khmer Vowel Characters
- Subscript Consonants
- Subscript Independent Vowel Signs
- Consonant Registers
- Table 16-6. Two Registers of Khmer Consonants
- Encoding Principles
- Subscript Consonant Signs
- Table 16-7. Khmer Subscript Consonant Signs
- Dependent Vowel Signs
- Table 16-8. Khmer Composite Dependent Vowel Signs with Nikahit
- Independent Vowel Characters
- Subscript Independent Vowel Signs
- Table 16-9. Khmer Subscript Independent Vowel Signs
- Other Signs as Syllabic Components
- Ligatures
- Figure 16-1. Common Ligatures in Khmer
- Multiple Glyphs
- Figure 16-2. Common Multiple Forms in Khmer
- Characters Whose Use Is Discouraged
- Ordering of Syllable Components.
- Figure 16-3. Examples of Syllabic Order in Khmer
- Consonant Shifters
- Ligature Control
- Figure 16-4. Ligation in Muul Style in Khmer
- Spacing.
- Khmer Symbols: U+19E0–U+19FF
- 16.5 Tai Le
- 16.6 New Tai Lue
- 16.7 Tai Tham
- 16.8 Tai Viet
- 16.9 Kayah Li
- 16.10 Cham
- 16.11 Pahawh Hmong
- 16.12 Pau Cin Hau
- 17 Indonesia and Oceania
- 17.1 Philippine Scripts
- 17.2 Buginese
- 17.3 Balinese
- Structure
- Table 17-2. Balinese Base Consonants and Conjunct Forms
- Table 17-3. Sasak Extensions for Balinese
- Behavior of ra
- Figure 17-2. Writing dharma in Balinese
- Behavior of ra repa
- Rendering
- Table 17-4. Balinese Consonant Clusters with u and u:
- Nukta
- Ordering
- Punctuation
- Hyphenation
- Musical Symbols
- Modre Symbols
- 17.4 Javanese
- 17.5 Rejang
- 17.6 Batak
- 17.7 Sundanese
- 18 East Asia
- 18.1 Han
- CJK Unified Ideographs
- Blocks Containing Han Ideographs
- General Characteristics of Han Ideographs
- Principles of Han Unification
- Unification Rules
- Abstract Shape
- Two-Level Classification
- Ideographic Component Structure
- Figure 18-6. Ideographic Component Structure
- Figure 18-7. The Most Superior Node of an Ideographic Component
- Ideograph Features
- Uniqueness or Unification
- Spatial Positioning
- Examples
- Table 18-5. Ideographs Not Unified
- Table 18-6. Ideographs Unified
- Han Ideograph Arrangement
- Radical-Stroke Indices
- Mappings for Han Ideographs
- CJK Unified Ideographs Extension B: U+20000–U+2A6D6
- CJK Unified Ideographs Extension C: U+2A700–U+2B734
- CJK Unified Ideographs Extension D: U+2B740–U+2B81D
- CJK Unified Ideographs Extension E: U+2B820–U+2CEA1
- CJK Compatibility Ideographs: U+F900–U+FAFF
- CJK Compatibility Supplement: U+2F800–U+2FA1D
- Kanbun: U+3190–U+319F
- Symbols Derived from Han Ideographs
- CJK and KangXi Radicals: U+2E80–U+2FD5
- CJK Additions from HKSCS and GB 18030
- CJK Strokes: U+31C0–U+31EF
- 18.2 Ideographic Description Characters
- 18.3 Bopomofo
- 18.4 Hiragana and Katakana
- 18.5 Halfwidth and Fullwidth Forms
- 18.6 Hangul
- 18.7 Yi
- 18.8 Lisu
- 18.9 Miao
- 18.10 Tangut
- 19 Africa
- 19.1 Ethiopic
- Ethiopic: U+1200–U+137F
- Basic and Extended Ethiopic.
- Encoding Principles.
- Variant Glyph Forms.
- Labialized Subseries.
- Table 19-1. Labialized Forms in Ethiopic -WAA
- Table 19-2. Labialized Forms in Ethiopic -WE
- Keyboard Input.
- Syllable Names.
- Encoding Order and Sorting.
- Word Separators.
- Section Mark
- Diacritical Marks.
- Numbers.
- Ethiopic Extensions
- 19.2 Osmanya
- 19.3 Tifinagh
- 19.4 N’Ko
- 19.5 Vai
- 19.6 Bamum
- 19.7 Bassa Vah
- 19.8 Mende Kikakui
- 19.9 Adlam
- 20 Americas
- 21 Notational Systems
- 21.1 Braille
- 21.2 Western Musical Symbols
- Glyphs
- Symbols in Other Blocks
- Processing.
- Input Methods.
- Directionality.
- Figure 21-1. Examples of Specialized Music Layout
- Format Characters.
- Precomposed Note Characters.
- Figure 21-2. Precomposed Note Characters
- Alternative Noteheads.
- Figure 21-3. Alternative Noteheads
- Augmentation Dots and Articulation Symbols.
- Figure 21-4. Augmentation Dots and Articulation Symbols
- Ornamentation.
- Table 21-1. Examples of Ornamentation
- Gregorian
- Kievan
- 21.3 Byzantine Musical Symbols
- 21.4 Ancient Greek Musical Notation
- 21.5 Duployan
- 21.6 Sutton SignWriting
- 22 Symbols
- 22.1 Currency Symbols
- 22.2 Letterlike Symbols
- Letterlike Symbols: U+2100–U+214F
- Mathematical Alphanumeric Symbols: U+1D400–U+1D7FF
- Mathematical Alphabets
- Basic Set of Alphanumeric Characters.
- Additional Characters.
- Dotless Characters
- Figure 22-3. Wide Mathematical Accents
- Semantic Distinctions.
- Figure 22-4. Style Variants and Semantic Distinctions in Mathematics
- Mathematical Alphabets.
- Table 22-2. Mathematical Alphanumeric Symbols
- Compatibility Decompositions.
- Fonts Used for Mathematical Alphabets
- Arabic Mathematical Alphabetic Symbols: U+1EE00–U+1EEFF
- 22.3 Numerals
- 22.4 Superscript and Subscript Symbols
- 22.5 Mathematical Symbols
- Semantics.
- Mathematical Property
- Mathematical Operators: U+2200–U+22FF
- Supplements to Mathematical Symbols and Arrows
- Supplemental Mathematical Operators: U+2A00–U+2AFF
- Miscellaneous Mathematical Symbols-A: U+27C0–U+27EF
- Miscellaneous Mathematical Symbols-B: U+2980–U+29FF
- Miscellaneous Symbols and Arrows: U+2B00–U+2B7F
- Arrows: U+2190–U+21FF
- Supplemental Arrows
- Standardized Variants of Mathematical Symbols
- 22.6 Invisible Mathematical Operators
- 22.7 Technical Symbols
- Control Pictures: U+2400–U+243F
- Miscellaneous Technical: U+2300–U+23FF
- Keytop Labels.
- Floor and Ceiling
- Crops and Quine Corners
- Figure 22-9. Usage of Crops and Quine Corners
- Angle Brackets.
- APL Functional Symbols
- Symbol Pieces.
- Table 22-6. Use of Mathematical Symbol Pieces
- Horizontal Brackets
- Terminal Graphics Characters.
- Decimal Exponent Symbol
- Figure 22-10. Usage of the Decimal Exponent Symbol
- Dental Symbols.
- Metrical Symbols
- Electrotechnical Symbols
- User Interface Symbols
- Standards.
- Optical Character Recognition: U+2440–U+245F
- 22.8 Geometrical Symbols
- 22.9 Miscellaneous Symbols
- Rendering of Emoji Symbols
- Color Words in Unicode Character Names
- Miscellaneous Symbols and Pictographs
- Standards
- Weather Symbols
- Traffic Signs
- Dictionary and Map Symbols
- Plastic Bottle Material Code System.
- Recycling Symbol for Generic Materials.
- Universal Recycling Symbol.
- Paper Recycling Symbols.
- Gender Symbols
- Genealogical Symbols
- Game Symbols
- Animal Symbols
- Cultural Symbols
- Hand Symbols
- Emoji Modifiers
- Miscellaneous Symbols in Other Blocks
- Emoticons: U+1F600–U+1F64F
- Transport and Map Symbols: U+1F680–U+1F6FF
- Dingbats: U+2700–U+27BF
- Ornamental Dingbats: U+1F650–U+1F67F
- Alchemical Symbols: U+1F700–U+1F77F
- Mahjong Tiles: U+1F000–U+1F02F
- Domino Tiles: U+1F030–U+1F09F
- Playing Cards: U+1F0A0–U+1F0FF
- Yijing Hexagram Symbols: U+4DC0–U+4DFF
- Tai Xuan Jing Symbols: U+1D300–U+1D356
- Ancient Symbols: U+10190–U+101CF
- Phaistos Disc Symbols: U+101D0–U+101FF
- 22.10 Enclosed and Square
- 23 Special Areas and Format Characters
- 23.1 Control Codes
- 23.2 Layout Controls
- 23.3 Deprecated Format Characters
- 23.4 Variation Selectors
- 23.5 Private-Use Characters
- 23.6 Surrogates Area
- 23.7 Noncharacters
- 23.8 Specials
- 23.9 Tag Characters
- 24 About the Code Charts
- 24.1 Character Names List
- 24.2 CJK Ideographs
- 24.3 Hangul Syllables
- A Notational Conventions
- B Unicode Publications and Resources
- B.1 The Unicode Consortium
- B.2 Unicode Publications
- B.3 Unicode Technical Standards
- UTS #6: A Standard Compression Scheme for Unicode
- UTS #10: Unicode Collation Algorithm
- UTS #18: Unicode Regular Expressions
- UTS #22: Character Mapping Markup Language (CharMapML)
- UTS #35: Unicode Locale Data Markup Language (LDML)
- UTS #37: Unicode Ideographic Variation Database
- UTS #39: Unicode Security Mechanisms
- UTS #46: Unicode IDNA Compatibility Processing
- B.4 Unicode Technical Reports
- UTR #16: UTF-EBCDIC
- UTR #17: Unicode Character Encoding Model
- UTR #23: The Unicode Character Property Model
- UTR #25: Unicode Support for Mathematics
- UTR #26: Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8)
- UTR #33: Unicode Conformance Model
- UTR #36: Unicode Security Considerations
- UTR #50: Unicode Vertical Text Layout
- UTR #51: Unicode Emoji
- B.5 Unicode Technical Notes
- B.6 Other Unicode Online Resources
- Unicode Online Resources
- Unicode Website
- Unicode Anonymous FTP Site
- Charts
- Character Index
- Conferences
- E-mail Discussion List
- Emoji
- Emoji Charts
- FAQ (Frequently Asked Questions)
- Glossary
- Online Unicode Character Database
- Online Unihan Database
- Policies
- Unicode Common Locale Data Repository (CLDR)
- Updates and Errata
- Versions
- Where Is My Character?
- How to Contact the Unicode Consortium
- C Relationship to ISO/IEC 10646
- D Version History of the Standard
- Table D-1. Versions of Unicode and ISO/IEC 10646
- Table D-2. Allocation of Code Points by Type (Versions 1.0.0 to 3.0)
- Table D-3. Allocation of Code Points by Type (Versions 3.1 to 5.1)
- Table D-4. Allocation of Code Points by Type (Versions 5.2 to 7.0)
- Table D-5. Allocation of Code Points by Type (Versions 8.0 to 9.0)
- E Han Unification History
- E.1 Development of the URO
- E.2 Ideographic Rapporteur Group
- E.3 CJK Sources
- Table E-1. G Source Documentation
- Table E-2. H Source Documentation
- Table E-3. M Source Documentation
- Table E-4. T Source Documentation
- Table E-5. J Source Documentation
- Table E-6. K Source Documentation
- Table E-7. KP Source Documentation
- Table E-8. V Source Documentation
- Table E-9. U Source Documentation
- Omission of Repertoire for Some Sources
- F Documentation of CJK Strokes
- References
- I Index