Halfwidth and Fullwidth Forms (Unicode block)
Halfwidth and Fullwidth Forms | |
---|---|
Range | U+FF00..U+FFEF (240 code points) |
Plane | BMP |
Scripts | Hangul (52 char.) Katakana (55 char.) Latin (52 char.) Common (66 char.) |
Symbol sets | Variant width characters |
Assigned | 225 code points |
Unused | 15 reserved code points |
Unicode version history | |
1.0.0 (1991) | 216 (+216) |
1.1 (1993) | 223 (+7) |
3.2 (2002) | 225 (+2) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] [3] |
Halfwidth and Fullwidth Forms is a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.[4]
Range U+FF01–FF5E reproduces the characters of ASCII 21 to 7E as fullwidth forms. U+FF00 does not correspond to a fullwidth ASCII 20 (space character), since that role is already fulfilled by U+3000 "ideographic space".
Range U+FF61–FF9F encodes halfwidth forms of katakana and related punctuation in a transposition of A1 to DF in the JIS X 0201 encoding – see half-width kana.
The range U+FFA0–FFDC encodes halfwidth forms of compatibility jamo characters for Hangul, in a transposition of their 1974 standard layout. It is used in the mapping of some IBM encodings for Korean, such as IBM code page 933, which allows the use of the Shift Out and Shift In characters to shift to a double-byte character set.[5] Since the double-byte character set could contain compatibility jamo, halfwidth variants are needed to provide round-trip compatibility.[6] [7]
Range U+FFE0–FFEE includes fullwidth and halfwidth symbols.
Block
[edit ]HF ᄀ ᄁ ᆪ ᄂ ᆬ ᆭ ᄃ ᄄ ᄅ ᆰ ᆱ ᆲ ᆳ ᆴ ᆵ
The block has variation sequences defined for East Asian punctuation positional variants.[8] [9] They use U+FE00 VARIATION SELECTOR-1 (VS01) and U+FE01 VARIATION SELECTOR-2 (VS02):
Variation sequences for punctuation alignmentAn additional variant is defined for a fullwidth zero with a short diagonal stroke: U+FF10 FULLWIDTH DIGIT ZERO, U+FE00 VS1 (0︀).[10] [9]
History
[edit ]The following Unicode-related documents record the purpose and process of defining specific characters in the Halfwidth and Fullwidth Forms block:
Version | Final code points[a] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
1.0.0 | U+FF01..FF5E, FF61..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6 | 216 | (to be determined) | ||
N4403 (pdf, doc) | Umamaheswaran, V. S. (2014年01月28日), "Resolution M61.01", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013年06月10日/14 | ||||
L2/17-056 | Lunde, Ken (2017年02月13日), Proposal to add standardized variation sequences | ||||
L2/17-436 | Lunde, Ken (2018年01月21日), Proposal to add standardized variation sequences for fullwidth East Asian punctuation | ||||
L2/18-039 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Cook, Richard (2018年01月19日), "24. Fullwidth East Asian Punctuation", Recommendations to UTC #154 January 2018 on Script Proposals | ||||
L2/17-362 | Moore, Lisa (2018年02月02日), "B.4.1 New Proposal to add standardized variation sequence for U+FF10 FULL WIDTH DIGIT ZERO", UTC #153 Minutes | ||||
L2/18-115 | Moore, Lisa (2018年05月09日), "Consensus 154-C17", UTC #155 Minutes, Add 16 standardized variation sequences based on L2/17-436R, for Unicode 12.0. | ||||
L2/19-055 | Iancu, Laurențiu (2019年01月14日), Proposed Changes in the Segmentation Property Values for Fullwidth Digits | ||||
L2/19-008 | Moore, Lisa (2019年02月08日), "B.11.11.1.2 Proposed changes in the segmentation property values for fullwidth digits", UTC #158 Minutes | ||||
1.1 | U+FFE8..FFEE | 7 | (to be determined) | ||
3.2 | U+FF5F..FF60 | 2 | L2/99-052 | Freytag, Asmus (1999年02月05日), The math pieces from the symbol font | |
L2/01-033 | Karlsson, Kent; Freytag, Asmus (2001年01月16日), Disunify braces/brackets for math, computing science, and Z notation from similar-looking CJK braces/brackets | ||||
L2/01-159 | N2344 | Ad-hoc report on Mathematical Symbols, 2001年04月03日 | |||
L2/01-157 | N2345R | Karlsson, Kent (2001年04月04日), Proposal to disunify certain fencing CJK punctuation marks from similar-looking Math fences | |||
L2/01-168 | Whistler, Ken (2001年04月10日), Bracket Disunification & Normalization Hell | ||||
L2/01-012R | Moore, Lisa (2001年05月21日), "Disunifying Braces and Brackets", Minutes UTC #86 in Mountain View, Jan 2001 | ||||
L2/01-223 | Suignard, Michel (2001年05月23日), Discussion of Issues Regarding Bracket Disunification | ||||
L2/01-184R | Moore, Lisa (2001年06月18日), "Motion 87-M21", Minutes from the UTC/L2 meeting, Reverse the decision made in motion 86-M6 not to disunify brackets. | ||||
L2/01-317 | Suignard, Michel (2001年08月14日), Bracket Disunification & Normalization | ||||
L2/01-295R | Moore, Lisa (2001年11月06日), "Bracket Disunification and Normalization", Minutes from the UTC/L2 meeting #88 | ||||
L2/02-154 | N2403 | Umamaheswaran, V. S. (2002年04月22日), "Resolution M41.1", Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001年10月15日/19 | |||
|
See also
[edit ]- CJK Symbols and Punctuation (Unicode block)
- Hangul Jamo (Unicode block)
- Katakana (Unicode block)
- Latin script in Unicode
- Enclosed Alphanumerics - bullet point sequences, some appear as full width (e.g. 1.,⓵,(1),(a),a)
References
[edit ]- ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992年11月03日. Retrieved 2016年07月09日.
- ^ "Unicode character database". The Unicode Standard. Retrieved 2023年07月26日.
- ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023年07月26日.
- ^ "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
- ^ "ICU Demonstration - Converter Explorer". demo.icu-project.org. Retrieved 7 May 2018.
- ^ "Halfwidth and Fullwidth blame".
- ^ "Conversion Data - Old location of the ICU User Guide".
- ^ Lunde, Ken (2018年01月21日). "L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF).
- ^ a b "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
- ^ Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015年10月30日). "L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set" (PDF).