Windows-1255
MIME / IANA | windows-1255 |
---|---|
Alias(es) | cp1255 (Code page 1255) |
Language(s) | Hebrew, English |
Created by | Microsoft |
Standard | WHATWG Encoding Standard |
Classification | extended ASCII, Windows-125x |
Other related encoding(s) | ISO-8859-8 |
Windows-1255 (referred to as "ANSI" especially often) is a code page used under Microsoft Windows to write Hebrew. It is an almost compatible superset of ISO-8859-8 – most of the symbols are in the same positions (except for A4, which is 'sheqel sign' in Windows-1255 but 'generic currency sign' in ISO 8859-8 and except for DF, which is undefined in Windows-1255 but 'double low line' in ISO 8859-8), but Windows-1255 adds vowel-points and other signs in lower positions.
IBM uses code page 1255 (CCSID 1255, euro sign extended CCSID 5351, and the further extended CCSID 9447) for Windows-1255.[1] [2] [3] [4]
Modern applications prefer Unicode to Windows-1255, especially on the Internet;[5] meaning UTF-8, the dominant encoding for web pages (or UTF-16, while not on the Internet for security reasons). Windows-1255 is used by less than 0.1% of websites.[6]
Character set
[edit ]The following table shows Windows-1255. Each character is shown with its Unicode equivalent.
Windows-1255[7] [8] [9] [10] [11] [12] [13]Usage
[edit ]Windows-1255 Hebrew is always in logical order (as opposed to visual). Microsoft Hebrew products (Windows, Office and Internet Explorer) brought logically-ordered Hebrew to common use, with the result that Windows-1255 is the Hebrew encoding that can be found most on the Web, having ousted the visually ordered ISO-8859-8, and preferred to the logically ordered ISO-8859-8-I because it provides for vowel-points.
Relation to Unicode
[edit ]The Unicode Hebrew block (U+0590–U+05FF) follows Windows-1255 by encoding both letters and vowel-points in the same relative positions as Windows-1255. Unicode goes further in encoding cantillation marks in lower positions. Unicode Hebrew is always in logical order.
For modern applications UTF-8 or UTF-16 is a preferred encoding.
See also
[edit ]- 7-bit Hebrew under ISO 646
- Code page 862
- ISO 8859-8
- LMBCS-3
References
[edit ]- ^ "Code page 1255 information document". Archived from the original on 2016年03月04日.
- ^ "CCSID 1255 information document". Archived from the original on 2016年03月27日.
- ^ "CCSID 5351 information document". Archived from the original on 2014年11月29日.
- ^ "CCSID 9447 information document". Archived from the original on 2016年03月26日.
- ^ John, Nicholas A. (2013). "The Construction of the Multilingual Internet: Unicode, Hebrew, and Globalization". Journal of Computer-Mediated Communication. 18 (3): 321–338. doi:10.1111/jcc4.12015 . ISSN 1083-6101.
Background: the problem of Hebrew and the Internet
- ^ "Usage Statistics of Windows-1255 for Websites, January 2019". w3techs.com. Retrieved 2019年01月17日.
- ^ Unicode mapping table for Windows 1255
- ^ Unicode mappings of windows 1255 with "best fit"
- ^ Code Page CPGID 01255 (pdf) (PDF), IBM
- ^ Code Page CPGID 01255 (txt), IBM
- ^ International Components for Unicode (ICU), ibm-1255_P100-1995.ucm, 2002年12月03日
- ^ International Components for Unicode (ICU), ibm-1251_P100-1995.ucm, 2002年12月03日
- ^ International Components for Unicode (ICU), ibm-5351_P100-1998.ucm, 2002年12月03日