"convert" string to bytes without changing data (encoding)
Michael Ströder
michael at stroeder.com
Fri Mar 30 03:04:49 EDT 2012
Steven D'Aprano wrote:
> On 2012年3月29日 17:36:34 +0000, Prasad, Ramit wrote:
>>>>> Technically, ASCII goes up to 256 but they are not A-z letters.
>>>>>>> Technically, ASCII is 7-bit, so it goes up to 127.
>>>>> No, ASCII only defines 0-127. Values >=128 are not ASCII.
>>>>>> >From https://en.wikipedia.org/wiki/ASCII:
>>>>>> ASCII includes definitions for 128 characters: 33 are non-printing
>>> control characters (now mostly obsolete) that affect how text and
>>> space is processed and 95 printable characters, including the space
>>> (which is considered an invisible graphic).
>>>>>> Doh! I was mistaking extended ASCII for ASCII. Thanks for the
>> correction.
>> There actually is no such thing as "extended ASCII" -- there is a whole
> series of many different "extended ASCIIs". If you look at the encodings
> available in (for example) Thunderbird, many of the ISO-8859-* and
> Windows-* encodings are "extended ASCII" in the sense that they extend
> ASCII to include bytes 128-255. Unfortunately they all extend ASCII in a
> different way (hence they are different encodings).
Yupp.
Looking at RFC 1345 some years ago (while having to deal with EBCDIC) made
this all pretty clear to me. I appreciate that someone did this heavy work of
collecting historical encodings.
Ciao, Michael.
More information about the Python-list
mailing list