"convert" string to bytes without changing data (encoding)

Michael Ströder michael at stroeder.com
Fri Mar 30 03:04:49 EDT 2012


Steven D'Aprano wrote:
> On 2012年3月29日 17:36:34 +0000, Prasad, Ramit wrote:
>>>>> Technically, ASCII goes up to 256 but they are not A-z letters.
>>>>>>> Technically, ASCII is 7-bit, so it goes up to 127.
>>>>> No, ASCII only defines 0-127. Values >=128 are not ASCII.
>>>>>> >From https://en.wikipedia.org/wiki/ASCII:
>>>>>> ASCII includes definitions for 128 characters: 33 are non-printing
>>> control characters (now mostly obsolete) that affect how text and
>>> space is processed and 95 printable characters, including the space
>>> (which is considered an invisible graphic).
>>>>>> Doh! I was mistaking extended ASCII for ASCII. Thanks for the
>> correction.
>> There actually is no such thing as "extended ASCII" -- there is a whole 
> series of many different "extended ASCIIs". If you look at the encodings 
> available in (for example) Thunderbird, many of the ISO-8859-* and 
> Windows-* encodings are "extended ASCII" in the sense that they extend 
> ASCII to include bytes 128-255. Unfortunately they all extend ASCII in a 
> different way (hence they are different encodings).

Yupp.
Looking at RFC 1345 some years ago (while having to deal with EBCDIC) made
this all pretty clear to me. I appreciate that someone did this heavy work of
collecting historical encodings.
Ciao, Michael.


More information about the Python-list mailing list

AltStyle によって変換されたページ (->オリジナル) /