[Python-Dev] PEP 383: Non-decodable Bytes in System Character Interfaces

"Martin v. Löwis" martin at v.loewis.de
Wed Apr 29 22:15:12 CEST 2009


> Sure. However, that requires you to provide meaningful, reproducible
> counter-examples, rather than a stenographic formulation that might
> hint some problem you apparently see (which I believe is just not
> there).
>>> Well, here's another one: PEP 383 would disallow UTF-8 encodings of half
> surrogates. But such encodings are currently supported by Python, and
> they are used as part of CESU-8 coding. That's, in fact, a common way
> of converting UTF-16 to UTF-8. How are you going to deal with existing
> code that relies on being able to code half surrogates as UTF-8?

Can you please elaborate? What code specifically are you talking about?
Regards,
Martin


More information about the Python-Dev mailing list

AltStyle によって変換されたページ (->オリジナル) /