Message152316
| Author |
ezio.melotti |
| Recipients |
ezio.melotti, gvanrossum, kennyluck, lemburg, loewis, tchrist, vstinner |
| Date |
2012-01-30.08:51:05 |
| Message-id |
<1327913466.02.0.834023803478.issue12892@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
Thanks for the patch!
> * fix an error in the error handler for utf-16-le. (In Python 3.2,
> b'\xdc\x80\x00\x41'.decode('utf-16-be', 'ignore') returns "\x00"
> instead of "A" for some reason)
This should probably be done in a separate patch that will be applied to 3.2/3.3 (assuming that it can go into 3.2). Rejecting surrogates will go into 3.3 only. (Note that a lot of Unicode-related code changed between 3.2 and 3.3.)
> Should we really reject lone surrogates for UTF-7?
No, I meant only UTF-8/16/32; UTF-7 is fine as is. |
|
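For readers who want to check the behaviour discussed above, here is a minimal sketch. It assumes a recent CPython 3 release in which the surrogate rejection for UTF-8/16/32 has already landed; older interpreters (3.2 in particular) may behave differently. The helper name `can_encode` and the choice of U+DC80 as the test character are illustrative, not part of the patch.

```python
# Sketch only: assumes a recent CPython 3 where the UTF-8/16/32 codecs
# reject lone surrogates. `can_encode` is a hypothetical helper.
LONE_SURROGATE = "\udc80"  # a low surrogate with no preceding high surrogate


def can_encode(text, codec):
    """Return True if `text` encodes with `codec` under the strict handler."""
    try:
        text.encode(codec)
    except UnicodeEncodeError:
        return False
    return True


# Check each codec family mentioned in the message.
results = {
    codec: can_encode(LONE_SURROGATE, codec)
    for codec in ("utf-8", "utf-16-le", "utf-32-le", "utf-7")
}
for codec, ok in results.items():
    print(codec, "accepts" if ok else "rejects", "a lone surrogate")

# The decode case quoted in the message: with the 'ignore' handler, the
# invalid unit b'\xdc\x80' should be skipped and b'\x00\x41' decoded.
decoded = b"\xdc\x80\x00\x41".decode("utf-16-be", "ignore")
print(repr(decoded))
```

If a lone surrogate has to survive a round trip through UTF-8/16/32, the `surrogatepass` error handler is the explicit opt-in for that.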