Message242586
| Author |
malin |
| Recipients |
ezio.melotti, hyeshik.chang, lemburg, loewis, malin, serhiy.storchaka, vstinner |
| Date |
2015年05月05日.00:52:19 |
| SpamBayes Score |
-1.0 |
| Marked as misclassified |
Yes |
| Message-id |
<1430787140.94.0.235676634957.issue24117@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
I found another bug in hz codec.
hz encoding uses 7-bit ASCII to represent Chinese characters, it was popular in USENET networks in the late 1980s and early 1990s.
I will do more check and fix them together, then I will invite you to review the patch.
u = 'hi~python'
b = u.encode('hz') # bug in this step, the right sequence should be b"hi~~python"
print(b) # the output is b"hi~python"
u = b.decode('hz') # so can't decode, UnicodeDecodeError raised
print(u) |
|