Message324100
| Author |
michael-o |
| Recipients |
Michael.Felt, michael-o, terry.reedy |
| Date |
2018-08-25.20:17:09 |
| SpamBayes Score |
-1.0 |
| Marked as misclassified |
Yes |
| Message-id |
<1535228229.69.0.56676864532.issue34403@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
I think you are absolutely right.
> In any case, it seems to be broken for any system that does not have UTF-8 as default.
You likely mean ASCII. Python assumes that LANG=C means ASCII, which is not the case on AIX and HP-UX.
Your patch looks reasonable, I will try this on Monday. The problem is that there is no roman8 codec in Python. Maybe ISO-8859-1 will do it for the test, but I am still eager to add one.
> I mention this because it seems neither roman8 nor roman9 have 'official' iso names or alias (correct me if I am wrong).
There are no ISO names because this is not an ISO encoding. This is an HP invention aka hp-roman8 (roman8, ibm-1051, r8, Cp1051).
Edit: there is roman8 support: https://github.com/python/cpython/blob/e42b705188271da108de42b55d9344642170aa2b/Lib/encodings/hp_roman8.py as well as aliases.
There are a few aliases missing: cp1051, ibm1051 and hp-roman8. This needs an additional PR. |
|
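The alias question raised in the message can be checked directly against a given interpreter. The following is a minimal sketch, not part of the proposed patch: the helper names `resolve` and `alias_report` and the `CANDIDATES` list are hypothetical, and only the standard `codecs.lookup` call is assumed. Which names resolve may differ between Python versions, so no expected output is shown.

```python
import codecs

# Names taken from the message above; whether each one resolves
# depends on the Python version being tested.
CANDIDATES = ["hp_roman8", "hp-roman8", "roman8", "r8", "cp1051", "ibm1051"]


def resolve(name):
    """Return the canonical codec name for `name`, or None if lookup fails."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return None


def alias_report(names=CANDIDATES):
    """Map each candidate name to its canonical codec name (or None)."""
    return {name: resolve(name) for name in names}


if __name__ == "__main__":
    for name, canonical in alias_report().items():
        print(f"{name:10} -> {canonical or 'not found'}")
```

Any name reported as "not found" would be a candidate for the additional aliases PR mentioned above.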
History
|
|---|
| Date |
User |
Action |
Args |
| 2018-08-25 20:17:09 | michael-o | set | recipients:
+ michael-o, terry.reedy, Michael.Felt |
| 2018-08-25 20:17:09 | michael-o | set | messageid: <1535228229.69.0.56676864532.issue34403@psf.upfronthosting.co.za> |
| 2018-08-25 20:17:09 | michael-o | link | issue34403 messages |
| 2018-08-25 20:17:09 | michael-o | create |
|