Message107468
| Author |
loewis |
| Recipients |
lars.gustaebel, lemburg, loewis, vstinner |
| Date |
2010-06-10.18:40:55 |
| SpamBayes Score |
0.010755782 |
| Marked as misclassified |
No |
| Message-id |
<4C1131B6.6080208@v.loewis.de> |
| In-reply-to |
<4C101F12.7090102@egenix.com> |
| Content |
>> 7-zip encodes "à" (U+00e0) as 0x85 (1 byte), and "é" (U+00e9) as 0x82 (1 byte). I don't know this encoding.
>
> That's an old DOS code page used in Europe: CP850
There is a good chance that they use it because it is the OEM code page
on the system.
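
As a quick sanity check of the byte values quoted above, here is a minimal sketch using only Python's built-in codecs. The helper name `encodings_table` is made up for illustration; it just compares how each character is encoded under cp850, cp1252 and UTF-8.

```python
# Bytes reported in this thread for the file names written by 7-zip.
observed = {"à": b"\x85", "é": b"\x82"}

def encodings_table(char):
    """Return the encoded bytes of `char` under the encodings discussed here.

    Hypothetical helper, for illustration only.
    """
    return {enc: char.encode(enc) for enc in ("cp850", "cp1252", "utf-8")}

for char, byte in observed.items():
    table = encodings_table(char)
    # True when the observed byte matches the cp850 encoding of the character.
    print(repr(char), table, table["cp850"] == byte)
```

The single bytes match cp850, while cp1252 and UTF-8 produce different byte sequences for the same characters, which is consistent with the OEM code page explanation.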
In any case, I think that both cp850 and cp1252 are inherently incorrect
for tarfiles (despite these tools using them). tar is a POSIX thing, and
these encodings have nothing to do with POSIX.
So using UTF-8 is a reasonable choice, IMO. The other reasonable choice
would be ASCII. |
|