Message272718
| Author |
steve.dower |
| Recipients |
BreamoreBoy, David.Sankel, Drekin, Jonitis, akira, amaury.forgeotdarc, christian.heimes, christoph, davidsarah, dead1ne, escapewindow, ezio.melotti, flox, giampaolo.rodola, hippietrail, lemburg, lilydjwg, mark, martin.panter, mhammond, ncoghlan, ned.deily, paul.moore, piotr.dobrogost, pitrou, santoso.wijaya, smerlin, ssbarnea, steve.dower, stijn, terry.reedy, tim.golden, tzot, v+python, wiz21 |
| Date |
2016年08月15日.04:52:28 |
| SpamBayes Score |
-1.0 |
| Marked as misclassified |
Yes |
| Message-id |
<1471236748.78.0.102773005894.issue1602@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
For more info here, cgi.parse has code like this:
def parse(fp, ...):
if fp is None:
fp = sys.stdin
encoding = getattr(fp, 'encoding', 'latin-1')
# later on...
return urllib.parse.parse_qs(a_str, encoding=encoding, ...)
As an easy hack, I added this after assigning encoding:
if len(' '.encode(encoding, errors='replace')) > 1:
encoding = 'latin-1'
I have no idea if this is a good idea or not. The current behaviour of mojibake in the parsed result is certainly worse, since the choice of utf-16-le is entirely contained within the parse() function. |
|
History
|
|---|
| Date |
User |
Action |
Args |
| 2016年08月15日 04:52:28 | steve.dower | set | recipients:
+ steve.dower, lemburg, mhammond, terry.reedy, paul.moore, tzot, amaury.forgeotdarc, ncoghlan, pitrou, giampaolo.rodola, christian.heimes, tim.golden, mark, ned.deily, christoph, ezio.melotti, v+python, hippietrail, ssbarnea, flox, davidsarah, santoso.wijaya, akira, BreamoreBoy, David.Sankel, smerlin, lilydjwg, martin.panter, piotr.dobrogost, Drekin, wiz21, stijn, Jonitis, escapewindow, dead1ne |
| 2016年08月15日 04:52:28 | steve.dower | set | messageid: <1471236748.78.0.102773005894.issue1602@psf.upfronthosting.co.za> |
| 2016年08月15日 04:52:28 | steve.dower | link | issue1602 messages |
| 2016年08月15日 04:52:28 | steve.dower | create |
|