This issue tracker has been migrated to GitHub ,
and is currently read-only.
For more information,
see the GitHub FAQs in the Python's Developer Guide.
Created on 2012年07月16日 16:04 by keul, last changed 2022年04月11日 14:57 by admin. This issue is now closed.
| Messages (3) | |||
|---|---|---|---|
| msg165630 - (view) | Author: Luca Fabbri (keul) | Date: 2012年07月16日 16:04 | |
Using the unicodedata.decomposition function on characters like \u201c and \u201d I didn't get back the classic quote character (").
This is a very common error when text is taken from Microsoft Word (where in italian language a couple of quoting character in a sentence like "foo" is automatically changed to "foo").
|
|||
| msg165636 - (view) | Author: R. David Murray (r.david.murray) * (Python committer) | Date: 2012年07月16日 16:43 | |
I don't understand why you would expect to get a ". The unicode characters aren't "s. As far as I can see (from, for example, http://codepoints.net/U+201C), Python is behaving as expected here. |
|||
| msg165644 - (view) | Author: Matthew Barnett (mrabarnett) * (Python triager) | Date: 2012年07月16日 17:48 | |
A codepoint such as "é" ("\N{LATIN SMALL LETTER E WITH ACUTE}") can be decomposed to "\u0065\u0301" ("\N{LATIN SMALL LETTER E}\N{COMBINING ACUTE ACCENT"), but "\u201c" ("\N{LEFT DOUBLE QUOTATION MARK}") and "\u201d" ("\N{RIGHT DOUBLE QUOTATION MARK}") cannot be decomposed.
|
|||
| History | |||
|---|---|---|---|
| Date | User | Action | Args |
| 2022年04月11日 14:57:32 | admin | set | github: 59577 |
| 2012年07月16日 17:49:29 | pitrou | set | status: open -> closed |
| 2012年07月16日 17:48:26 | mrabarnett | set | status: pending -> open nosy: + mrabarnett messages: + msg165644 |
| 2012年07月16日 16:43:42 | r.david.murray | set | status: open -> pending nosy: + r.david.murray messages: + msg165636 resolution: not a bug stage: resolved |
| 2012年07月16日 16:04:22 | keul | create | |