Message109520
| Author |
amaury.forgeotdarc |
| Recipients |
amaury.forgeotdarc, ezio.melotti, lemburg |
| Date |
2010年07月08日.08:53:03 |
| SpamBayes Score |
8.704606e-05 |
| Marked as misclassified |
No |
| Message-id |
<1278579187.34.0.662391507791.issue9198@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
On wide unicode builds, '\U00010000'.isprintable() returns True, and repr() returns the character unmodified.
Is it a good behavior, given that very few fonts have can display this character?
Marc-Andre Lemburg wrote:
> The "printable" property is a Python invention, not a Unicode property,
> so we do have some freedom is deciding what is printable and what
> is not.
The current implementation considers printable """all the characters except those characters defined in the Unicode character database as following categories are considered printable.
* Cc (Other, Control)
* Cf (Other, Format)
* Cs (Other, Surrogate)
* Co (Other, Private Use)
* Cn (Other, Not Assigned)
* Zl Separator, Line ('\u2028', LINE SEPARATOR)
* Zp Separator, Paragraph ('\u2029', PARAGRAPH SEPARATOR)
* Zs (Separator, Space) other than ASCII space('\x20').
"""
We could also arbitrarily exclude all the non-BMP chars. |
|
History
|
|---|
| Date |
User |
Action |
Args |
| 2010年07月08日 08:53:07 | amaury.forgeotdarc | set | recipients:
+ amaury.forgeotdarc, lemburg, ezio.melotti |
| 2010年07月08日 08:53:07 | amaury.forgeotdarc | set | messageid: <1278579187.34.0.662391507791.issue9198@psf.upfronthosting.co.za> |
| 2010年07月08日 08:53:03 | amaury.forgeotdarc | link | issue9198 messages |
| 2010年07月08日 08:53:03 | amaury.forgeotdarc | create |
|