Author amaury.forgeotdarc
Recipients amaury.forgeotdarc, ezio.melotti, lemburg
Date 2010-07-08.08:53:03
SpamBayes Score 8.704606e-05
Marked as misclassified No
Message-id <1278579187.34.0.662391507791.issue9198@psf.upfronthosting.co.za>
In-reply-to
Content
On wide Unicode builds, '\U00010000'.isprintable() returns True, and repr() returns the character unmodified.
Is this good behavior, given that very few fonts can display this character?
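The behavior described above can be checked with a short snippet. This is only a sketch: it assumes a Python 3 interpreter that can represent non-BMP code points as single characters (a wide build at the time of this message; any build from Python 3.3 onward), and the variable names are illustrative.

```python
# U+10000 is the first code point outside the Basic Multilingual Plane.
ch = "\U00010000"

# str.isprintable() reports whether repr() may show the character as-is.
printable = ch.isprintable()

# repr() leaves printable characters unescaped, so the result is just the
# character wrapped in quotes rather than a '\U00010000' escape sequence.
unescaped = repr(ch) == "'" + ch + "'"

# Print only booleans so the output does not depend on terminal fonts.
print(printable, unescaped)
```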
Marc-Andre Lemburg wrote:
> The "printable" property is a Python invention, not a Unicode property,
> so we do have some freedom in deciding what is printable and what
> is not.
The current implementation uses this definition of printable: """All characters except those defined in the Unicode character database under the following categories are considered printable.
 * Cc (Other, Control)
 * Cf (Other, Format)
 * Cs (Other, Surrogate)
 * Co (Other, Private Use)
 * Cn (Other, Not Assigned)
 * Zl (Separator, Line) ('\u2028', LINE SEPARATOR)
 * Zp (Separator, Paragraph) ('\u2029', PARAGRAPH SEPARATOR)
 * Zs (Separator, Space) other than ASCII space ('\x20').
"""
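The category rule quoted above can be approximated in pure Python with the unicodedata module. This is a sketch for illustration only, not the interpreter's actual implementation; the names NON_PRINTABLE_CATEGORIES and is_printable_sketch are hypothetical.

```python
import unicodedata

# General categories that the quoted definition treats as non-printable.
NON_PRINTABLE_CATEGORIES = {"Cc", "Cf", "Cs", "Co", "Cn", "Zl", "Zp", "Zs"}


def is_printable_sketch(ch):
    """Apply the quoted category rule to a single character (hypothetical helper)."""
    if ch == " ":
        # ASCII space is the one Zs character that stays printable.
        return True
    return unicodedata.category(ch) not in NON_PRINTABLE_CATEGORIES
```

Under this rule, '\U00010000' (general category Lo) counts as printable, while '\u2028' (Zl) and the no-break space '\xa0' (Zs) do not.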
We could also arbitrarily exclude all the non-BMP chars.
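As a rough illustration of that alternative, the following sketch layers a BMP-only restriction on top of the existing str.isprintable(). The function name is hypothetical, and this shows only what the proposal would mean, not a change that was made.

```python
def is_printable_bmp_only(ch):
    """Treat a character as printable only if it lies inside the BMP.

    Hypothetical helper: code points above U+FFFF are rejected outright,
    everything else defers to the built-in str.isprintable().
    """
    return ord(ch) <= 0xFFFF and ch.isprintable()
```

With this rule, 'a' would remain printable, while '\U00010000' would be treated as non-printable and escaped by a repr() built on it.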
History
Date User Action Args
2010-07-08 08:53:07 amaury.forgeotdarc set recipients: + amaury.forgeotdarc, lemburg, ezio.melotti
2010-07-08 08:53:07 amaury.forgeotdarc set messageid: <1278579187.34.0.662391507791.issue9198@psf.upfronthosting.co.za>
2010-07-08 08:53:03 amaury.forgeotdarc link issue9198 messages
2010-07-08 08:53:03 amaury.forgeotdarc create
