Message122313
| Author |
belopolsky |
| Recipients |
belopolsky, eric.smith, ezio.melotti, lemburg, pitrou |
| Date |
2010年11月24日.21:23:57 |
| SpamBayes Score |
1.7583102e-11 |
| Marked as misclassified |
No |
| Message-id |
<AANLkTikTkKbPG-D-jOe1YzzTZivg+WpV-CtkpctBcXKw@mail.gmail.com> |
| In-reply-to |
<4CED7798.3080901@egenix.com> |
| Content |
On Wed, Nov 24, 2010 at 3:37 PM, Marc-Andre Lemburg
<report@bugs.python.org> wrote:
..
> I don't think we should change that for the formatting methods.
That's a reasonable position. What about
'Lo'
>>> '\N{OLD ITALIC LETTER A}'.isalpha()
False
the str.isalpha() method is underspecified in the reference manual,
but a comment in unicodectype.c describes Py_UNICODE_ISALPHA as
follows:
/* Returns 1 for Unicode characters having the category 'Ll', 'Lu',
'Lt',
'Lo' or 'Lm', 0 otherwise. */
I don't have a wide build handy, but I am fairly sure '\N{OLD ITALIC
LETTER A}'.isalpha() would produce True there. The result above is
simply consequence of surrogates considered to be non-letters:
[False, False] |
|