Message76557
| Author |
loewis |
| Recipients |
lemburg, loewis, nathanlmiles, rsc, terry.reedy, timehorse |
| Date |
2008年11月28日.21:33:40 |
| SpamBayes Score |
0.00015994108 |
| Marked as misclassified |
No |
| Message-id |
<1227908021.62.0.877440749758.issue1693050@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
Unicode TR#18 defines \w as a shorthand for
\p{alpha}
\p{gc=Mark}
\p{digit}
\p{gc=Connector_Punctuation}
which would include all marks. We should recursively check whether we
follow the recommendation (e.g. \p{alpha} refers to all character having
the Alphabetic derived core property, which is Lu+Ll+Lt+Lm+Lo+Nl +
Other_Alphabetic, where Other_Alphabetic is a selected list of
additional character - all from Mn/Mc) |
|