Message132223
| Author |
ezio.melotti |
| Recipients |
belopolsky, eric.araujo, ezio.melotti, fdrake, pluskid, v+python |
| Date |
2011年03月26日.10:17:23 |
| SpamBayes Score |
4.92448e-06 |
| Marked as misclassified |
No |
| Message-id |
<1301134644.32.0.515359058579.issue7311@psf.upfronthosting.co.za> |
| In-reply-to |
| Content |
The attached patch changes the regex to allow non-ascii letters in attribute values (using \w with the re.UNICODE flag instead of [a-zA-Z0-9_]).
Using [^>\s] (or even [^> ]) might be OK too, since that's what browsers seem to use (e.g. Firefox and Chrome show "テ<ス☃ト -d-fg" as title of '<a href="" title=テ<ス☃ト -d-fg href="">foo</a>', including the non-ascii spaces in the middle). |
|