On 11 Jul 2013 10:24, <fronagzen at gmail.com> wrote: >> Actually, I don't think etree has a HTML parser. And I would counter-recommend lxml if speed is an issue: BeautifulSoup takes a looooong time to parse a large document. >> On Thursday, July 11, 2013 5:08:04 PM UTC+8, Fábio Santos wrote: > > > > Use an HTML parser. I strongly recommend BeautifulSoup but I think etree has an HTML parser too. I am not sure.. I meant lxml. My apologies. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.python.org/pipermail/python-list/attachments/20130711/3d9c02c8/attachment.html>