python+libxml2+scrapy AttributeError: 'module' object has no attribute 'HTML_PARSE_RECOVER'

Dmitry Arsentiev dmarsentev at gmail.com
Wed Aug 15 08:49:04 EDT 2012


Hello.
Has anybody already run into a problem like this?
AttributeError: 'module' object has no attribute 'HTML_PARSE_RECOVER'
When I run scrapy, I get
 File "/usr/local/lib/python2.7/site-packages/scrapy/selector/factories.py", line 14, in <module>
 libxml2.HTML_PARSE_NOERROR + \
AttributeError: 'module' object has no attribute 'HTML_PARSE_RECOVER'
When I run
 python -c 'import libxml2; libxml2.HTML_PARSE_RECOVER'
I get
Traceback (most recent call last):
 File "<string>", line 1, in <module>
AttributeError: 'module' object has no attribute 'HTML_PARSE_RECOVER'
How can I fix it?
Python 2.7
libxml2-python 2.6.9
2.6.11-gentoo-r6
I will be grateful for any help.
DETAILS:
scrapy crawl lgz -o items.json -t json
Traceback (most recent call last):
 File "/usr/local/bin/scrapy", line 4, in <module>
 execute()
 File "/usr/local/lib/python2.7/site-packages/scrapy/cmdline.py", line 112, in execute
 cmds = _get_commands_dict(inproject)
 File "/usr/local/lib/python2.7/site-packages/scrapy/cmdline.py", line 37, in _get_commands_dict
 cmds = _get_commands_from_module('scrapy.commands', inproject)
 File "/usr/local/lib/python2.7/site-packages/scrapy/cmdline.py", line 30, in _get_commands_from_module
 for cmd in _iter_command_classes(module):
 File "/usr/local/lib/python2.7/site-packages/scrapy/cmdline.py", line 21, in _iter_command_classes
 for module in walk_modules(module_name):
 File "/usr/local/lib/python2.7/site-packages/scrapy/utils/misc.py", line 65, in walk_modules
 submod = __import__(fullpath, {}, {}, [''])
 File "/usr/local/lib/python2.7/site-packages/scrapy/commands/shell.py", line 8, in <module>
 from scrapy.shell import Shell
 File "/usr/local/lib/python2.7/site-packages/scrapy/shell.py", line 14, in <module>
 from scrapy.selector import XPathSelector, XmlXPathSelector, HtmlXPathSelector
 File "/usr/local/lib/python2.7/site-packages/scrapy/selector/__init__.py", line 30, in <module>
 from scrapy.selector.libxml2sel import *
 File "/usr/local/lib/python2.7/site-packages/scrapy/selector/libxml2sel.py", line 12, in <module>
 from .factories import xmlDoc_from_html, xmlDoc_from_xml
 File "/usr/local/lib/python2.7/site-packages/scrapy/selector/factories.py", line 14, in <module>
 libxml2.HTML_PARSE_NOERROR + \
AttributeError: 'module' object has no attribute 'HTML_PARSE_RECOVER'
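
For anyone trying to reproduce this, here is a slightly fuller version of the one-line check above, as a rough sketch only. It reports which file the libxml2 module was loaded from and which of the two constants named in the traceback are missing. The helper names (missing_attributes, report, REQUIRED) are made up for illustration and are not part of libxml2 or scrapy.

```python
def missing_attributes(module, names):
    """Return the entries of `names` that `module` does not define."""
    return [name for name in names if not hasattr(module, name)]


# Only the constants that appear in the traceback above; scrapy may use others.
REQUIRED = ["HTML_PARSE_RECOVER", "HTML_PARSE_NOERROR"]


def report(module):
    """Build a one-line summary of where `module` lives and what it lacks."""
    missing = missing_attributes(module, REQUIRED)
    location = getattr(module, "__file__", "<unknown location>")
    if missing:
        return "%s (%s) lacks: %s" % (module.__name__, location, ", ".join(missing))
    return "%s (%s) defines all required constants" % (module.__name__, location)


if __name__ == "__main__":
    try:
        import libxml2  # the binding scrapy imports in factories.py
    except ImportError:
        print("libxml2 Python bindings are not importable")
    else:
        print(report(libxml2))
```

The reported file path should show whether Python is picking up the binding I expect or some other copy on the import path.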


More information about the Python-list mailing list