think that your project settings are not being loaded. I ran a small test, and defining SCRAPY_SETTINGS_MODULE didn't work for me either; I don't know why. Instead of:

from scrapy.settings import Settings

I use something like:

from scrapy.utils.project import get_project_settings as Settings
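My guess at why this helps: a bare Settings() seems to hold only the built-in defaults, while get_project_settings() looks up SCRAPY_SETTINGS_MODULE and loads that module on top of them. Here is a minimal sketch of that difference in plain Python. It is only a model, not Scrapy's actual code: DEFAULTS, bare_settings, project_settings and the fake module name are all hypothetical names made up for the illustration.

```python
import importlib
import os
import sys
import types

# Hypothetical stand-in for Scrapy's built-in default settings.
DEFAULTS = {"USER_AGENT": "Scrapy (default)"}


def bare_settings():
    """Model of Settings(): defaults only, the environment is never consulted."""
    return dict(DEFAULTS)


def project_settings():
    """Model of get_project_settings(): overlay the module named in the environment."""
    settings = dict(DEFAULTS)
    module_path = os.environ.get("SCRAPY_SETTINGS_MODULE")
    if module_path:
        module = importlib.import_module(module_path)
        for name in dir(module):
            if name.isupper():  # only upper-case names are treated as settings
                settings[name] = getattr(module, name)
    return settings


# Register a fake settings module so the sketch runs without a real project.
fake = types.ModuleType("myprojectdir_settings_demo")
fake.USER_AGENT = "my-custom-agent"
sys.modules[fake.__name__] = fake
os.environ["SCRAPY_SETTINGS_MODULE"] = fake.__name__

print(bare_settings()["USER_AGENT"])     # Scrapy (default)
print(project_settings()["USER_AGENT"])  # my-custom-agent
```

If that model is right, setting the environment variable alone changes nothing until something actually reads it, which would match the default user agent being printed.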

PS: Hoffman, is this the recommended way? Also, since this seems to be a common issue with the example given in Common Practices, should the docs say more about it?

2013/3/25 Scrapy_user <>

The docs contain the following example:

import os
os.environ.setdefault('SCRAPY_SETTINGS_MODULE', 'myprojectdir.settings')
from twisted.internet import reactor
from scrapy.crawler import Crawler
from scrapy.settings import Settings
from scrapy import log
from testspiders.spiders.followall import FollowAllSpider

spider = FollowAllSpider(domain='')
crawler = Crawler(Settings())
crawler.configure()
crawler.crawl(spider)
crawler.start()
log.start()
reactor.run() # the script will block here

I defined a custom USER_AGENT in the settings, but when I do this in the script:
print Settings().get('USER_AGENT')

it gives me the default Scrapy user agent, not the one I defined in the myprojectdir.settings file.

The current directory is also on my PYTHONPATH, so myprojectdir.settings can be imported from an interactive Python shell.

What am I doing wrong here?

Thanks for this awesome software,


