Hi,
I am trying to download PDF files, so I tried to follow the files.py you posted. I followed your ietf example.
I didn't get any files back in the path I specified.
This is what I got when I ran scrapy crawl ietf:
2013-12-03 13:04:00-0600 [scrapy] INFO: Scrapy 0.12.0.2546 started (bot: loginTest)
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Enabled extensions: TelnetConsole, SpiderContext, WebService, CoreStats, MemoryUsage, CloseSpider
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Enabled scheduler middlewares: DuplicatesFilterMiddleware
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Enabled downloader middlewares: HttpAuthMiddleware, DownloadTimeoutMiddleware, UserAgentMiddleware, RetryMiddleware, DefaultHeadersMiddleware, RedirectMiddleware, CookiesMiddleware, HttpCompressionMiddleware, DownloaderStats
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Enabled spider middlewares: HttpErrorMiddleware, OffsiteMiddleware, RefererMiddleware, UrlLengthMiddleware, DepthMiddleware
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Enabled item pipelines: FilesPipeline
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Telnet console listening on 0.0.0.0:6023
2013-12-03 13:04:00-0600 [scrapy] DEBUG: Web service listening on 0.0.0.0:6080
2013-12-03 13:04:00-0600 [ietf] INFO: Spider opened
2013-12-03 13:04:00-0600 [ietf] DEBUG: Crawled (200) <GET http://www.ietf.org/> (referer: None)
2013-12-03 13:04:00-0600 [ietf] DEBUG: Scraped LogintestItem(file_urls=['http://www.ietf.org/images/ietflogotrans.gif', 'http://www.ietf.org/rfc/rfc2616.txt', 'http://www.rfc-editor.org/rfc/rfc2616.ps', 'http://www.rfc-editor.org/rfc/rfc2616.pdf', 'http://tools.ietf.org/html/rfc2616.html']) in <http://www.ietf.org/>
2013-12-03 13:04:00-0600 [ietf] DEBUG: Crawled (200) <GET http://www.ietf.org/images/ietflogotrans.gif> (referer: None)
2013-12-03 13:04:02-0600 [ietf] DEBUG: Crawled (200) <GET http://www.ietf.org/rfc/rfc2616.txt> (referer: None)
2013-12-03 13:04:02-0600 [ietf] DEBUG: Crawled (200) <GET http://tools.ietf.org/html/rfc2616.html> (referer: None)
2013-12-03 13:04:02-0600 [ietf] DEBUG: Crawled (200) <GET http://www.rfc-editor.org/rfc/rfc2616.pdf> (referer: None)
2013-12-03 13:04:02-0600 [ietf] DEBUG: Crawled (200) <GET http://www.rfc-editor.org/rfc/rfc2616.ps> (referer: None)
2013-12-03 13:04:02-0600 [ietf] INFO: Passed LogintestItem(files=[], file_urls=['http://www.ietf.org/images/ietflogotrans.gif', 'http://www.ietf.org/rfc/rfc2616.txt', 'http://www.rfc-editor.org/rfc/rfc2616.ps', 'http://www.rfc-editor.org/rfc/rfc2616.pdf', 'http://tools.ietf.org/html/rfc2616.html'])
2013-12-03 13:04:02-0600 [ietf] INFO: Closing spider (finished)
2013-12-03 13:04:02-0600 [ietf] INFO: Spider closed (finished)
Could you please tell me what I did wrong here? I guess I don't need to get FilesPipeline?
Thanks in advance,
Papis