Certainly, this is best suited for CPU-bound scrapers.
It requires the latest development version of Scrapy and hasn't been
tested in production.
Source code: https://github.com/darkrho/scrapy-redis
Regards,
~Rolando
Yes, but SCHEDULER_PERSIST is optional; set it to True only if you want the
pause/resume feature.
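
As a rough sketch, the relevant part of a project's settings.py could look
like the following. The scheduler path is an assumption based on the package
layout in the repository, so check it against the README for your version.

```python
# settings.py (sketch; the scheduler path is assumed, verify against the README)

# Let scrapy-redis schedule requests through Redis.
SCHEDULER = "scrapy_redis.scheduler.Scheduler"

# Optional: keep the request queue in Redis when the spider stops,
# which is what makes pause/resume possible. Leave it out otherwise.
SCHEDULER_PERSIST = True
```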
The RedisPipeline serializes each item as JSON, so the
post-processing can be done
in any language that supports Redis and JSON.
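
For illustration, a minimal post-processing consumer in Python might look
like this. The helper name pop_item and the key "myspider:items" are
hypothetical; use whatever key the pipeline writes to in your setup. The
client is assumed to be a redis-py style object with a blpop method.

```python
import json


def pop_item(client, key, timeout=0):
    """Pop one serialized item from the Redis list `key` and decode it.

    `client` is assumed to behave like redis.Redis: blpop returns a
    (key, value) pair, or None if the timeout expires first.
    """
    popped = client.blpop(key, timeout=timeout)
    if popped is None:
        return None
    _key, raw = popped
    return json.loads(raw)


# Usage sketch (needs the redis package and a running server):
#
#   import redis
#   client = redis.Redis()
#   while True:
#       item = pop_item(client, "myspider:items", timeout=5)
#       if item is None:
#           break
#       print(item)
```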
Regards
~Rolando