Using Scrapy LinkExtractor() to locate specific domain extensions

22 views

Skip to first unread message

unread,

Nov 20, 2016, 9:11:29 AM11/20/16

to scrapy-users

I want to use Scrapy's LinkExtractor() to only follow links in the .th domain

I see there is a deny_extensions(list) parameter, but no allow_extensions() parameter.

Given that, how do I restrict links just to allow domains in .th ?

unread,

Dec 12, 2016, 4:04:33 AM12/12/16

to scrapy-users

I believe this question was answered on StackOverflow

Reply all

Reply to author

Forward

0 new messages