Using Scrapy LinkExtractor() to locate specific domain extensions

lee hodgson

Nov 20, 2016, 9:11:29 AM
to scrapy-users

I want to use Scrapy's LinkExtractor() to follow only links whose domain ends in .th.

I see there is a deny_extensions (list) parameter, but no corresponding allow_extensions parameter.

Given that, how do I restrict the extracted links to .th domains?
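
For context, deny_extensions filters on file extensions in the URL path (such as .pdf), not on domain suffixes, so it does not apply here. One possible approach is to give LinkExtractor's allow parameter a regular expression that only matches URLs whose host ends in .th. The sketch below is a minimal illustration, not a confirmed answer from this thread: the names TH_URL_PATTERN and matches_th are hypothetical, and the pattern is checked with the standard library only so it runs without Scrapy installed.

```python
import re

# Hypothetical pattern (not from the thread): match http(s) URLs whose
# host part ends in ".th", optionally followed by a port, and then a
# path, query, fragment, or the end of the URL.
TH_URL_PATTERN = r"(?i)^https?://[^/?#]*\.th(?::\d+)?(?:[/?#]|$)"


def matches_th(url: str) -> bool:
    """Return True if the URL's host ends in the .th suffix."""
    return re.search(TH_URL_PATTERN, url) is not None


# Assumed usage with Scrapy installed (left as a comment so this block
# runs on its own):
#   from scrapy.linkextractors import LinkExtractor
#   extractor = LinkExtractor(allow=TH_URL_PATTERN)

if __name__ == "__main__":
    for candidate in (
        "http://example.co.th/page",
        "http://example.com/file.th",
        "http://example.th.evil.com/",
    ):
        print(candidate, matches_th(candidate))
```

Passing allow_domains=["th"] may also work, since Scrapy appears to match domains by suffix, but that is an assumption worth testing against your Scrapy version.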

Paul Tremberth

Dec 12, 2016, 4:04:33 AM
to scrapy-users