Using scrapy to scrape html code generated by Javascripts

230 views
Skip to first unread message

Scott Chen

unread,
Dec 18, 2015, 2:24:20 AM12/18/15
to scrapy-users
Hi all,

I'm trying to scrape realtime data that is generated by a javascript <noscript> on a webpage. The html file does not contain the info I would like to scrape. All I can see from the source is a <noscript> tag at the place where my data is at when I use inspect element.

Therefore, I was wondering if scrapy is capable of scraping data generate by javascripts.

Thanks in advance!

Scott

Valdir Stumm Junior

unread,
Dec 18, 2015, 6:33:08 AM12/18/15
to scrapy...@googlegroups.com
Hey, Scott!

You can use Splash (https://splash.readthedocs.org/en/stable/), which is a JS rendering service that integrates with Scrapy through Scrapy-splash (https://github.com/scrapinghub/scrapy-splash).



Regards,

Valdir.


--
You received this message because you are subscribed to the Google Groups "scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scrapy-users...@googlegroups.com.
To post to this group, send email to scrapy...@googlegroups.com.
Visit this group at https://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.



--
Scrapinghub

Valdir Stumm Junior 
Developer Evangelist, Scrapinghub 

Skypestummjr
TwitterGithub
TwitterLinkedInGithub

We turn web content into structured data. Lead maintainers of Scrapy.

Reply all
Reply to author
Forward
0 new messages