Hello everybody, I need to scrape two websites that each return a huge JSON document (1.2 MB) and use that JSON plus JavaScript to build the HTML. What is the best approach?
--
You received this message because you are subscribed to the Google Groups "scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scrapy-users...@googlegroups.com.
To post to this group, send email to scrapy...@googlegroups.com.
Visit this group at http://groups.google.com/group/scrapy-users?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
I actually have a similar use case. I'd like to query a bunch of REST APIs that return JSON, and was wondering if I could leverage Scrapy to parse the JSON responses. I would have to transform the field names and structures to normalize them into a common format, and then store the results in Django models. Has anyone used Scrapy for this purpose?
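For what it's worth, the normalization step could look roughly like the sketch below. It uses only the standard library. The source names (`api_a`, `api_b`), the field names, and the helpers `normalize` and `parse_body` are all made up for illustration; none of them come from Scrapy or from any real API. It also assumes each response body is a JSON list of objects.

```python
import json

# Hypothetical per-source field maps: raw field name -> common field name.
FIELD_MAPS = {
    "api_a": {"id": "external_id", "fullName": "name", "mail": "email"},
    "api_b": {"user_id": "external_id", "name": "name", "email_address": "email"},
}


def normalize(source, record):
    """Rename the fields of one decoded JSON object to the common schema.

    Fields without a mapping are dropped; mapped fields that are missing
    from the record are skipped.
    """
    mapping = FIELD_MAPS[source]
    return {common: record[raw] for raw, common in mapping.items() if raw in record}


def parse_body(source, body):
    """Decode a JSON body (assumed to be a list of objects) and normalize each one."""
    return [normalize(source, record) for record in json.loads(body)]
```

Inside a Scrapy callback the text passed as `body` would presumably be `response.text`, and each normalized dict could then be handed to whatever code saves it to a Django model. That part is not shown here.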
Thanks for the answer. I was looking for something a bit more solid, where you could specify the paths of the elements to be extracted with something like XPath. The idea would be to create a spider that is specific to JSON and easy to extend for any schema.
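To make the idea concrete, here is a minimal sketch of a path-based extractor for decoded JSON. The dotted-path syntax and the `extract` function are invented for this example; this is not XPath, JSONPath, or an existing Scrapy feature. A path is a sequence of keys separated by dots, a number selects a list index, and `*` matches every list item or dict value.

```python
def extract(data, path):
    """Return a list of all values in `data` that match the dotted `path`."""
    current = [data]
    for key in path.split("."):
        matched = []
        for node in current:
            if key == "*":
                # Wildcard: fan out over all children of a list or dict.
                if isinstance(node, list):
                    matched.extend(node)
                elif isinstance(node, dict):
                    matched.extend(node.values())
            elif isinstance(node, dict) and key in node:
                matched.append(node[key])
            elif isinstance(node, list) and key.isdigit() and int(key) < len(node):
                # Numeric key on a list: treat it as an index.
                matched.append(node[int(key)])
        current = matched
    return current


doc = {"results": [{"title": "a", "tags": ["x", "y"]}, {"title": "b"}]}
titles = extract(doc, "results.*.title")
```

With something like this, a JSON-specific spider could keep a per-schema dict of field name to path and call `extract` for each entry, so supporting a new schema would only mean writing a new dict. Paths that match nothing give an empty list instead of raising.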