can we scrape a local file?

Akhil Shah

Jul 15, 2015, 2:25:49 PM
to web-s...@googlegroups.com
The "Start URL" field does not accept a local HTML file. Is it possible to scrape a locally stored HTML document?

Thanks,
akhil

Mārtiņš Balodis

Jul 20, 2015, 9:44:24 AM
to Akhil Shah, web-scraper
Hi,
Web Scraper won't accept local file URLs. You can serve the HTML files as a local website and then scrape that. If you have a Mac or Linux, you can use this command to start a simple web server in the directory that holds the files:

python -m SimpleHTTPServer 8000
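
Note that `python -m SimpleHTTPServer` is the Python 2 form of this command; on Python 3 the equivalent is `python3 -m http.server 8000`. For anyone who prefers a script, here is a minimal sketch of the same idea, assuming Python 3.7 or later. The function name `make_server` and the folder name `saved_pages` are made up for illustration and are not part of Web Scraper.

```python
import functools
import http.server


def make_server(directory, port=8000, host="127.0.0.1"):
    """Return an HTTP server that serves the files found in `directory`."""
    # SimpleHTTPRequestHandler accepts a `directory` argument since Python 3.7.
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=str(directory)
    )
    return http.server.ThreadingHTTPServer((host, port), handler)


# Example (hypothetical folder name; blocks until interrupted with Ctrl+C):
#   make_server("saved_pages").serve_forever()
```

With the server running, a file such as `saved_pages/page.html` should be reachable at `http://localhost:8000/page.html`. The sketch binds to 127.0.0.1, so it only answers requests from the same machine; pass `host="0.0.0.0"` if the pages need to be reachable through a LAN address.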


Robert Hellmundt

Aug 3, 2015, 6:57:50 AM
to Web Scraper, akhi...@gmail.com
Hey,
Thank you for the hint!
Unfortunately, I am running into some trouble when I try to use Web Scraper with this simple web server.
If I try to enter the IP (http://192.168.1.71:8000/), I get a message that the start URL is not a valid URL.

Am I doing something wrong, or is this no longer possible?

Thank you! :)

Michael

Aug 4, 2015, 5:29:32 PM
to Web Scraper, akhi...@gmail.com
Try
python -m SimpleHTTPServer 80
and

Michael

Aug 4, 2015, 5:35:51 PM
to Web Scraper, akhi...@gmail.com
You could also install Apache and host the files locally that way. I'm having no trouble with Web Scraper accessing files served like that. https://www.apachefriends.org/download.html