How to access Google Cache?

266 views
Skip to first unread message

Vacilando

unread,
Oct 14, 2008, 11:14:15 AM10/14/08
to Google App Engine
Hi,

In an application I build I sometimes need to check on a page, but it
is enough for me (and presumably much faster) if I could read its
latest copy from Google Cache.

Is there a way to fetcha page from Google Cache programmatically?

Thanks!

Sylvain

unread,
Oct 14, 2008, 2:56:50 PM10/14/08
to Google App Engine

Sal

unread,
Oct 14, 2008, 8:42:34 PM10/14/08
to Google App Engine
It's not very clear (at least to me) what you're trying to do. If you
have objects that are being pulled from the datastore, then you should
definitely use the memcache API as previously mentioned where
appropriate. However, if you're trying to get a link to a Google
Cached Page, the Search AJAX API exposes a cacheUrl property that you
can use in the JSON response.

Vacilando

unread,
Oct 15, 2008, 7:19:13 AM10/15/08
to Google App Engine
Thanks.

My intention is to be able to use an intelligent agent to examine
other websites' pages - and then flagging pages that contain something
important (words, patterns, etc.)

Jon McAlister

unread,
Oct 16, 2008, 5:24:53 PM10/16/08
to Google App Engine

Tony Arkles

unread,
Oct 16, 2008, 6:01:03 PM10/16/08
to Google App Engine
I think the question is whether or not there's a convenient way to
retrieve the url from google's cache, instead of pushing the request
out onto the (potentially slow) internet.

Jon McAlister

unread,
Oct 19, 2008, 1:04:50 PM10/19/08
to Google App Engine
There certainly is such a way. Use the urlfetch API and make the
request against the google cache. For instance, urlfetch.fetch('http://
www.google.com/search?q=cache:en.wikipedia.org/wiki/Namibia'). All the
expected limitations apply though, in that this will only work for
pages not restricted by robots.txt, and you don't have control over
which time version of the page is returned.
Reply all
Reply to author
Forward
0 new messages