Hi,
Thanks for your help.
I tried using wget as explained in
http://code.google.com/apis/ajaxsearch/documentation/.
But there is some issue with wget coz it gives an error : Unsupported
Scheme.
If you have an idea about wget, is SSL is required for this purpose?
I have looked in many forums for wget but haven't been able to solve
this issue.
thanks
Sarabjeet
On Jun 16, 12:40 am, Ray Baxter <
ray.bax...@gmail.com> wrote:
> On Jun 15, 2008, at 5:48 PM, Sarabjeet wrote:
>
>
>
> > I am working on a project where I need to crawl google search to
> > download the first 50 results for a hundred queries and store them on
> > my computer. I do not need a webpage or something. What is the best
> > API for me to use because I found out that google does not allow
> > client programs to crawl their websites directly. Also, I am not at
> > all familiar with Javascipt but I know Python. The Python gdata
> > library, unfortunately, does not have the search APIs.
>
> There is a restful interface to Google search:
http://code.google.com/apis/ajaxsearch/documentation/reference.html#_...
>
> You will need a valid referrer page, perhaps something describing your
> project and how to contact you. There is some limitation on the number
> of results that can be obtained, but I don't see it documented. In my
> brief testing, the current limitation is 100 results, ie.
http://ajax.googleapis.com/ajax/services/search/web?v=1.0&q=Google&st...
> returns results, buthttp://
ajax.googleapis.com/ajax/services/search/web?v=1.0&q=Google&st...