About the number of results I can get from API

151 views
Skip to first unread message

Sean W

unread,
Mar 16, 2016, 7:00:06 AM3/16/16
to Guardian Open Platform API Forum
Hi, 

I am a student and have had a problem of getting all text of searching results from the API. From the news website I searched the word 'statin' and it showed that there were 1000+ relevant pages and when I searched with API it returned only 400 results. How can I get all texts either from theguardian.com or API?

Thanks. 

Philip McMahon

unread,
Mar 16, 2016, 7:37:02 AM3/16/16
to guardian...@googlegroups.com
Hi Sean, just to check, are you using the pageSize and page parameters to go through the results?

--
You received this message because you are subscribed to the Google Groups "Guardian Open Platform API Forum" group.
To unsubscribe from this group and stop receiving emails from it, send an email to guardian-api-t...@googlegroups.com.
To post to this group, send email to guardian...@googlegroups.com.
Visit this group at https://groups.google.com/group/guardian-api-talk.
For more options, visit https://groups.google.com/d/optout.



This e-mail and all attachments are confidential and may also be privileged. If you are not the named recipient, please notify the sender and delete the e-mail and all attachments immediately. Do not disclose the contents to another person. You may not use the information for any purpose, or store, or copy, it in any way.  Guardian News & Media Limited is not liable for any computer viruses or other material transmitted with or as part of this e-mail. You should employ virus checking software.
 
Guardian News & Media Limited is a member of Guardian Media Group plc. Registered Office: PO Box 68164, Kings Place, 90 York Way, London, N1P 2AP.  Registered in England Number 908396


Xiang Wang

unread,
Mar 16, 2016, 8:07:36 AM3/16/16
to guardian...@googlegroups.com

In the api yes I checked all pages and yes it was 400+in total. 


You received this message because you are subscribed to a topic in the Google Groups "Guardian Open Platform API Forum" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/guardian-api-talk/JmubcWsymvk/unsubscribe.
To unsubscribe from this group and all its topics, send an email to guardian-api-t...@googlegroups.com.

chris.b...@guardian.co.uk

unread,
Mar 17, 2016, 6:32:47 AM3/17/16
to Guardian Open Platform API Forum
Hi Sean,

The search box on the web site uses Google, not the API. This is why the results do not match exactly.

I'm not sure how the Google search comes up with the number of results, but it seems to be a bit unreliable. e.g. when I searched for 'statin', it said 3,800 results on page 1, but only 2,100 results when I went to page 10.

The API returns 162 results, which I think is a more reasonable number. I would guess that the Guardian has probably written hundreds, but not thousands, of articles about statins.

Hope this helps,

Chris

Xiang Wang

unread,
Mar 17, 2016, 7:27:10 AM3/17/16
to guardian...@googlegroups.com

Hi Chris,

Thanks a lot.

Now I got why the two engines have inconsistent results. But it's really a shame as I need a lot more articles talking about statin for a NLP project.

Do anyone know are there any other news wire APIs I can use or is there anyway I can get access to all but not only limited 90 articles from the website powered by Google search sevice.

Many thanks.

Cheers,
Sean

Reply all
Reply to author
Forward
0 new messages