I haven't understood what you've answered. Is it possible to retrieve
more than 50000 articles by using time slices? Because
I'm retrieving 10 articles per iteration (waiting 3 seconds after each
one) and even this way I can't retrieve more than 50000. Is that
right? Because I've read some papers that have used the API to build
collaboration networks by category, e.g Astro-ph; using all the
articles from such category which is exactly what I'm trying to do to
carry on some experiments.
Cheers
Paulo S.
On Aug 26, 7:09 am, Thorsten S <
thorsten.schwan...@gmail.com> wrote:
> the max number of returned search results is limited to 50000 for
> practical reasons.
>
> we recommend using time slices for searches that are too broad.
>
> Cheers
> T.
>
>
>
>
>
>
>
> On Thu, Aug 25, 2011 at 2:51 PM, Paulo S. <
prssoar....@gmail.com> wrote:
> > Using the API, I could notice that the total results for astro-ph
> > category is 105380 as presented below:
>
> > <opensearch:totalResults xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">105380</opensearch:totalResults>
> > <opensearch:startIndex xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">0</opensearch:startIndex>
> > <opensearch:itemsPerPage xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">10</opensearch:itemsPerPage>
>
> > However when I try to get the last articles, I actually get no
> > results:
>
> > Query:
http://export.arxiv.org/api/query?search_query=cat:astro-ph&start=105...
> > Data:
>
> > <feed xmlns="
http://www.w3.org/2005/Atom">
> > <link href="
http://arxiv.org/api/query?search_query%3Dcat%3Aastro-ph
> > %26id_list%3D%26start%3D105370%26max_results%3D10" rel="self"
> > type="application/atom+xml"/>
> > <title type="html">ArXiv Query: search_query=cat:astro-
> > ph&id_list=&start=105370&max_results=10</title>
> > <id>
http://arxiv.org/api/UpGo12YR3p9ADRUPMBxluA9RiLA</id>
> > <updated>2011-08-25T00:00:00-04:00</updated>
> > <opensearch:totalResults xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">105380</opensearch:totalResults>
> > <opensearch:startIndex xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">105370</opensearch:startIndex>
> > <opensearch:itemsPerPage xmlns:opensearch="
http://a9.com/-/spec/
> > opensearch/1.1/">10</opensearch:itemsPerPage>
> > </feed>
>
> > Changing the start and max_results a little bit, I've figured out that
> > I can only get results until 49945, that is, any value of max_results
> > bigger than one for the query below doesn't return any result.
>
> > Query:
http://export.arxiv.org/api/query?search_query=cat:astro-ph&start=499...