Usage of arxiv api in the Xstructure service

57 views
Skip to first unread message

trunov7

unread,
Feb 3, 2009, 7:13:23 AM2/3/09
to arXiv api
We inform the api arxiv community on the usage of arxiv api in the
service http://xstructure.inr.ac.ru.

Xstructure is a service for browsing and searching papers in
arxiv.org. We construct the citation graph, and use it to constuct an
hierarchical classification of the arxiv papers. A user gets the
classification as a list of themes with lists of authoritative and
review papers, and key authors attached. A search form is available;
search results are returned clustered in accord with the
classifiction.

The arxiv api is involved in two places. First, we gather with the api
the metainformation on the papers (title, author list, abstract) with
a list of paper identificators given. Second, lists of papers are
formed for a browsing user via the api.

We were to augment the api in only one but substantial respect: the
search via the search form redirects the request to http://arxiv.org.
We were to do so because the publication date cannot be used as a
parameter of an api request. Would the api be extended in this
direction, we would be glad to switch over to using exclusively the
api for accessing the arxiv data.

Regards,
Xstructure team

Thorsten S

unread,
May 11, 2009, 8:04:46 PM5/11/09
to tru...@gmail.com, gbp...@ms2.inr.ac.ru, arxi...@googlegroups.com
Hi,

thanks for letting us know about http://xstructure.inr.ac.ru/

This does look like an interesting and useful service. Please keep us
<www-...@arxiv.org> informed of future developments.


We believe that the arXiv API functionality you were looking for is
now implemented.

* you can specify date ranges with key "submittedDate" of
"lastUpdatedDate" as part of the querystring, e.g.

search_query=lastUpdatedDate:[200712310900+TO+200712310959]

the date/time string is interpreted as GMT

* you can specify optional "sortBy" and "sortOrder" parameters

sortBy = [relevance, submittedDate, lastUpdatedDate]
sortOrder = [ascending, descending]


For example, to find all the papers last updated between 09:00am -
09:59am GMT on December 31st, 2007, sorted by submission date, issue
this query:

http://export.arxiv.org/api/query?search_query=lastUpdatedDate:[200712310900+TO+200712310959]&sortBy=submittedDate&sortOrder=ascending

you'll notice that arxiv:0705.1827 was first submitted in May 2007 and
replaced during the specified time-window in December 2007, so a
similar query specifying a submittedDate time-window will not list
that particular paper

http://export.arxiv.org/api/query?search_query=submittedDate:[200712310900+TO+200712310959]&sortBy=submittedDate&sortOrder=ascending



To get all of the papers submitted in December 2007 which have
"electron" in the title, use

http://export.arxiv.org/api/query?search_query=lastUpdatedDate:[200712010000+TO+200712312359]+AND+ti:electron&start=0&max_results=500&sortBy=submittedDate&sortOrder=ascending

Here we use the "start" and "max_results" parameters to return the
first 50 matches (slices must be limited to < 2000).

The response feed tells us that there are 97 matches total
<opensearch:totalResults
xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">97</opensearch:totalResults>
so you could retrieve the remaining 47 via parameter settings
&start=50&max_results=100

Let us know of any problems with this interface.


Best regards
Thorsten Schwander

trunov7

unread,
May 12, 2009, 10:54:24 AM5/12/09
to arXiv api
Hi,Thorsten.

Thanks for detailed reply.
We will try to use this option in future.
Regards,
Grigorii Pivovarov and Trunov Sergei
Reply all
Reply to author
Forward
0 new messages