Hello,
I've been trying to write a small client to query the arXiv API for titles and abstracts of papers published in the last day. I'm finding that the API is currently retrieving titles several days out of date, e.g. the query
misses this paper
that shows up in the same search in the online UI
I'm wondering if the issue is with the API or the way I'm constructing the query. I did try to follow a suggestion in another thread to check the RSS feed for the latest updates and then build an id_list to query for the titles and abstracts, but unfortunately, the API does not return results for the most recent papers even if I specify their ID (it does work for older IDs that do show up in the API query). I guess an alternative is to get the ID from the RSS feed, go to the paper's page, and scrape the title and abstract from there directly, but that seems painful.
If this is a known property of the API, how many days out of date should I expect it to be in general (so that I know how many days to query to catch up on results)?
Thanks!
Pratiksha