Hey all,
I believe I found a problem with the API today. Here's the description of the problem:
If I look at https://arxiv.org/list/cs/new, I see submissions received from Fri 6 Oct 23 to Mon 9 Oct 23, announced Tue, 10 Oct 23. The latest item is arXiv:2310.05922.
However, if I try to list recent items using OAI-PMH, the API informs me that there are no items after Oct-6th:
http://export.arxiv.org/oai2?verb=ListIdentifiers&metadataPrefix=oai_dc&from=2023-10-07&set=cs returns "No records match the paramaters specified for this selective harvesting request", which is obviously false as there are plenty of items announced on Oct-9th and Oct-10th.
The latest date for which the API returns something is 2023-10-06. From these results, the latest item is arXiv:2310.03744, which is far behind what the website shows.
Strangely, although the verb ListIdentifiers (and ListRecords) is not returning an up-to-date list, the verb GetRecord is working: if I run http://export.arxiv.org/oai2?verb=GetRecord&identifier=oai:arXiv.org:2310.05922&metadataPrefix=oai_dc, I can get the details of the latest item shown in the website (which is not shown in ListIdentifiers or ListRecords)
Also, if I try to list recent items using RSS, the list is also not updated:
http://export.arxiv.org/rss/cs returns a list of items for which the latest is arXiv:2310.04453, also behind the latest item shown in the website.
This is happening and, at the same time, https://status.arxiv.org/ says all systems are go.
What is happening? When can we expect the differences to be fixed/solved?
We thought of a workaround that would basically scrape https://arxiv.org/list/cs/new or https://arxiv.org/list/cs/recent, but I'd prefer not doing that..
Pls advise.
Thanks