Bulk data for specific date range

Shubham Agarwal

Sep 8, 2023, 8:02:11 AM
to arXiv API
Hi! 
Thank you for providing the APIs and for the bulk access. 
I would like to download papers in bulk (PDFs and LaTeX source) for a specific date range (say, August 2023). Is there an easy way to do this?

This would be helpful for incremental updates of a local source dump.

Thanks! 
Best,
Shubham

Jake Weiskoff

Sep 8, 2023, 8:03:27 AM
to arxi...@googlegroups.com
Hi Shubham,

Downloading source isn't really the purpose of the API (which is limited to the metadata), but you can read about how we recommend people collect custom data sets here: 


Best,
-Jake

--
You received this message because you are subscribed to the Google Groups "arXiv API" group.
To unsubscribe from this group and stop receiving emails from it, send an email to arxiv-api+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/arxiv-api/442c54c1-7eb3-41f9-bc65-a251e1a6b75dn%40googlegroups.com.

Shubham Agarwal

Sep 8, 2023, 9:38:42 AM
to arxi...@googlegroups.com
Thanks!

If I have already done the S3 bulk download once, how do I update my local dump with the new papers (PDFs and LaTeX source)?

Thanks!

Best,
Shubham Agarwal
Profile: https://shubhamagarwal92.github.io/
LinkedIn: https://www.linkedin.com/in/shubham-agarwal-4b215146/

Jake Weiskoff

Sep 8, 2023, 9:55:50 AM
to arxi...@googlegroups.com
You'd have to download any data that has been added or updated since the last time you pulled from the buckets. There's no programmatic method that we recommend above others, as this may depend on the software you're using locally.

-Jake
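
For anyone landing on this thread later, here is a rough sketch of one way to work out "what changed since my last pull" for a given month range. It is not an official arXiv method. It assumes you have already fetched a bulk-data manifest as XML text, and that the manifest lists one `<file>` element per archive with `<filename>`, `<md5sum>`, and `<yymm>` children; check that layout against the manifest you actually have. The function names and the sample manifest below are made up for illustration. Fetching the manifest and the archives themselves is left out, since that depends on your local tooling.

```python
import xml.etree.ElementTree as ET

# Made-up sample data in the ASSUMED manifest layout (not real arXiv entries).
SAMPLE_MANIFEST = """<arXiv_src>
  <file>
    <filename>src/arXiv_src_2307_001.tar</filename>
    <md5sum>aaa</md5sum>
    <yymm>2307</yymm>
  </file>
  <file>
    <filename>src/arXiv_src_2308_001.tar</filename>
    <md5sum>bbb</md5sum>
    <yymm>2308</yymm>
  </file>
  <file>
    <filename>src/arXiv_src_2308_002.tar</filename>
    <md5sum>ccc</md5sum>
    <yymm>2308</yymm>
  </file>
</arXiv_src>"""


def yymm_key(yymm):
    """Turn a 'yymm' string into a sortable integer such as 202308.

    Assumption: two-digit years 91-99 mean 1991-1999, anything else 20xx.
    """
    yy, mm = int(yymm[:2]), int(yymm[2:])
    year = 1900 + yy if yy >= 91 else 2000 + yy
    return year * 100 + mm


def select_new_archives(manifest_xml, yymm_start, yymm_end, local_md5s):
    """List archive filenames in [yymm_start, yymm_end] that need downloading.

    local_md5s maps filenames you already hold to the checksum recorded when
    you downloaded them. An archive is selected if it is missing locally or
    if its checksum in the manifest differs from the recorded one.
    """
    lo, hi = yymm_key(yymm_start), yymm_key(yymm_end)
    root = ET.fromstring(manifest_xml)
    selected = []
    for entry in root.iter("file"):
        filename = entry.findtext("filename")
        md5sum = entry.findtext("md5sum")
        yymm = entry.findtext("yymm")
        if filename is None or yymm is None:
            continue  # skip entries that do not match the assumed layout
        if not lo <= yymm_key(yymm) <= hi:
            continue  # outside the requested month range
        if local_md5s.get(filename) == md5sum:
            continue  # already downloaded and unchanged
        selected.append(filename)
    return selected


if __name__ == "__main__":
    # August 2023 only, with one of the two August archives already on disk.
    have = {"src/arXiv_src_2308_001.tar": "bbb"}
    print(select_new_archives(SAMPLE_MANIFEST, "2308", "2308", have))
```

Comparing checksums rather than just filenames is a deliberate choice here: it also catches an archive that was replaced under the same name, at the cost of keeping a small record of what you downloaded.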
