Hello,
I've written a little python script that downloads preprints posted in the previous month by a long list of authors whose work I'm interested in. It works by
1. querying for preprints by each author, e.g.
2. selecting those posted in the previous month
3. deduplicating the resulting list
4. downloading the pdfs.
I have quite a long wait time in between each query (>3 s per terms of use; up to 30s, but this is a knob I turn).
This successfully runs through some number of author queries, then fails with a socket connection timeout error.
One possible cause is my (admittedly unstable) home internet connection, but I didn't think it was *that* unstable. In that case, the solution is to retry. But if I've fallen afoul of some rate-limiting procedure on the arXiv side, retrying is the opposite of helpful.
So---am I running into throttling on the arXiv server side? How can I do this and be a good citizen?
Best,
Christopher