Thank you, Jack. The kagel dataset seems to be very usefull to fetch all metadata.
In the meantime I could fetch the data from 30th September by increasing the delay time between two calls to the API to 10 seconds. However, I still get sometimes 104 Error Codes. I don't think I make too many requests per time. Maybe there is a lot of traffic on the API recently?Is there some time of the day on which the API is more reliable? Currently I use 03:00 UTC and I only fetch the metadata of one day. However, I make two calls to the API (one for metadata format arXiv and one for arXivRaw).
If my problem ist related to the database issue, that you mentioned, then I guess it should go away the next days. I will observe it and let you know whether I still get a lot of 104 HTTP Errors.
Best
Isabel