Every OAI-PMH metadata record has adatestampassociated with it, which is the last modification time of that record. Because arXiv has updated metadata records in bulk on several occasions, the OAI-PMHdatestampvalues do not correspond with the original submission or replacement times for older articles, and may not for newer articles because of administrative and bibliographic updates. The earliest datestamp is given then the<earliestDatestamp>element of the Identify response.
The OAI-PMH interface does not support selective harvesting based on submission date. The datestamps are designed to support incremental harvesting of updates on an ongoig basis. It is not possible to selectively harvest only, say, articles submitted in February 2001 (identifiers 0102.xxxx). Except for selective harvesting based on subject areas (see description of Sets below) the interface is designed to support copying and synchronization of a complete set of arXiv metadata. In order to harvest metadata for all articles, either make requests without a datestamp range (recommended), or make requests from the<earliestDatestamp>through to the present (but beware that because of bulk updates there are some dates on which there were large numbers of updates).
Once an initial harvest has been completed, the copy may be maintained by making imcremental harvesting requests with thefromdate set to the date of last harvest (fromis best taken from the last server response; don't set theuntildate).