Hello OpenAlex users!
An update of the standard-format OpenAlex snapshot1 has been released, dated 2022-07-09.
S3 bucket: s3://openalex
In addition to the changes in the release notes, file sizes and record counts are included in the manifest files again. Thanks to everyone who let us know they were using them!
When processing this update, we recommend doing a full refresh - that is, downloading the entire snapshot instead of following the instructions for
downloading updated entities. Among other reasons, duplicate Works and Authors were discovered in the previous snapshot, and an incremental update is likely to leave you with an inconsistent copy.
We apologize for the inconvenience, and believe this will be the last time a full refresh is needed. We have completed the tech stack migration that caused the aforementioned inconsistencies and the formatting issues in the initial release of the 2022-06-09 update.
Thanks,
Richard
1 With the retirement of the MAG-format snapshot, henceforth to be referred to as "the snapshot".
--
Richard Orr
OurResearch: We build tools to make scholarly research more open, connected, and reusable—for everyone.