new OpenAlex standard format snapshot released

62 views
Skip to first unread message

Richard Orr

unread,
Jul 13, 2022, 2:42:26 PM7/13/22
to openale...@googlegroups.com
Hello OpenAlex users!

An update of the standard-format OpenAlex snapshot1 has been released, dated 2022-07-09.

S3 bucket: s3://openalex

In addition to the changes in the release notes, file sizes and record counts are included in the manifest files again. Thanks to everyone who let us know they were using them!

When processing this update, we recommend doing a full refresh - that is, downloading the entire snapshot instead of following the instructions for downloading updated entities. Among other reasons, duplicate Works and Authors were discovered in the previous snapshot, and an incremental update is likely to leave you with an inconsistent copy.

We apologize for the inconvenience, and believe this will be the last time a full refresh is needed. We have completed the tech stack migration that caused the aforementioned inconsistencies and the formatting issues in the initial release of the 2022-06-09 update.

Thanks,
Richard

1 With the retirement of the MAG-format snapshot, henceforth to be referred to as "the snapshot".

--
Richard Orr
Lead Developer - Unpaywall, OpenAlex
OurResearchWe build tools to make scholarly research more open, connected, and reusable—for everyone.
Reply all
Reply to author
Forward
0 new messages