Hello,
I've been exploring PDF availability across different APIs and found an interesting pattern.
Example paper: https://api.openalex.org/works/W4402775298
I checked the same paper across multiple sources:
However, Semantic Scholar returns externalIds.ArXiv: "2408.02784" - and the PDF is directly accessible at https://arxiv.org/pdf/2408.02784.pdf.
For papers where:
Could OpenAlex automatically populate pdf_url with the constructed arXiv link? The pattern is simple and reliable: https://arxiv.org/pdf/ followed by the arXiv ID and .pdf, as in the link above.
This could significantly improve PDF coverage without relying on external services to provide the URL - arXiv's URL structure is stable and predictable.
Alternatively, OpenAlex could expose arxiv_id as a top-level field in the response, which would make it easier for users to construct the URL themselves.
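
To make the idea concrete, here is a minimal Python sketch of the fallback I have in mind. The helper names (arxiv_pdf_url, fill_pdf_url) are my own, and arxiv_id is the proposed field, not something OpenAlex returns today. The sketch only handles new-style IDs such as 2408.02784 and assumes the https://arxiv.org/pdf/<id>.pdf pattern shown above keeps working.

```python
import re
from typing import Optional

# New-style arXiv identifiers, e.g. "2408.02784" or "2408.02784v2".
# Old-style IDs (with a subject-class prefix) are deliberately not handled here.
_NEW_STYLE_ID = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")


def arxiv_pdf_url(arxiv_id: str) -> Optional[str]:
    """Build the PDF link for a new-style arXiv ID, or return None if it does not look like one."""
    cleaned = arxiv_id.strip()
    # Tolerate an optional "arXiv:" prefix, as some sources include it.
    if cleaned.lower().startswith("arxiv:"):
        cleaned = cleaned[len("arxiv:"):]
    if not _NEW_STYLE_ID.match(cleaned):
        return None
    return f"https://arxiv.org/pdf/{cleaned}.pdf"


def fill_pdf_url(pdf_url: Optional[str], arxiv_id: Optional[str]) -> Optional[str]:
    """Keep an existing pdf_url; otherwise fall back to the constructed arXiv link."""
    if pdf_url:
        return pdf_url
    if arxiv_id:
        return arxiv_pdf_url(arxiv_id)
    return None


# The example paper: no pdf_url, but Semantic Scholar reports this arXiv ID.
print(fill_pdf_url(None, "2408.02784"))
```

An existing pdf_url always wins, so the constructed link would only fill gaps and never overwrite a value that is already there.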
Thanks,
Purna