Good morning everyone,
Context: I'm doing a cross-institutional benchmarking analysis to compare a specific research department at multiple universities. My workflow is:
This approach is necessary because I only have detailed internal data for my own institution, but need to benchmark against other universities where I must rely on OpenAlex affiliation-based queries.
The problem is that, using the API, OpenAlex retains detailed authorship information only for the first 100 authors of a publication. For publications with >100 authors, affiliations beyond this cutoff are not indexed. This creates a significant data loss issue for certain research fields.
More critically: I cannot determine whether authors from Institution A tend to appear in positions 1-100 more frequently than authors from Institution B in the same large-collaboration publications. This creates an unmeasurable bias when comparing institutions.
Is there a way to query OpenAlex (not using the snapshot) that captures all institutional affiliations for 100+ author publications, even if individual author details are truncated? Are there alternative approaches to retrieve institutional affiliation information for large-collaboration publications that I'm missing?
I'm aware I could use DOI-matching for my own institution, but this doesn't solve the core problem: I need a consistent methodology that works equally for all institutions involved.
Any thoughts on this would be greatly appreciated!
Silvia Fattori
--
You received this message because you are subscribed to the Google Groups "OpenAlex Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openalex-commun...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/openalex-community/0f5f553b-8fdf-4b6e-a123-6f880ace9392n%40googlegroups.com.
Yes I see.
I did this for another project of ours and could be a solution for you too: download all publications of the relevant topics and of the relevant period, make sure you download all authors when author count is >= 100 and then do the filtering and sorting on your side in a database instead of relying on the api. Can be a long running download though but for me it was worth the wait. That’s a compromise between using the api with its limitations and hosting and paying for a data snapshot yourself…
Let me know if this can be a solution for you.
--
You received this message because you are subscribed to the Google Groups "OpenAlex Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openalex-commun...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/openalex-community/7BDF9105-855E-429D-B002-119BD0C8B0AB%40gmail.com.
To view this discussion visit https://groups.google.com/d/msgid/openalex-community/CAHRH3y6xohStH0KNpiOho-hQ8brw_LK1LZr7q2n4F7Qh0t6%3Dcg%40mail.gmail.com.