A member of my team has surfaced issues with the author information on several papers he spot-checked for a project. The issues appear to affect both the bulk snapshot data and the OpenAlex API. For example, this paper:
Lists "Marion Pang" as the second author in the snapshot, while the API and OpenAlex website list "Miaosen Pang". The journal website lists Miaosen Pang:
However, there appears to be deeper disambiguation issues. The OpenAlex record for this author appears to be conflating them with at least one other author that does biomedical research. In fact, the ORCID provided in the OpenAlex data points to a researcher who works in proteomics, not metallic alloys.
I am concerned because we identified several such examples with very limited manual exploration, making me think there may be systemic issues with author data. We rely on this to be accurate for our primary use case, and at this point my team doesn't trust the data.
Can someone from OpenAlex comment on this issue?