Extracting the Country Details from Author Correspondence information?


Yazwand Palanichamy

Aug 19, 2021, 6:10:22 AM
to zotero-dev
Greetings all, I hope this message finds you all safe and well. 

What I want to do: 
I want to extract the country of the corresponding author from published research articles online using Zotero. As an example, please see below:

Screenshot_2.png
I have already extracted the 'abstract metadata' from each of these online articles and stored it in my Zotero library (please see below). So I have information such as the "Name of the Journal", "Title of the paper", "Year of publication", and of course, the abstract itself. This is all well and good.

Screenshot_3.png

However, I have noticed that the only metadata missing is the country of the corresponding author for each of the published papers.

Why do I want the country metadata for each author across all papers in my dataset? 
I am working on topic modeling research (i.e., a method of unsupervised machine learning) to extract important keywords/topic areas and thereby analyze key trends of interest in the "Environmental Policy" discipline (from 2000 to 2019).

To that end, I have been reviewing research articles (specifically just the abstracts) published in several environmental policy (EPOL) journals to identify these trends/keywords, and to investigate which environmental issues environmental policy scholars/advocates seem to be prioritizing the most, and why.

The end goal is to classify these topics and/or create a roadmap, so that future policy researchers (as well as interested academics in related fields) can be aware of the fundamental topic areas/trends that researchers in the EPOL field are currently prioritizing.

The missing analysis 
One particular analysis that I want to do is a country-based topic distribution analysis.

By this, I mean using the recorded country of each corresponding author to analyze which issues environmental policy scholars are predominantly addressing from a geographical/global perspective, instead of only from a per-journal perspective.

As an example, perhaps scholars reporting from the DRC in Africa are currently prioritizing the topic of "Deforestation" as a major policy area of interest. 

In India, by contrast, water governance policies/water security might be a more prevalent issue for policy scholars to investigate than deforestation.

Conclusion
Having said that, it would be most appreciated if a Dev could provide some guidance on how to extract the country of the corresponding author as an additional piece of metadata, or let me know if such a feature is not yet supported in Zotero.

Of course, another option would be to do so manually. But the dataset of abstract metadata that I have collected amounts to around 33,000-34,000 records, so doing this manually would take far too long to consider.

Any and all help would be greatly appreciated! Thanks, folks, and please do take care. 

And my apologies for the long-winded request here. 

With kind regards, 

Yazwand 


Abe Jellinek

Aug 19, 2021, 1:53:15 PM
to zoter...@googlegroups.com
Unfortunately, Zotero has no field for author affiliation/location. As far as I know, no citation style wants affiliations, and Zotero generally only stores metadata relevant to generating a complete and accurate citation.

If you’re only working with a few databases, modifying the relevant translators to store affiliations in the Extra field should be straightforward. That would mean re-adding each article to Zotero, though. If you’re processing the data you’ve collected outside of Zotero, you can use the Crossref API to fetch author affiliation metadata by DOI: https://api.crossref.org/swagger-ui/index.html#/Works/get_works__doi_
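As a rough illustration of the Crossref route, here is a minimal Python sketch, not a definitive implementation. It assumes the usual shape of a `/works/{doi}` response, where the record sits under `"message"` and each entry in `"author"` may carry an `"affiliation"` list of `{"name": ...}` objects. The function names are hypothetical. As far as I know, Crossref does not flag the corresponding author, and the affiliation is free text, so `guess_country` is only a heuristic that takes whatever follows the last comma.

```python
import json
import urllib.parse
import urllib.request

CROSSREF_WORKS = "https://api.crossref.org/works/"


def fetch_work(doi, mailto=None):
    """Fetch the Crossref record for one DOI and return its "message" object."""
    url = CROSSREF_WORKS + urllib.parse.quote(doi)
    if mailto:
        # Optional contact address, passed as the "mailto" query parameter.
        url += "?mailto=" + urllib.parse.quote(mailto)
    with urllib.request.urlopen(url, timeout=30) as response:
        return json.load(response)["message"]


def author_affiliations(work):
    """Return (author name, [affiliation strings]) pairs in author order."""
    result = []
    for author in work.get("author", []):
        parts = (author.get("given"), author.get("family"))
        # Fall back to "name" for authors recorded without given/family parts.
        name = " ".join(p for p in parts if p) or author.get("name", "")
        affiliations = [
            a["name"] for a in author.get("affiliation", []) if a.get("name")
        ]
        result.append((name, affiliations))
    return result


def guess_country(affiliation):
    """Heuristic (assumption): the text after the last comma is the country."""
    tail = affiliation.rsplit(",", 1)[-1].strip().rstrip(".")
    return tail or None
```

Calling `author_affiliations(fetch_work(doi))` for each DOI gives the raw affiliation strings. Many records have an empty affiliation list, and the trailing component is not always a country, so the output would need checking against a country list before analysis.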


Diego de la Hera

Aug 23, 2021, 8:20:18 AM
to zotero-dev
Hi, Yazwand.

Some months ago I did something similar to what Abe suggested, using Python to retrieve author affiliations from Crossref for a list of references in a Zotero library. I wrote a short article about it here, and the code is available here. Unfortunately, for the collection I was working with, affiliation data was mostly unavailable. But maybe you will have better luck!
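Before committing to a full run, it may be worth measuring how often affiliation data is actually present for a sample of the library. The sketch below is not the code linked above, just a hypothetical illustration. It assumes the library was exported from Zotero as CSV with a "DOI" column, and that `lookup` is any function you supply that maps a DOI to a Crossref work record (a dict) or returns None on failure.

```python
import csv
import io


def read_dois(csv_text):
    """Collect non-empty DOI values from CSV text that has a "DOI" column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row["DOI"].strip() for row in reader if (row.get("DOI") or "").strip()]


def affiliation_coverage(dois, lookup):
    """Return (records with at least one affiliation, records found).

    `lookup` is a caller-supplied function (hypothetical here): DOI -> work
    dict, or None when the record could not be retrieved.
    """
    with_affiliation = 0
    found = 0
    for doi in dois:
        work = lookup(doi)
        if work is None:
            continue  # skip DOIs that could not be resolved
        found += 1
        # Count the record if any author has a non-empty affiliation list.
        if any(author.get("affiliation") for author in work.get("author", [])):
            with_affiliation += 1
    return with_affiliation, found
```

When reading the export from disk, opening the file with `encoding="utf-8-sig"` avoids a stray byte-order mark in the first column name, if one is present. A low ratio on a sample of a few hundred DOIs would suggest looking at another source before processing all 33,000 or so records.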

Best,

Diego