The New York Tech Libraries are celebrating Open Access Week 2025. Please share within your networks our calendar of activities:
I would like to highlight two Friday sessions that support sustainability and creating a corpus of open access literature.
The New York Tech Libraries are collaborating with researchers from the National Institute of Plant Genome Research (NIPGR), New Delhi, India, the Leibniz Information Centre for Science and Technology and University Library, and Cambridge University to promote tools developed by #semanticclimate, pygetpapers. Please reach out if you have any questions about any of our Open Access Week events.
Friday, October 24: Open Science and Citizen Science with #semanticClimate: Open-Software for Knowledge Liberation
Register to get the Zoom link.
Session 1: 8:30am-9:45am EST
Corpus creation from open access repositories and their analysis with semantic tools presented by Ms. Udita Agarwal, Ph.D. student, National Institute of Plant Genome Research (NIPGR) and Dr. Renu Kumari, Program Manager #semanticClimate and Ms. Shaik Zainab, Anurag University, Hyderabad. The session will provide an introduction to corpus creation and pygetpapers for literature review. Links to the recording will be available after the session.
They will be showing their complete READER-oriented toolkit. This takes OA material (both academic articles and more generally Open material such as the UN IPCC reports) and automatically turns it into semantic form (no AI contamination at this stage). There are several components, all Open, and managed by Dr. Renu Kumari, and largely automatic (i.e. their default use can scope a query in the time it takes to download material):
This is highly flexible (written in Python) but designed for use by those without technical knowledge (other than having Python on the machine).
Friday, October 24: Open Science and Citizen Science with #semanticClimate: Open-Software
for Knowledge Liberation
Session 2: 12:30pm-1:50pm: Introduction to the #semanticClimate by co-Founder Dr. Gitanjali Yadav.
Dr. Yadav is a professor and researcher at the National Institute of Plant Genome Research (NIPGR), New Delhi, India. Instructor: Ms. Udita Agarwal, Ph.D Student NIPGR; Co-instructor: Dr. Renu Kumari, Program Manager #semanticClimate and Ms. Shaik Zainab, Anurag University, Hyderabad. Co-coordinators: Gitanjali Yadav, National Institute of Plant Genome Research (NIPGR), New Delhi, India, Simon Worthington, TIB – Leibniz Information Centre for Science and Technology and University Library, Peter Murray-Rust, Cambridge University, UK
In today’s world of increasing publications and their availability in the format which are not machine readable has led to the development of the tools by #semanticClimate. pygetpapers (see video example) is one of them which is being used to create corpus of the open access literatures in a structured and machine readable format for further analysis on the corpus.
The applications of the semantic and structured corpus are the following:
1- To train Natural Language Processing (NLP) models for:
2- Facilitate literature reviews:
This has also led to resolve the following challenges associated with exponential growth of publications.
Millie Gonzalez, MLIS, MBA
Dean of Libraries
Pronouns: She, Her, Ella
New York Institute of Technology
Student Activities Center, Room 317, Old Westbury Campus
Give to the Library