Personal Inquiry Letter Regarding Bulk DOAJ Article Metadata Access

2 views
Skip to first unread message

ge cheng

unread,
Dec 17, 2025, 5:09:42 AM12/17/25
to doaj-public-api
December 17, 2025
Dear DOAJ Team,
I hope this message finds you well.
My name is Li Cheng, and I am an independent developer from Changsha, Hunan Province, China. I am currently learning and researching AI large model development as part of my personal educational journey. I am writing to inquire about accessing large volumes of article metadata from the DOAJ (Directory of Open Access Journals) database.
Learning Purpose: As an individual learner passionate about AI and academic research, I am building an AI large model to enhance my understanding of natural language processing and academic literature analysis. I require a substantial collection of scholarly article metadata to serve as training data for my educational project. The larger the dataset, the better for ensuring the quality of training results and the generalization capabilities of the model.
Specific Requirements:
  1. Bulk article metadata including titles, authors, abstracts, keywords, publication dates, and journal information
  2. The most complete dataset possible, covering various academic fields收录 in DOAJ
  3. Historical and current data - the longer the time span, the better
  4. Article classification data and citation relationship information
  5. Data in machine-readable formats (JSON/XML preferred) for batch processing
  6. If API interfaces are available, I hope they can support large-scale data acquisition
Data Volume Specification: Since AI model training has high requirements for data volume, I hope to obtain as much data as possible. If your institution has limitations or batch provision mechanisms for data access, please inform me of the specific rules and procedures.
Personal Commitments:
  • This data will be used exclusively for personal learning and educational purposes
  • I will strictly comply with DOAJ's terms of service and open access policies
  • No commercial use or redistribution of the data
  • I am willing to sign any necessary agreements for personal use
  • For extremely large datasets, I am willing to cooperate with any necessary technical integration procedures
Questions:
  1. Does DOAJ support large-volume metadata access requests from individual users?
  2. For large data volume requirements, what is the specific application process and approval timeline?
  3. Are there any limitations on the amount of data per single access? If so, what is the maximum number of records?
  4. Do you provide complete database export services or API batch access?
  5. Does bulk data acquisition involve any fees? If so, what are the charging standards?
  6. How frequently is the data updated? Can I regularly obtain new data?
As a solo learner, I deeply value the open access movement and the resources that DOAJ provides to the global research community. A large volume of high-quality academic data is of great significance to my AI learning project.
Thank you for your time and consideration. I look forward to your guidance on how I can properly access and utilize DOAJ metadata in bulk for AI model training purposes.
Best regards,
Li Cheng
Independent Developer & AI Enthusiast
Email: 5224...@qq.com
Changsha, Hunan Province, China
December 17, 2025
Reply all
Reply to author
Forward
0 new messages