LOD-People Data Gathering Sprint (July 29, 2025)

16 views
Skip to first unread message

Gabriel Bodard

unread,
Jul 28, 2025, 12:33:51 PMJul 28
to 'Gabriel Bodard' via Ancient People, pelagios...@googlegroups.com
Dear all,

A reminder of tomorrow's LOD People (Pelagios Network People Activity) meeting, via zoom at 12:00 BST, which we will spend discussing and gathering sample datasets. The goal is to find small samples (say min 100–1000 max records) of open-licensed person-data in whatever format they are available (RDF, JSON, CSV, TEI-XML, etc.), and upload them to the LOD-People repository for analysis, experimentation with future conversion/crosswalk/summary tools, description, etc.

For your reference here are is a list of datasets and databases we have looked at in previous sprints:

If you are not able to join in person, please send any contributions, suggestions, or sample datasets to Ancient-People, or leave a comment with as much information as possible about your dataset (including where to download the raw data) in the Github issue at <https://github.com/DigiClass/LOD-People/issues/1>.
To register for the sprint, please go to <https://ics.sas.ac.uk/events/linked-open-data-people-data-gathering-sprint> and fill out the booking form. You will be sent the zoom link by email.
This sprint will be the kicking-off of this work package, which will continue asynchronously over the next several months.

Many thanks, and hope to see many of you tomorrow!

Gabby

-- 
Dr Gabriel BODARD (he/him)
Reader in Digital Classics

Director of Studies (research): Digital Humanities Research Hub
Director of Studies (research): Institute of Classical Studies

Mailing address:
  Institute of Classical Studies
  University of London
  Senate House
  Malet Street
  London WC1E 7HU
 
Due to new IT security rules, I am currently not able to read or reply to email outside of office hours, or while travelling or working from home. This may result in slower replies than usual.

Gabriel Bodard

unread,
Aug 1, 2025, 10:30:12 AMAug 1
to 'Gabriel Bodard' via Ancient People
Dear all,

After the super useful sample data-gathering sprint  this week, we now have 8 small datasets (between 100–1000 records each) in a variety of formats, in the LOD-People github repo: https://github.com/DigiClass/LOD-People/tree/main/sample-data. Some of these are in RDF (XML, N3 or TTL), others in CSV or TEI XML. I know a few more are on the way, or are just waiting for someone to help whittle down a much larger dataset to the small sample size we're looking for (where this is not a linear crop).

If you would like to add some of your data to this collection, you have three options:

  1. Make an exerpt of your data, in whatever native format you use, and email it to me or Jun (offlist), along with the basic info we need for the README, and we'll push it to Git for you.
  2. Leave the basic readme info and a link to a downloadable data dump in a comment on this issue, and we'll grab it from there. Feel free to nudge (on-list) to let us know you've done so.
  3. Push the data dump and a readme to Github yourself, either by requesting push permissions, or via a fork and pull request. Again, a nudge to let us know might speed up our noticing it.

Our plan is to hold another sprint activity in the fall to start looking at this sample data, seeing how we can describe and align their terms (to one another and/or to standard ontologies/vocabularies), so if you would like your biological and technological distinctiveness assimiliated into the collective, please do show us a bit of your data in the meantime!

Many thanks,
Reply all
Reply to author
Forward
0 new messages