Groups keyboard shortcuts have been updated
Dismiss
See shortcuts

LP9 Data Collection Activity call

72 views
Skip to first unread message

Jun Ogawa

unread,
Dec 5, 2023, 12:01:47 PM12/5/23
to Ancient People
Dear all, 

As many of you know, Linked Pasts 9 just started yesterday and we, the People Activity, held a kick-off meeting to discuss agendas coming two weeks. Thank you so much to those who attended the event, and for those who couldn't, we are very welcome to join the asynchronous activities from now on. 

At yesterday's meeting, we decided to work on collecting and depositing some sample LOD people data from existing projects, whether it is your own or of some others, as one of the two tasks we take for these two weeks. For the details, please look at the agenda document from yesterday. 

For this activity, I started the issue on our GitHub and created a sample thread to introduce the project and the data that provides. Maybe we can start the activity by introducing some data on this issue, either from the existing projects or other personal data? 
Once we have some sample data, we can make some new directories on GitHub and start storing them in an organized way. For this depositing method, we can discuss more on this Google group! 

Anyway, those who are interested, please begin editing the issues!

All the best,
Jun Ogawa

Gabriel Bodard

unread,
Dec 6, 2023, 9:12:43 AM12/6/23
to Ancient People
Many thanks, Jun.

In the first instance, could I ask if anyone in the group (whether from the call on Monday, or others interested in the LOD People activity) would be willing to help take a lead on this exercise? The idea as I understand it is to collect (a) information about and (b) small sample sets of open-licensed, linked person data that we can upload to the LOD-People Git repository for (i) example, (ii) analysis and (iii) experimentation with any queries, formats or tools we want to pilot and propose here. What this would involve would be perhaps some subset of:

  1. Create a sample of your own dataset, document in the light-touch way Jun has proposed in the ticket, and upload to the repo;
  2. Find and excerpt a small sample, plus light documentation, of other open licensed datasets that are available online;
  3. Reach out to others and encourage them to create and license small samples of their data for this purpose.

I suggest that we create a /sample-data/ directory, and put each sample dataset in a child folder therein (potentially multiple files in open formats such as RDF, JSON, XML or even CSV + Readme + license). Or if you think it would be easier to have a flat folder structure for scripting experiments on this data, then we'll need a very very transparent and strict naming scheme, to minimise confusion (and perhaps a registry file listing all data in the directory?).

Who would like to help get this set up?

In the meantime, please feel free to start gathering data examples, posting suggestions (here or on the ticket), request Git access, etc.

Many thanks!

Gabby

-- 
Dr Gabriel BODARD (he/him)
Reader in Digital Classics

Director of Studies (research): Digital Humanities Research Hub
Director of Studies (research): Institute of Classical Studies

Mailing address:
  Institute of Classical Studies
  University of London
  Senate House
  Malet Street
  London WC1E 7HU
 
Especially at the moment, I may email at odd hours of the day and night/days of the week. I do not ever expect a reply outside of your working hours.

From: ancient...@googlegroups.com <ancient...@googlegroups.com> on behalf of Jun Ogawa <htjk65...@gmail.com>
Sent: 05 December 2023 17:01
To: Ancient People <ancient...@googlegroups.com>
Subject: [ancient-people] LP9 Data Collection Activity call
 
--
You received this message because you are subscribed to the Google Groups "Ancient People" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ancient-peopl...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ancient-people/c50c4654-a848-4605-8306-3fd68232df47n%40googlegroups.com.

Greta Hawes

unread,
Dec 6, 2023, 5:46:22 PM12/6/23
to ancient...@googlegroups.com

Dear Gabby and Jun,

 

Thanks for setting this up, and sorry I couldn’t make the first call.   I’ve added some info and a sample from MANTO data to the issue that Jun created.  Happy to move this wherever it needs to go if a registry file ends up being preferred.

 

Greta.

 

Kyriaki Konstantinidou

unread,
Dec 7, 2023, 7:05:06 AM12/7/23
to ancient...@googlegroups.com
Dear Gabby and Jun, 

Thank you very much for setting this up! I shall ask Brady Kiesling to provide a sample from the "Digital Pausanias" project. I think that, by the end of January, we shall have sufficient data for a sample from the SLaVEgents project too. 
With my best wishes, 
      Kyriaki 


From: ancient...@googlegroups.com <ancient...@googlegroups.com> on behalf of Greta Hawes <greta...@gmail.com>
Sent: Thursday, December 7, 2023 12:46 AM
To: ancient...@googlegroups.com <ancient...@googlegroups.com>

Sinai Rusinek

unread,
Dec 7, 2023, 8:27:03 AM12/7/23
to ancient...@googlegroups.com, tomgh...@gmail.com
Dear Tom and all, 
Would you like me to start a table for the first activity? We can then share it in your email. I believe a google spreadsheet would be the most convenient for the time of the LP9 - we can then move it to Github later. Or did you already have something planned? 
All the best,
Sinai

Tom Gheldof

unread,
Dec 7, 2023, 8:35:02 AM12/7/23
to Sinai Rusinek, ancient...@googlegroups.com
Dear Sinai,

That would be great! I was also thinking about doing this (but currently still doing that for the Gazetteers activity), starting from the projects listed on the Github page, but if you want, you can already go ahead and share it with everyone in this group, so we can further populate the list and afterwards update in on Github...

All the best,

Tom

Op do 7 dec 2023 om 14:27 schreef Sinai Rusinek <sinai....@gmail.com>:

Gabriel Bodard

unread,
Dec 7, 2023, 9:34:21 AM12/7/23
to ancient...@googlegroups.com
Dear Tom and Sinai,

Are we now crossing the streams and talking about the first exercise (collating and describing existing digital person-data projects), which as far as I can see hasn't been proposed on this list yet?

If so yes, let's pick that up here (so not as to confuse with the "sample data" thread) and decide how to start.

Thanks,

Gabby


Sinai Rusinek

unread,
Dec 7, 2023, 9:49:43 AM12/7/23
to ancient...@googlegroups.com
Thanks Gabby, you are right, of course, sorry for the mess. 

Before proceeding to filling the table with the project descriptions, could we first discuss what parameters we would like to review in the table? 

For example, I am still not sure that the table will enable us to account for the intricacies of databases that deal with names and attestations in addition to Persons. 
Feel free to add suggested parameters in the first sheet or in the table, or just continue the discussion here...

All best, 
Sinai



--
You received this message because you are subscribed to the Google Groups "Ancient People" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ancient-peopl...@googlegroups.com.

Tom Gheldof

unread,
Dec 7, 2023, 10:45:25 AM12/7/23
to ancient...@googlegroups.com
Thanks, Sinai! And you're right, Gabby, best to keep the 2 initiatives separately :-)

Indeed, more onomastic focussed databases (such as TM People) will need other parameters, so it seems best if other group members can also check the spreadsheet and add suggestions...

All best,

Tom

Op do 7 dec 2023 om 15:49 schreef Sinai Rusinek <sinai....@gmail.com>:

Greta Hawes

unread,
Dec 7, 2023, 3:30:49 PM12/7/23
to ancient...@googlegroups.com

Hi Sinai,

 

These categories in the spreadsheet look like they cover most of the territory that I was thinking in terms of.  Perhaps also any system for internal disambiguation or capturing uncertainty – i.e. relations between Persons within the dataset that express “possibly same as” etc?

 

Greta.

 


Date: Friday, 8 December 2023 at 1:49 am
To: ancient...@googlegroups.com <ancient...@googlegroups.com>

Jun Ogawa

unread,
Dec 11, 2023, 1:02:41 AM12/11/23
to Ancient People
Hi all,

For the second exercise of collecting sample data, I just created a new folder, /sample-data/, in the repository (https://github.com/DigiClass/LOD-People/tree/main/sample-data). 
So, any of those who have any sample people data, please feel free to upload it there (either as a collaborator or by pull requests)! 

Also if you have time, please write a brief description of it in GitHub issues (https://github.com/DigiClass/LOD-People/issues/1). 

Best
Jun

2023年12月8日金曜日 5:30:49 UTC+9 greta...@gmail.com:

Jun Ogawa

unread,
Dec 11, 2023, 10:53:45 AM12/11/23
to Ancient People
Dear all, 

At today's meeting, we discussed again the process of uploading sample data to our GH repository. 
Based on this discussion, we drafted a README (https://github.com/DigiClass/LOD-People/blob/main/sample-data/README.md) right under the 'sample-data' folder. 

All the details are now in this document. If you know or have any data related to historical people, please share it with the community!

Best regards,
Jun

2023年12月11日月曜日 15:02:41 UTC+9 Jun Ogawa:
Reply all
Reply to author
Forward
0 new messages