AI4LAM Speech-to-Text WG, May 27, on transcript metadata

Owen King

May 22, 2025, 12:08:11 PM
to AI4LAM group
The AI4LAM Speech-to-Text Working Group invites you to its next meeting on Tuesday, May 27 at 09:00 US-Pacific | 12:00 US-Eastern | 17:00 UK | 18:00 Central Europe | 03:00 (+1 day) Canberra.

Our topic will be metadata for transcripts.  Many libraries, archives, and museums create and use transcripts associated with audiovisual resources.  Transcripts come from various sources, and they vary in quality and other characteristics.  For example, if a given transcript was created by Whisper, one would like to know:  With which model?  Using what parameter values?  Has it been corrected by a human?  According to what conventions?  Transcript metadata can answer such questions.

We will discuss recent attempts to organize transcript metadata, including FADGI's Guidelines for Embedding Metadata in WebVTT Files, available here: https://www.digitizationguidelines.gov/guidelines/accessibilty_WebVTT.html 

We will also discuss recent work in this group to define a more expansive set of elements, called Transcript Provenance Metadata Elements (TPME).  The draft specification, on which we are currently seeking feedback, is available here: https://docs.google.com/document/d/10hEbp_RkOeSm5uorTlI_QmyE0sD1G5OAkWUfHICf8W0

Please join us!



Cheers,
Owen (on behalf of the Speech-To-Text WG conveners)

Owen King (he/him)
Metadata Operations Specialist
E: owen...@wgbh.org
One Guest Street, Boston, MA 02135
