The AI4LAM Speech-to-Text Working Group requests feedback on a draft specification for Transcript Provenance Metadata Elements (TPME).
Motivation: Many libraries, archives, and museums create and use transcripts associated with audiovisual resources. Transcripts come from various sources, and they vary in quality and other characteristics. For example, if a given transcript was created by Whisper, one would like to know: With which model? Using what parameter values? Has it been corrected by a human? According to what conventions? TPME is intended to capture this kind of data.
If you are part of an organization that needs this kind of data, we invite your feedback on the draft available here:
Several options for offering feedback:
- Add comments directly to the Google Doc
With kind regards,
Owen King
and the organizers of the AI4LAM Speech-to-Text Working Group
Owen King
Metadata Operations Specialist
owen...@wgbh.org
One Guest Street, Boston MA 02135