Hello everyone,
Our June community call is coming up next week! It is scheduled for Tuesday, June 18, 16:00 UTC (8AM PST | 11PM EST | 16:00 GMT | 17:00 CET).
The theme for this session is "Whisper – Choosing a Model and Operationalizing a Speech to Text Pipeline." Whisper’s release has been catalytic and created incredible promise for transcribing AV materials. Over the last year, many institutions have gone from exploring Whisper to testing, designing and implementing a speech-to-text pipeline. This set of lightning talks will describe work in detail at three institutions on topics ranging from finding and fine tuning the right model to implementing an end-to-end service (including software for caption review and correction and media players).
Speakers:
o Javier de la Rosa, National Library of Norway AI-lab
o Oddmund Møgedal, University of Oslo on Autotext
o Alan Lundgard, Niqui O’Neil & Ed Summers, Stanford University LibrariesFurther details, including the joining details, can be found on our agenda/notes document here:
https://docs.google.com/document/d/1U0vrTo7OjZCMx9n-4a5toa-njjbsZHiFr2275k7f9wQ/edit?usp=sharing. Register your interest and mark yourself as attending!