Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Speech dataset availability.

79 views
Skip to first unread message

gert...@gmail.com

unread,
May 31, 2015, 9:23:59 PM5/31/15
to
Other than TIDigits and TIMIT, are there any other speech databases available (hopefully free)?

The project I am working on involves speech recognition and speaker identification. So ideally, a database with a limited set of speech classes (such as 0~9 digits) from many people, and/or a database with speeches from a medium (perhaps 10 ~ 30) set of people with extensive (i.e. more than 0~9 utterances that is) speech utterances.

Any help would be appreciated

Thanks

Nickolay Shmyrev

unread,
Jun 2, 2015, 4:13:02 AM6/2/15
to
Free modern databases:

Tedlium (lectures)

http://www-lium.univ-lemans.fr/en/content/ted-lium-corpus

Librispeech (audiobooks)

http://www.openslr.org/12/

Voxforge (multilingual)

http://www.voxforge.org

Brian Smith

unread,
Jun 2, 2015, 4:55:46 AM6/2/15
to
Thanks for these.

gert...@gmail.com

unread,
Jun 4, 2015, 9:57:21 PM6/4/15
to
Thank you Nickolay !
0 new messages