Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Call to participate: FreeCLAS - free commons of linguistically annotated speech

0 views
Skip to first unread message

mono...@nwlink.com

unread,
Sep 3, 2008, 8:03:29 PM9/3/08
to
Hi,

FreeCLAS(http://www.ihear.com/FreeCLAS) is a new project to build a a
data base of high-quality speech data. "High quality" means annotated
data that have been validated by humans. Building such a data base has
been expensive because it requires substantial investment of people's
attention. As a result, high-quality speech data is not generally
available.

FreeCLAS uses a wiki. This is a call for people to join the wiki to
build it. Embedded in the wiki is a tool, shva, which opens from your
browser to let you hear, view and annotate any utterance in FreeCLAS.
At this point, there is a seed data base of a small collection of
utterances annotated in en-US and IPA.

shva and other related software downloadable from FreeCLAS are all
Free Software, licensed under GPL or other compatible licenses. The
speech data is under the Creative Commons attribute-share-alike
license.

ter

0 new messages