FreeCLAS (http://www.ihear.com/FreeCLAS) is a new project to build a
data base of high-quality speech data. "High quality" means annotated
data that has been validated by humans. Building such a data base has
been expensive because it requires substantial investment of people's
attention. As a result, high-quality speech data is not generally
available.
FreeCLAS uses a wiki. This is a call for people to join the wiki to
build it. Embedded in the wiki is a tool, shva, which opens from your
browser to let you hear, view and annotate any utterance in FreeCLAS.
At this point, there is a seed data base of a small collection of
utterances annotated in en-US and IPA.
shva and other related software downloadable from FreeCLAS are all
Free Software, licensed under the GPL or other compatible licenses. The
speech data is under the Creative Commons Attribution-ShareAlike
license.