Irving Ruan
Aug 22, 2010, 7:50:37 PM
to VocalKit
Hello VK community,
I came across VocalKit while researching speech recognition
libraries for an iPhone app I am building. It's a neat wrapper
for PocketSphinx, and should be cool to use.
While reading through the archive of this group, I realized that
the biggest issue most people are facing is dealing with large
dictionary files and many words to recognize. Namely, the wsj model
has 5k words, and it has a high error rate because the recognizer has
so many words to choose between. However, changing it to include only
the words "YES NO GOODBYE" allows the app to recognize them >95% of
the time, since it's only handling three words. While this is really
cool, I'm wondering what sort of solutions we can implement to
overcome this obvious road bump. Maybe I'm just confused about how
PocketSphinx works internally, but why would the default 5k language
model and dictionary file have very little, if any, accuracy with
speech-to-text recognition?
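To make the "YES NO GOODBYE" case concrete, here is a rough sketch of the kind of trimming I mean. It assumes the dictionary is a plain-text file in the usual CMU style (one entry per line, the word followed by its phones, with alternate pronunciations written as WORD(2)). The function name and the sample entries are made up for illustration and are not copied from VocalKit or the wsj files, and the language model would presumably need to be rebuilt to match the smaller word list as well.

```python
import re

def filter_dictionary(lines, keep_words):
    """Return only the dictionary entries whose headword is in keep_words.

    Hypothetical helper: assumes CMU-style lines such as "YES Y EH S".
    """
    keep = {w.upper() for w in keep_words}
    kept = []
    for line in lines:
        parts = line.split()
        if not parts:
            continue  # skip blank lines
        # Alternate pronunciations look like "WORD(2)"; strip that suffix
        # so they match the plain headword.
        headword = re.sub(r"\(\d+\)$", "", parts[0]).upper()
        if headword in keep:
            kept.append(line.rstrip("\n"))
    return kept

# Illustrative entries only, not taken from a real dictionary file.
sample = [
    "YES Y EH S",
    "NO N OW",
    "GOODBYE G UH D B AY",
    "GOODBYE(2) G UH B AY",
    "HELLO HH AH L OW",
]

small = filter_dictionary(sample, ["yes", "no", "goodbye"])
for entry in small:
    print(entry)
```

This only shrinks the word list; it does nothing to help with the larger question of why the full 5k setup performs so poorly.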
VocalKit is a great wrapper for PocketSphinx, and I foresee it
becoming widely used if a lot of development is done on it. Thanks in
advance!
-I.