voice recognition homophones

733 views
Skip to first unread message

Mark Spahn

unread,
Feb 24, 2012, 10:37:07 AM2/24/12
to hon...@googlegroups.com
Anybody out their who uses voice recognition software?
With voice recognition, they're is always the problem
of homophones: words where there pronunciation is
the same but there spellings are different.

One solution is to proofread the text that is produced
from you vocal rendition. But to minimize or possibly
eliminate the need for a second pass through the text
produced from your speech, another solution would be
to modify your speech so that homophones never occur.

Thus, whenever you want to write "there", you say /grap/,
when you want to write "their", you say /thwin/, and
when you want to write "they're", you say /boorie/.
Here, "..." represents a word, and /.../ represents a
sound (pronunciation).
The idea is to assign each homophone-word a unique
pronunciation that is not used by any other word.
There are plenty of still-unused pronunciations to pick from,
as illustrated above.
In a one-time training session, you tell your VR software
that every time it hears /grap/, it should spell this word as
"there", and similarly /thwin/="their", /boorie/="they're".

So each VR writer develops his own set of substitute
pronunciations for homophone-words, and speaks in
a spelling-unambiguous idiolect. But what if these homophone
substitute pronunciations become standardized, so that
all VR writers use the same set of VR pronunciations?
These VR writers would constitute a subculture that
could conduct conversations in VR-speak, in which
each participant knows that /grap/ is the word "there", etc.
(Short-story writers, feel free to make of this idea
what you will.)

Question: Has anyone who uses VR already adopted
this brilliant idea? If you write by VR, how, in fact,
*do* you distinguish among there/their/they're ?

Side-question: East Asians (Chinese, Japanese, Koreans,
whose languages do not make the L/R distinction)
sometimes are unable to hear the difference between, say,
"lice" and "rice". But VR software *is* able to hear
the difference. How? Might VR be a pedagogical help
in getting East Asians to hear the L/R distinction, or
in getting gweilo to hear the different tones of Chinese?
-- Mark Spahn (West Seneca, NY)

Ray Roman

unread,
Feb 24, 2012, 10:52:49 AM2/24/12
to hon...@googlegroups.com
"The said they're going to visit their parents over there."

I just dictated this with Dragon Naturally Speaking without using any such tools.

On Fri, Feb 24, 2012 at 7:37 AM, Mark Spahn <mark...@verizon.net> wrote:
Anybody out their who uses voice recognition software?  With voice recognition, they're is always the problem of homophones:  words where there pronunciation is the same but there spellings are different.

One solution is to proofread the text that is produced from you vocal rendition. 

-- Mark Spahn  (West Seneca, NY)



--
Ray Roman 
Japanese to English legal translation
japane...@gmail.com

Alan Siegrist

unread,
Feb 24, 2012, 11:21:42 AM2/24/12
to hon...@googlegroups.com
Ray Roman writes:

> "The said they're going to visit their parents over there."
>
> I just dictated this with Dragon Naturally Speaking without
> using any such tools.

Hi Ray,

This is interesting. Was DNS responsible for making the mistake of "the"
instead of "they" as the first word in the sentence?

This seems to confirm my experience with DNS where it seems to make mistakes
not where you would expect, namely homonyms which it actually disambiguates
well, but rather in more subtle places.

Regards,

Alan Siegrist
Carmel, CA, USA

Ray Roman

unread,
Feb 24, 2012, 11:32:07 AM2/24/12
to hon...@googlegroups.com
Yes. DNS has trouble with "the" in my experience so far.


On Fri, Feb 24, 2012 at 8:21 AM, Alan Siegrist <AlanFS...@comcast.net> wrote:

This is interesting. Was DNS responsible for making the mistake of "the"
instead of "they" as the first word in the sentence?


Mark Spahn

unread,
Feb 24, 2012, 12:58:36 PM2/24/12
to hon...@googlegroups.com
The said they're going to visit their parents over there."
I just dictated this with Dragon Naturally Speaking without using any such tools.
Ray Roman 
==UNQUOTE==
 
Interesting.  So VR software actually has A.I.?
Can that be true?  "Oui, c'est VRAI."
I have heard of an instance where "saw it" was
misheard by DNS(?) as "sought".
Would this mishearing disappear for a DNS user
who adopted a New England accent, so that the
rhotacized sound /sawritt/ would be correctly
interpreted as "saw it"?
Reply all
Reply to author
Forward
0 new messages