Synthesized voices -- better than ever

455 views
Skip to first unread message

CourVO

unread,
Oct 5, 2009, 9:45:45 PM10/5/09
to Voice Actors
BELOW IS THE TEXT OF A RECENT BLOG I WROTE AT COURVO.BIZ. I'D LIKE TO
KNOW WHAT YOU THINK ABOUT IT:
---------------------------------
There’s been a lot of traffic recently on a forum populated by
AudioBook readers, bantering about issue of computer-generated voices.

That topic is traditionally disdained by a group so dedicated to the
finer nuances of a good read. These are serious audio-book listeners
who celebrate the various human narrators, and the interpretation each
one brings to a narrative.

But something new and improved has surfaced, and it’s making some
converts even among this hard-core group of those favoring the real
human voice.

So listen to the samples at:
http://www.loquendo.com/en/demos/demo_tts.htm
and then I’ll finish up below. It’s an international site, so you
have to scroll down to hear the US/English samples.

------------------------------

So….whadya think? I agree, it’s the best computer generated voice
I’ve ever heard.

Much of the give ‘n’ take on this forum moved into the realm of where
the artistry is in this sort of software solution…and how would the
audiobook publisher business model change. Beyond that, the
discussion also addressed who has rights, and what is the revenue
stream. Can a programmer replace a narrator? How labor-intensive and
artistry-intensive is that?

This hard-core group of audiobook aficionados, agrees this is the best
“fake” voice they’ve heard, but also agree it’s not there…yet.

Which, of course prompts the question: “When?”.

Text-to-Speech and voice recognition programs (eg. Dragon Naturally
Speaking) have always been reliant on complex formulas or algorithms
that incorporate the finer points of artificial intelligence. They’ve
steadily gotten better with each new jump in computer speed and
function.

It’s likely that we’re not far from a computer-generated voice
accomplished enough to satisfy a sector of buyers who aren’t as
discerning as the audiobook group mentioned above.

So now, I’ll state the question that has already likely bubbled-up in
your own mind: “Is this likely to hurt yet another sector of jobs/
clients now available to us as voice-actors?”

Your thoughts?

CourVO

DansBigVoice

unread,
Oct 5, 2009, 10:52:50 PM10/5/09
to Voice Actors
After listening to several of the samples I was shocked at how close
the voices were to the real thing. The artificial voices are as close
as I have ever heard. It also concerns me as a narrator. As the
artificial voices get closer and closer to human voices, will
publishers for the sake of expediency turn to this technology in place
of a human narrator? I do believe no matter how close a computer
generated voice gets one cannot replace artistry.

David Sigmon

unread,
Oct 6, 2009, 3:31:08 AM10/6/09
to Voice Actors, CourVO
While artificial intelligence and computer technology may eventually be able to mimic the human voice flawlessly, it will never be able to mimic the human imagination or insight that comes from living one's life and experiencing our varied environments and relationships. No software will ever be able to adequately convey the emotional connection acquired when holding a crying newborn peeing all over your shirtfront, or the emotional letdown earned when failing to revive a drunken auto accident victim using CPR.  How can a computer be taught what it tastes like to have the victim's frothy vomit spew into your mouth as you try to breath life back into his failing lungs?

I'll bet a case of Wild Turkey that audiobook customers disappointment in the artificial voice will cause them to seek out a human narrated recording the next time they spend money to hear a story.  Anything with fiction, drama, or emotion, is probably safe.

All that said, non-fiction or text/reference books, as well as automated style text, may end up relying on computer voices. 

David Sigmon

Audiobook Narrator & Voice Actor
Da...@DavidSigmon.com
http://www.voiceoveruniverse.com/profile/DavidSigmon
757-339-8727
798 Argonne Avenue NE
Atlanta, GA 30308

--- On Mon, 10/5/09, CourVO <cou...@gmail.com> wrote:

Peter Drew

unread,
Oct 6, 2009, 11:00:50 AM10/6/09
to Voice Actors
Audiobook producers will certainly hold out longer against using
synthetic voices. The more immediate concern is the industrial/
corporation market and retail marketing on the Web via video. Why pay
someone to read an already dull script to accompany a human resources
video on the latest changes to the company's benefits package? With
hi-def hand-held cameras and desktop video production, many small
retailers can make their own videos or contract a local agency to
crank out a video for little money, saving even more by using a "voice
in a box."

It's not a matter of if but when. Technology marches on and the human
voice will be synthesized to a relatively high degree of realism and
natural character. Will a synthetic voice ever match the artistry and
subtlety of a well-trained and experienced actor? We'll just have to
wait and see...

I wrote about this a few years ago:

http://www.peterdrewvo.com/html/the_future_of_voiceovers.html

Cheers,

Peter Drew

Griz326

unread,
Oct 6, 2009, 2:27:14 PM10/6/09
to Voice Actors
>>> It's not a matter of if but when

Very true, Peter. In fact, I've put in a query for those voices
already. Depending upon how much computer directing is required to
make those synthetic voices work, it may be faster/easier/
moregooderbedder for me to use synthetic voices than to do narrations
myself. I'm looking forward to finding out.

There'll still be a market for live V/O, but synthetic voices will be
a real threat to lesser talent very soon. And I have a theory about
which markets synthetic voices will hit hard. IMO, short form work
will get it first. My reasoning for this is that the human ear/brain
will be annoyed by a computer voice in long form. My only "proof" of
this is in testing mic preamps. In :30 second A:B comparisons, the
differences between preamps were nearly impossible to hear; in long
form the differences in preamps became dramatic. My theory is that the
same will hold true for synthetic voices. Commercial tags, :10 VOs and
other vocal bits (...dare I say imaging which is pretty synthetic
already...) will be food for bottom feeders because there won't be
enough synthetic voice to offend the ears of most people.

Bobbin Beam

unread,
Oct 6, 2009, 3:47:21 PM10/6/09
to David Sigmon, Voice Actors, CourVO

I can’t believe that someday someone would pay somebody to take the time to try and articulate just the right nuance of read that can out of one person, which captures heart, brain and vocal instrument for every given piece of copy, and marketplace trend. Hell they can hire an actual voice actor for that, and most probably invest a lot less in the long run!

 

All The Best,

Bobbin Beam, Voice Actress 760-519-5072

Available via Source Connect, ISDN & phone patch

www.bobbinbeam.com

http://blog.bobbinbeam.com

 

Listen to or view my latest projects


</table


 

Peter Drew

unread,
Oct 6, 2009, 3:58:30 PM10/6/09
to Voice Actors
Very interesting info about pre-amps, Steve. I'm aware of the ear
fatigue factor
that too much compression can induce when listening to something for
over
about 8 minutes. Hadn't thought about how profound the effect simple
amplification and
the coloring from a pre-amp might be. I know all pre-amps have their
own "sound," even
the ones that are considered uncolored. Same goes for coloring in
different mics.

What you say about very short VO and synthesized voices does make
sense. It'll
suck for the VO community, for sure. We'll just have to hang onto our
mics and see
what happens.

Cheers,

Peter Drew
Reply all
Reply to author
Forward
0 new messages