Hi all, a friendly reminder that we will be hosting a talk by Tara Sainath from Google Research tomorrow at 4pm in CEPSR 750/Costa Commons. An abstract follows. Please forward to anyone you think would be interested. See you tomorrow!
"Towards End-To-End Speech Recognition"Tara Sainath, Google Research4pm, September 16thCosta Commons/CEPSR 750
Abstract:
In this talk, I will discuss various efforts in our group at Google towards replacing various parts of the acoustic modeling pipeline with neural networks. First, I will describe a new modeling approach known as Convolutional, Long Short-Term Memory, Deep Neural Networks (CLDNNs), and why this architecture makes sense for speech tasks. Next, I will talk about using CLDNNs for raw-waveform modeling, allowing us to remove front-end log-mel filterbank feature computation. Finally, I will discuss CTC, which allows us to remove the need for a prior alignment and CD states.