NETWORK TRAINING FOR CONTINUOUS SPEECH RECOGNITION (WedAmOR2)

Author(s) :

Issac Alphonso	(Institute for Signal and Information Processing, USA)
Joseph Picone	(Institute for Signal and Information Processing, USA)

Abstract :

The standard training approach for a hidden Markov model (HMM) based speech recognition system uses an expectation maximization (EM) based supervised training framework to estimate parameters. EM-based parameter estimation for speech recognition is performed using several complicated stages of iterative reestimation. These stages are heuristic in nature and prone to human error. This paper describes a new training recipe that reduces the complexity of the training process, while retaining the robustness of the EM-based supervised training framework. This paper show that the network training recipe can achieve comparable recognition performance to a traditional trainer while alleviating the need for complicated systems and training recipes for spoken language processing systems.

Menu