EUSIPCO'2002 banner

Paper data
Time sequence information within a Gaussian mixture model

Stapert Robert, ACULAB
Mason John, University of Wales Swansea

Page numbers in the proceedings:
Volume III pp 595-598

Language and Speech Recognition

Paper abstract
This paper addresses the task of text independent speaker recognition and in particular looks at capturing time sequence information within the modelling process itself. A recent extension to the popular Gaussian mixture model (GMM) is the segmental mixture model (SMM), and its advantages are thought to be more pronounced as more and more training data becomes available. Here this idea is examined along with a hypothesis on model size, model complexity and their dependencies on the quantity of available training data. Experimental results on a 2000 speaker database show that an SMM does offer better recognition results than a GMM once a threshold in the amount of training data has been reached.

A PDF version is available here