COMBINATION OF PHONE N-GRAMS FOR A MPEG-7-BASED SPOKEN DOCUMENT RETRIEVAL SYSTEM (WedAmOR2)
Author(s) :
Nicolas Moreau (Technical University of Berlin, Germany)
Hyoung-Gook Kim (Technical University of Berlin, Germany)
Thomas Sikora (Technical University of Berlin, Germany)
Abstract : In this paper, we present a phone-based approach of spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. The audio part of MPEG-7 aims at standardizing the indexing of audio documents. It encloses a SpokenContent tool that provides a description framework of the semantic content of speech signals. In the context of MPEG-7, we propose an indexing and retrieval method that uses phonetic information only and a vector space IR model. Different strategies based on the use of phone N-gram indexing terms are experimented.

Menu