论文信息 - Combination of phone N-grams for a MPEG-7-based spoken document retrieval system

Combination of phone N-grams for a MPEG-7-based spoken document retrieval system

In this paper, we present a phone-based approach of spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. The audio part of MPEG-7 aims at standardizing the indexing of audio documents. It encloses a SpokenContent tool that provides a description framework of the semantic content of speech signals. In the context of MPEG-7, we propose an indexing and retrieval method that uses phonetic information only and a vector space IR model. Different strategies based on the use of phone N-gram indexing terms are experimented.

Thomas Sikora | Nicolas Moreau | Hyoung-Gook Kim

[1] Michael McGill,et al. Introduction to Modern Information Retrieval , 1983 .

[2] Kenney Ng,et al. Subword-based approaches for spoken document retrieval , 2000, Speech Commun..

[3] Shih-Fu Chang,et al. Overview of the MPEG-7 standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[4] Martin Wechsler,et al. Spoken document retrieval based on phoneme recognition , 1998 .

[5] B. S. Manjunath,et al. Introduction to mpeg-7 , 2002 .

[6] Philip N. Garner,et al. SpokenContent representation in MPEG-7 , 2001, IEEE Trans. Circuits Syst. Video Technol..