Content-Based Retrieval of Audio in News Broadcasts

This paper describes a complete, scalable, and extensible content-based retrieval system for news broadcasts. Building on the segmentation results for the selected audio data, the system lets users query audio semantically through both domain-based fuzzy classes (anchor, commercial, reporter, sports, transition, weatherforecast, and venuesound) and similarity search. Two kinds of experiments were conducted on the audio tracks of TRECVID news broadcasts to evaluate the performance of the proposed query-by-example technique. The results show that the Audio Spectrum Flatness feature of the MPEG-7 standard performs better on music samples than on other kinds of audio samples, and that the system is robust under different conditions.
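
As a rough illustration of the flatness idea behind the feature named above, the following sketch computes a single-band spectral flatness (the geometric mean of the power spectrum divided by its arithmetic mean) and ranks candidate frames by how close their flatness is to that of a query frame. It is a simplified stand-in, not the MPEG-7 Audio Spectrum Flatness descriptor (which is computed per frequency band) and not the system described in this paper. The function names, the naive DFT, the `eps` floor, and the absolute-difference distance are all assumptions made for the example.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive O(n^2) DFT power spectrum of one frame (bins 1..n/2, DC skipped)."""
    n = len(frame)
    spectrum = []
    for k in range(1, n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def flatness(power, eps=1e-12):
    """Geometric mean over arithmetic mean of the power coefficients.

    Values near 1 indicate a noise-like (flat) spectrum; values near 0
    indicate a tonal (peaky) spectrum. eps is an assumed floor that
    keeps the logarithm defined for empty bins.
    """
    floored = [p + eps for p in power]
    log_mean = sum(math.log(p) for p in floored) / len(floored)
    return math.exp(log_mean) / (sum(floored) / len(floored))


def rank_by_similarity(query, candidates):
    """Sort candidate names by absolute flatness difference to the query frame."""
    q = flatness(power_spectrum(query))
    return sorted(
        candidates,
        key=lambda name: abs(flatness(power_spectrum(candidates[name])) - q),
    )


if __name__ == "__main__":
    n = 64
    tone_a = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
    tone_b = [math.sin(2 * math.pi * 6 * t / n) for t in range(n)]
    impulse = [1.0] + [0.0] * (n - 1)  # flat spectrum
    print(rank_by_similarity(tone_a, {"impulse": impulse, "tone": tone_b}))
```

In this toy setup a pure tone yields a flatness close to 0 and an impulse yields a flatness close to 1, so a tonal query ranks the other tone ahead of the impulse. A real query-by-example system would compare richer per-band feature vectors over many frames.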
