Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues
暂无分享,去创建一个
John R. Smith | Ching-Yung Lin | Harriet J. Nock | Giridharan Iyengar | Chalapathy Neti | Milind R. Naphade | W. H. Adams | M. Naphade | Ching-Yung Lin | John R. Smith | H. Nock | C. Neti | G. Iyengar | W. H. Adams
[1] M. Ibrahim Sezan,et al. A computational approach to semantic event detection , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).
[2] Yoni Bauduin,et al. Audio-Visual Speech Recognition , 2004 .
[3] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[4] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[5] M. Casey. Reduced-Rank Spectra and Minimum-Entropy Priors as Consistent and Reliable Cues for Generalized Sound Recognition , 2001 .
[6] Nuno Vasconcelos,et al. Bayesian modeling of video editing and structure: semantic features for video summarization and browsing , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).
[7] Malcolm Slaney,et al. Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[8] Stéphane H. Maes,et al. Transcription of broadcast news-system robustness issues and adaptation techniques , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[9] A. Murat Tekalp,et al. Probabilistic Analysis and Extraction of Video Content , 1999, ICIP.
[10] Vladimir Vapnik,et al. The Nature of Statistical Learning , 1995 .
[11] Giridharan Iyengar,et al. Models for automatic classification of video sequences , 1997, Electronic Imaging.
[12] Kevin Murphy,et al. Bayes net toolbox for Matlab , 1999 .
[13] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[14] Daniel Patrick Whittlesey Ellis,et al. Prediction-driven computational auditory scene analysis , 1996 .
[15] Brendan J. Frey,et al. Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).
[16] Salim Roukos,et al. TREC-6 Ad-Hoc Retrieval , 1997, TREC.
[17] C.-C. Jay Kuo,et al. Integrated approach to multimodal media content analysis , 1999, Electronic Imaging.
[18] Marti A. Hearst. Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..
[19] Shih-Fu Chang,et al. VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.
[20] Marc Davis,et al. Media Streams: an iconic visual language for video annotation , 1993, Proceedings 1993 IEEE Symposium on Visual Languages.
[21] David A. Forsyth,et al. Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[22] Robert B. McGhee,et al. Aircraft Identification by Moment Invariants , 1977, IEEE Transactions on Computers.
[23] Giridharan Iyengar,et al. Speaker change detection using joint audio-visual statistics , 2000, RIAO.
[24] Anil K. Jain,et al. Shape-Based Retrieval: A Case Study With Trademark Image Databases , 1998, Pattern Recognit..
[25] John R. Smith,et al. Integrating Features, Models, and Semantics for TREC Video Retrieval , 2001, TREC.
[26] Wayne H. Wolf,et al. Hidden Markov model parsing of video programs , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[27] Svetha Venkatesh,et al. Towards automatic extraction of expressive elements from motion pictures: tempo , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).
[28] Brian Kingsbury,et al. Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[29] Christopher M. Bishop,et al. Neural networks for pattern recognition , 1995 .
[30] Svetha Venkatesh,et al. Toward automatic extraction of expressive elements from motion pictures: tempo , 2002, IEEE Trans. Multim..
[31] Wei Xiong,et al. Query by video clip , 1999, Multimedia Systems.
[32] Stephen E. Robertson,et al. Okapi at TREC-3 , 1994, TREC.
[33] Stephen E. Robertson,et al. GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .
[34] Michael A. Casey. Reduced-Rank Spectra and Minimum Entropy Priors for Generalized Sound Recognition , 2001 .
[35] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.