Discovering meaningful multimedia patterns with audio-visual concepts and associated text
暂无分享,去创建一个
Shih-Fu Chang | Ching-Yung Lin | Ajay Divakaran | Huifang Sun | Lexing Xie | Lyndon S. Kennedy | Shih-Fu Chang | L. Kennedy | Ching-Yung Lin | Ajay Divakaran | Lexing Xie | Huifang Sun
[1] Howard D. Wactlar,et al. Associating video frames with text , 2003 .
[2] Shih-Fu Chang,et al. Unsupervised Mining of Statistical Temporal Structures in Video , 2003 .
[3] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.
[4] James H. Martin,et al. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.
[5] Refractor. Vision , 2000, The Lancet.
[6] David Marr,et al. VISION A Computational Investigation into the Human Representation and Processing of Visual Information , 2009 .
[7] Christopher C. White,et al. Focus on Durability, PATH Research at the National Institute of Standards and Technology | NIST , 2001 .
[8] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.