Multimodal Person Discovery in Broadcast TV at MediaEval 2016
[1] Václav Hlaváč,et al. Detector of Facial Landmarks Learned by the Structured Output SVM , 2012, VISAPP.
[2] Jean-Marc Odobez,et al. EUMSSI: a Platform for Multimodal Analysis and Recommendation using UIMA , 2014, OIAF4HLT@COLING.
[3] Ngoc Thang Vu,et al. Speech recognition for machine translation in Quaero , 2011, IWSLT.
[4] Julie Mauclair,et al. Speaker Diarization: About whom the Speaker is Talking? , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.
[5] Jean-Marc Odobez,et al. Comparison of two methods for unsupervised person identification in TV shows , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).
[6] Georges Quénot,et al. Towards a Better Integration of Written Names for Unsupervised Speakers Identification in Videos , 2013, SLAM@INTERSPEECH.
[7] Sylvain Meignier,et al. Automatic named identification of speakers using diarization and ASR systems , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Olivier Galibert,et al. A presentation of the REPERE challenge , 2012, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI).
[9] Georges Quénot,et al. Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast , 2012, INTERSPEECH.
[10] Sophie Rosset,et al. Models Cascade for Tree-Structured Named Entity Detection , 2011, IJCNLP.
[11] Jun Yang,et al. Naming every individual in news video monologues , 2004, MULTIMEDIA '04.
[12] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Mickael Rouvier,et al. An open-source state-of-the-art toolbox for broadcast news diarization , 2013, INTERSPEECH.
[14] S. Chen,et al. Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion , 1998.
[15] Georges Linarès,et al. Multimodal understanding for person recognition in video broadcasts , 2014, INTERSPEECH.
[16] Georges Quénot,et al. QCompere @ REPERE 2013 , 2013, SLAM@INTERSPEECH.
[17] Delphine Charlet,et al. Unsupervised face identification in TV content using audio-visual sources , 2013, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI).
[18] Hervé Bredin,et al. Integer linear programming for speaker diarization and cross-modal identification in TV broadcast , 2013, INTERSPEECH.
[19] Olivier Galibert,et al. The First Official REPERE Evaluation , 2013, SLAM@INTERSPEECH.
[20] Paul Deléglise,et al. CRIM and LIUM approaches for multi-genre broadcast media transcription , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[21] Georges Quénot,et al. From Text Detection in Videos to Person Identification , 2012, 2012 IEEE International Conference on Multimedia and Expo.
[22] Georges Quénot,et al. Unsupervised Speaker Identification in TV Broadcast Based on Written Names , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[23] Anindya Roy,et al. Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast , 2014, International Journal of Multimedia Information Retrieval.
[24] Georges Quénot,et al. Naming multi-modal clusters to identify persons in TV broadcast , 2015, Multimedia Tools and Applications.
[25] Sophie Rosset,et al. Person Instance Graphs for Named Speaker Identification in TV Broadcast , 2014, Odyssey.
[26] Delphine Charlet,et al. Scene understanding for identifying persons in TV shows: Beyond face authentication , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).
[27] Takeo Kanade,et al. Name-It: Naming and Detecting Faces in News Videos , 1999, IEEE Multim..
[28] Olivier Galibert,et al. The REPERE Corpus : a multimodal corpus for person recognition , 2012, LREC.
[29] Jean-Marc Odobez,et al. Video text recognition using sequential Monte Carlo and error voting methods , 2005, Pattern Recognit. Lett..
[30] Rong Yan,et al. Multiple instance learning for labeling faces in broadcasting news video , 2005, MULTIMEDIA '05.
[31] Georges Linarès,et al. PERCOLI: A Person Identification System for the 2013 REPERE Challenge , 2013, SLAM@INTERSPEECH.
[32] Paul Deléglise,et al. Extracting true speaker identities from transcriptions , 2007, INTERSPEECH.
[33] Michael Felsberg,et al. Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.
[34] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[35] Cordelia Schmid,et al. Face recognition from caption-based supervision , 2010.
[36] Ricky Houghton. Named Faces: Putting Names to Faces , 1999, IEEE Intell. Syst..
[37] Jean-Luc Gauvain,et al. Speaker diarization from speech transcripts , 2004, INTERSPEECH.
[38] Sue Tranter. Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[39] L. Lamel,et al. A comparative study using manual and automatic transcriptions for diarization , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.