Audio-Visual Speaker Localization Using Graphical Models
暂无分享,去创建一个
Jean Ponce | Fei-Fei Li | Thomas S. Huang | Akash Kushal | Mandar Rahurkar | Thomas S. Huang | Li Fei-Fei | J. Ponce | Akash Kushal | Mandar Rahurkar
[1] A FischlerMartin,et al. Random sample consensus , 1981 .
[2] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.
[3] Michael Isard,et al. Active Contours , 2000, Springer London.
[4] Vladimir Pavlovic,et al. Audio-visual speaker detection using dynamic Bayesian networks , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).
[5] Brendan J. Frey,et al. Transformed hidden Markov models: estimating mixture models of images and inferring spatial transformations in video sequences , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).
[6] Brendan J. Frey,et al. Fast, Large-Scale Transformation-Invariant Clustering , 2001, NIPS.
[7] Brendan J. Frey,et al. Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[8] Patrick Pérez,et al. Sequential Monte Carlo Fusion of Sound and Vision for Speaker Tracking , 2001, ICCV.
[9] Nebojsa Jojic,et al. A Graphical Model for Audiovisual Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..