Nonlinear Video Diffusion based on Audio-Video Synchrony
[1] Pierre Vandergheynst, et al. Blind Audiovisual Source Separation Based on Sparse Redundant Representations, 2010, IEEE Transactions on Multimedia.
[2] Pierre Vandergheynst, et al. Audio-based nonlinear video diffusion, 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[3] Pierre Vandergheynst, et al. Learning Multimodal Dictionaries, 2007, IEEE Transactions on Image Processing.
[4] C. Sigg, et al. Nonnegative CCA for Audiovisual Source Separation, 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.
[5] Michael Elad, et al. Pixels that sound, 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[6] Tsuhan Chen, et al. Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition, 2005, IEEE Transactions on Multimedia.
[7] Patrick Pérez, et al. Data fusion for visual tracking with particles, 2004, Proceedings of the IEEE.
[8] Christian Jutten, et al. Developing an audio-visual speech source separation algorithm, 2004, Speech Commun.
[9] Richard M. Dansereau, et al. Co-channel audiovisual speech separation using spectral matching constraints, 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10] Chalapathy Neti, et al. Recent advances in the automatic recognition of audiovisual speech, 2003, Proc. IEEE.
[11] Chalapathy Neti, et al. Noisy audio feature enhancement using audio-visual speech data, 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[12] Marie-Pierre Jolly, et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images, 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[13] J. L. Schwartz, et al. Audio-visual enhancement of speech in noise, 2001, The Journal of the Acoustical Society of America.
[14] J. Driver. Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading, 1996, Nature.
[15] P. Lions, et al. Image selective smoothing and edge detection by nonlinear diffusion. II, 1992.
[16] Jitendra Malik, et al. Scale-Space and Edge Detection Using Anisotropic Diffusion, 1990, IEEE Trans. Pattern Anal. Mach. Intell.
[17] W. H. Sumby, et al. Visual contribution to speech intelligibility in noise, 1954.
[18] L. Nirenberg. A strong maximum principle for parabolic equations, 1953.
[19] Sebastian Lang, et al. Audiovisual Person Tracking with a Mobile Robot, 2004.
[20] Sabri Gurbuz, et al. Moving-Talker, Speaker-Independent Feature Study, and Baseline Results Using the CUAVE Multimodal Speech Corpus, 2002, EURASIP J. Adv. Signal Process.
[21] Malcolm Slaney, et al. FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks, 2000, NIPS.
[22] Riccardo Leonardi, et al. Indexing audiovisual databases through joint audio and video processing, 1998, Int. J. Imaging Syst. Technol.