论文信息 - Nonlinear Video Diffusion based on Audio-Video Synchrony - 字舞流文

Nonlinear Video Diffusion based on Audio-Video Synchrony

Pierre Vandergheynst | Anna Llagostera Casanovas | P. Vandergheynst | A. L. Casanovas

[1] Pierre Vandergheynst,et al. Blind Audiovisual Source Separation Based on Sparse Redundant Representations , 2010, IEEE Transactions on Multimedia.

[2] Pierre Vandergheynst,et al. Audio-based nonlinear video diffusion , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3] Pierre Vandergheynst,et al. Learning Multimodal Dictionaries , 2007, IEEE Transactions on Image Processing.

[4] C. Sigg,et al. Nonnegative CCA for Audiovisual Source Separation , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[5] Michael Elad,et al. Pixels that sound , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6] Tsuhan Chen,et al. Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition , 2005, IEEE Transactions on Multimedia.

[7] Patrick Pérez,et al. Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[8] Christian Jutten,et al. Developing an audio-visual speech source separation algorithm , 2004, Speech Commun..

[9] Richard M. Dansereau,et al. Co-channel audiovisual speech separation using spectral matching constraints , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10] Chalapathy Neti,et al. Recent advances in the automatic recognition of audiovisual speech , 2003, Proc. IEEE.

[11] Chalapathy Neti,et al. Noisy audio feature enhancement using audio-visual speech data , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[13] J L Schwartz,et al. Audio-visual enhancement of speech in noise. , 2001, The Journal of the Acoustical Society of America.

[14] J. Driver. Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading , 1996, Nature.

[15] P. Lions,et al. Image selective smoothing and edge detection by nonlinear diffusion. II , 1992 .

[16] Jitendra Malik,et al. Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[17] W. H. Sumby,et al. Visual contribution to speech intelligibility in noise , 1954 .

[18] L. Nirenberg. A strong maximum principle for parabolic equations , 1953 .

[19] Sebastian Lang,et al. Audiovisual Person Tracking with a Mobile Robot , 2004 .

[20] Sabri Gurbuz,et al. Moving-Talker, Speaker-Independent Feature Study, and Baseline Results Using the CUAVE Multimodal Speech Corpus , 2002, EURASIP J. Adv. Signal Process..

[21] Malcolm Slaney,et al. FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks , 2000, NIPS.

[22] Riccardo Leonardi,et al. Indexing audiovisual databases through joint audio and video processing , 1998, Int. J. Imaging Syst. Technol..