Nonlinear Video Diffusion based on Audio-Video Synchrony

[1]  Pierre Vandergheynst, et al. Blind Audiovisual Source Separation Based on Sparse Redundant Representations, 2010, IEEE Transactions on Multimedia.

[2]  Pierre Vandergheynst, et al. Audio-based nonlinear video diffusion, 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Pierre Vandergheynst, et al. Learning Multimodal Dictionaries, 2007, IEEE Transactions on Image Processing.

[4]  C. Sigg, et al. Nonnegative CCA for Audiovisual Source Separation, 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[5]  Michael Elad, et al. Pixels that sound, 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Tsuhan Chen, et al. Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition, 2005, IEEE Transactions on Multimedia.

[7]  Patrick Pérez, et al. Data fusion for visual tracking with particles, 2004, Proceedings of the IEEE.

[8]  Christian Jutten, et al. Developing an audio-visual speech source separation algorithm, 2004, Speech Commun.

[9]  Richard M. Dansereau, et al. Co-channel audiovisual speech separation using spectral matching constraints, 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Chalapathy Neti, et al. Recent advances in the automatic recognition of audiovisual speech, 2003, Proc. IEEE.

[11]  Chalapathy Neti, et al. Noisy audio feature enhancement using audio-visual speech data, 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Marie-Pierre Jolly, et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images, 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[13]  J. L. Schwartz, et al. Audio-visual enhancement of speech in noise, 2001, The Journal of the Acoustical Society of America.

[14]  J. Driver. Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading, 1996, Nature.

[15]  P. Lions, et al. Image selective smoothing and edge detection by nonlinear diffusion. II, 1992.

[16]  Jitendra Malik, et al. Scale-Space and Edge Detection Using Anisotropic Diffusion, 1990, IEEE Trans. Pattern Anal. Mach. Intell.

[17]  W. H. Sumby, et al. Visual contribution to speech intelligibility in noise, 1954.

[18]  L. Nirenberg. A strong maximum principle for parabolic equations, 1953.

[19]  Sebastian Lang, et al. Audiovisual Person Tracking with a Mobile Robot, 2004.

[20]  Sabri Gurbuz, et al. Moving-Talker, Speaker-Independent Feature Study, and Baseline Results Using the CUAVE Multimodal Speech Corpus, 2002, EURASIP J. Adv. Signal Process.

[21]  Malcolm Slaney, et al. FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks, 2000, NIPS.

[22]  Riccardo Leonardi, et al. Indexing audiovisual databases through joint audio and video processing, 1998, Int. J. Imaging Syst. Technol.