It's not just what you say, but also how you say it : exploring the auditory and visual properties of speech prosody
暂无分享,去创建一个
[1] W. H. Sumby,et al. Visual contribution to speech intelligibility in noise , 1954 .
[2] P. Lieberman. Some Acoustic Correlates of Word Stress in American English , 1959 .
[3] Shinji Maeda,et al. A characterization of American English intonation , 1976 .
[4] H. McGurk,et al. Hearing lips and seeing voices , 1976, Nature.
[5] B. Walden,et al. Effects of training on the visual recognition of consonants. , 1977, Journal of speech and hearing research.
[6] D. Massaro,et al. Integration of featural information in speech perception. , 1978, Psychological review.
[7] John J. Ohala,et al. Production of Tone , 1978 .
[8] D. Ladd. The structure of intonational meaning , 1978 .
[9] Q Summerfield,et al. Use of Visual Information for Phonetic Perception , 1979, Phonetica.
[10] J. D. Pijper. Modelling British English Intonation: An Analysis by Resynthesis of British English Intonation , 1983 .
[11] M. Walker,et al. The expressive function of the eye flash , 1983 .
[12] S. McKee,et al. The detection of motion in the peripheral visual field , 1984, Vision Research.
[13] W. V. Summers. Effects of stress and final-consonant voicing on vowel production: articulatory and acoustic analyses. , 1987, The Journal of the Acoustical Society of America.
[14] R. Schulman,et al. Articulatory dynamics of loud and normal speech. , 1989, The Journal of the Acoustical Society of America.
[15] B. Repp,et al. Stimulus order effects in vowel discrimination. , 1990, The Journal of the Acoustical Society of America.
[16] Björn Lindblom,et al. Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .
[17] Hubert Partl. German TEX , 1990 .
[18] R. Krauss,et al. Do conversational hand gestures communicate? , 1991, Journal of personality and social psychology.
[19] Q. Summerfield,et al. Lipreading and audio-visual speech perception. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.
[20] L. Shriberg. Four new speech and prosody-voice measures for genetics research and other studies in developmental phonological disorders. , 1993, Journal of speech and hearing research.
[21] J. Streeck. Gesture as communication I: Its coordination with gaze and speech , 1993 .
[22] R. Krauss,et al. The Communicative Value of Conversational Hand Gesture , 1995 .
[23] R. Krauss,et al. Nonverbal Behavior and Nonverbal Communication: What do Conversational Hand Gestures Tell Us? , 1996 .
[24] B. Lindblom,et al. Role of articulation in speech perception: clues from production. , 1996, The Journal of the Acoustical Society of America.
[25] V. Gracco,et al. Functional data analyses of lip motion. , 1996, The Journal of the Acoustical Society of America.
[26] Elisabeth Selkirk,et al. Sentence Prosody: Intonation, Stress and Phrasing , 1996 .
[27] D. Ladd,et al. The perception of intonational emphasis: continuous or categorical? , 1997 .
[28] E. Vatikiotis-Bateson,et al. Eye movement of perceivers during audiovisualspeech perception , 1998, Perception & psychophysics.
[29] R. Krauss. Why Do We Gesture When We Speak? , 1998 .
[30] Evelyn McClave,et al. Pitch and Manual Gestures , 1998 .
[31] S Oviatt,et al. Modeling global and focal hyperarticulation during human-computer error resolution. , 1998, The Journal of the Acoustical Society of America.
[32] Hani Yehia,et al. Quantitative association of vocal-tract and facial behavior , 1998, Speech Commun..
[33] G. McConkie,et al. Attention to facial regions in segmental and prosodic visual speech perception tasks. , 1999, Journal of speech, language, and hearing research : JSLHR.
[34] D. Ladd,et al. Constant "segmental anchoring" of F0 movements under changes in speech rate. , 1999, The Journal of the Acoustical Society of America.
[35] E. Vatikiotis-Bateson,et al. Estimation and generalization of multimodal speech production , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).
[36] B. Wells,et al. Prosodic Variation in Southern British English , 2000, Language and speech.
[37] M. Pell. Influence of emotion and focus location on prosody in matched statements and questions. , 2001, The Journal of the Acoustical Society of America.
[38] Emiel Krahmer,et al. On the alleged existence of contrastive accents , 2001, Speech Commun..
[39] Marc Schröder,et al. The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching , 2003, Int. J. Speech Technol..
[40] M. Pell. Evaluation of Nonverbal Emotion in Face and Voice: Some Preliminary Findings on a New Battery of Tests , 2002, Brain and Cognition.
[41] Takaaki Kuratate,et al. Linking facial animation, head motion and speech acoustics , 2002, J. Phonetics.
[42] Takaaki Kuratate,et al. Video-based face motion measurement , 2002, J. Phonetics.
[43] C. W. Wightman. ToBI Or Not ToBI ? , 2002 .
[44] D. Massaro. Multimodal Speech Perception: A Paradigm for Speech Science , 2002 .
[45] J. Masefield,et al. "Sagging transitions" between high pitch accents in English: experimental evidence , 2003, J. Phonetics.
[46] K. Munhall,et al. Gaze behavior in audiovisual speech perception: The influence of ocular fixations on the McGurk effect , 2003, Perception & psychophysics.
[47] D. Massaro,et al. Perceiving Prosody from the Face and Voice: Distinguishing Statements from Echoic Questions in English , 2003, Language and speech.
[48] J. Schwartz,et al. Seeing to hear better: evidence for early audio-visual interactions in speech identification , 2004, Cognition.
[49] M. Swerts,et al. MORE ABOUT BROWS A Cross-Linguistic Study via Analysis-by-Synthesis , 2004 .
[50] Sharon M. Thomas,et al. Contributions of oral and extraoral facial movement to visual and audiovisual speech perception. , 2004, Journal of experimental psychology. Human perception and performance.
[51] Chiu-yu Tseng,et al. Fluent speech prosody: Framework and modeling , 2005, Speech Commun..
[52] Kevin G Munhall,et al. Empirical modeling of human face kinematics during speech using motion clustering. , 2005, The Journal of the Acoustical Society of America.
[53] Shinji Maeda,et al. Face models based on a guided PCA of motion-capture data: Speaker dependent variability in /s/-/R/ contrast production , 2005 .
[54] Yi Xu,et al. Phonetic realization of focus in English declarative intonation , 2005, J. Phonetics.
[55] Synnöve Carlson,et al. Conveyance of emotional connotations by a single word in English , 2005, Speech Commun..
[56] R A States,et al. Precision and repeatability of the Optotrak 3020 motion measurement system , 2006, Journal of medical engineering & technology.
[57] John C. Wells,et al. English Intonation : An Introduction , 2006 .
[58] D. Poeppel,et al. Temporal window of integration in auditory-visual speech perception , 2007, Neuropsychologia.
[59] M. Swerts,et al. The Effects of Visual Beats on Prosodic Prominence: Acoustic Analyses, Auditory Perception and Visual Perception. , 2007 .
[60] L. Maletsky,et al. Accuracy of an optical active-marker system to track the relative motion of rigid bodies. , 2007, Journal of biomechanics.
[61] Emiel Krahmer,et al. Facial expression and prosodic prominence: Effects of modality and facial area , 2008, J. Phonetics.
[62] Noah Silbert,et al. Focus, prosodic context, and phonological feature specification: patterns of variation in fricative production. , 2008, The Journal of the Acoustical Society of America.
[63] J. Vaissière. Perception of Intonation , 2008 .
[64] Mark G. Grotefend. The Perception Is... , 2009 .
[65] P. Keating,et al. Optical Phonetics and Visual Perception of Lexical and Phrasal Stress in English , 2009, Language and speech.
[66] M. Dohen,et al. Pointing is 'special' , 2009 .
[67] Paul L. Rosin,et al. Quantitative analysis of facial movement - A review of three-dimensional imaging techniques , 2009, Comput. Medical Imaging Graph..
[68] Devin R. Berg,et al. Precision, repeatability and accuracy of Optotrak® optical motion tracking systems , 2009 .
[69] Jeffrey B. Nyquist,et al. Spatial and temporal limits of motion perception across variations in speed, eccentricity, and low vision. , 2009, Journal of vision.
[70] Duane G. Watson. The Many Roads to Prominence , 2010 .
[71] Maureen Stone. Laboratory Techniques for Investigating Speech Articulation , 2010 .
[72] Duane G. Watson,et al. Experimental and theoretical advances in prosody: A review , 2010, Language and cognitive processes.
[73] Paul L. Rosin,et al. A comparison of the reproducibility of verbal and nonverbal facial gestures using three-dimensional motion analysis , 2010, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.
[74] Emiel Krahmer,et al. Visual prosody of newsreaders: Effects of information structure, emotional content and intended audience on facial expressions , 2010, J. Phonetics.
[75] B. McMurray,et al. What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations. , 2011, Psychological review.
[76] Yi Xu. SPEECH PROSODY : A METHODOLOGICAL REVIEW , 2011 .
[77] Cheyenne Munson,et al. Features as an emergent product of computing perceptual cues relative to expectations , 2011 .