It's not just what you say, but also how you say it : exploring the auditory and visual properties of speech prosody

........................................................................................................ xxxi Chapter

[1]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[2]  P. Lieberman Some Acoustic Correlates of Word Stress in American English , 1959 .

[3]  Shinji Maeda,et al.  A characterization of American English intonation , 1976 .

[4]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[5]  B. Walden,et al.  Effects of training on the visual recognition of consonants. , 1977, Journal of speech and hearing research.

[6]  D. Massaro,et al.  Integration of featural information in speech perception. , 1978, Psychological review.

[7]  John J. Ohala,et al.  Production of Tone , 1978 .

[8]  D. Ladd The structure of intonational meaning , 1978 .

[9]  Q Summerfield,et al.  Use of Visual Information for Phonetic Perception , 1979, Phonetica.

[10]  J. D. Pijper Modelling British English Intonation: An Analysis by Resynthesis of British English Intonation , 1983 .

[11]  M. Walker,et al.  The expressive function of the eye flash , 1983 .

[12]  S. McKee,et al.  The detection of motion in the peripheral visual field , 1984, Vision Research.

[13]  W. V. Summers Effects of stress and final-consonant voicing on vowel production: articulatory and acoustic analyses. , 1987, The Journal of the Acoustical Society of America.

[14]  R. Schulman,et al.  Articulatory dynamics of loud and normal speech. , 1989, The Journal of the Acoustical Society of America.

[15]  B. Repp,et al.  Stimulus order effects in vowel discrimination. , 1990, The Journal of the Acoustical Society of America.

[16]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[17]  Hubert Partl German TEX , 1990 .

[18]  R. Krauss,et al.  Do conversational hand gestures communicate? , 1991, Journal of personality and social psychology.

[19]  Q. Summerfield,et al.  Lipreading and audio-visual speech perception. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[20]  L. Shriberg Four new speech and prosody-voice measures for genetics research and other studies in developmental phonological disorders. , 1993, Journal of speech and hearing research.

[21]  J. Streeck Gesture as communication I: Its coordination with gaze and speech , 1993 .

[22]  R. Krauss,et al.  The Communicative Value of Conversational Hand Gesture , 1995 .

[23]  R. Krauss,et al.  Nonverbal Behavior and Nonverbal Communication: What do Conversational Hand Gestures Tell Us? , 1996 .

[24]  B. Lindblom,et al.  Role of articulation in speech perception: clues from production. , 1996, The Journal of the Acoustical Society of America.

[25]  V. Gracco,et al.  Functional data analyses of lip motion. , 1996, The Journal of the Acoustical Society of America.

[26]  Elisabeth Selkirk,et al.  Sentence Prosody: Intonation, Stress and Phrasing , 1996 .

[27]  D. Ladd,et al.  The perception of intonational emphasis: continuous or categorical? , 1997 .

[28]  E. Vatikiotis-Bateson,et al.  Eye movement of perceivers during audiovisualspeech perception , 1998, Perception & psychophysics.

[29]  R. Krauss Why Do We Gesture When We Speak? , 1998 .

[30]  Evelyn McClave,et al.  Pitch and Manual Gestures , 1998 .

[31]  S Oviatt,et al.  Modeling global and focal hyperarticulation during human-computer error resolution. , 1998, The Journal of the Acoustical Society of America.

[32]  Hani Yehia,et al.  Quantitative association of vocal-tract and facial behavior , 1998, Speech Commun..

[33]  G. McConkie,et al.  Attention to facial regions in segmental and prosodic visual speech perception tasks. , 1999, Journal of speech, language, and hearing research : JSLHR.

[34]  D. Ladd,et al.  Constant "segmental anchoring" of F0 movements under changes in speech rate. , 1999, The Journal of the Acoustical Society of America.

[35]  E. Vatikiotis-Bateson,et al.  Estimation and generalization of multimodal speech production , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[36]  B. Wells,et al.  Prosodic Variation in Southern British English , 2000, Language and speech.

[37]  M. Pell Influence of emotion and focus location on prosody in matched statements and questions. , 2001, The Journal of the Acoustical Society of America.

[38]  Emiel Krahmer,et al.  On the alleged existence of contrastive accents , 2001, Speech Commun..

[39]  Marc Schröder,et al.  The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching , 2003, Int. J. Speech Technol..

[40]  M. Pell Evaluation of Nonverbal Emotion in Face and Voice: Some Preliminary Findings on a New Battery of Tests , 2002, Brain and Cognition.

[41]  Takaaki Kuratate,et al.  Linking facial animation, head motion and speech acoustics , 2002, J. Phonetics.

[42]  Takaaki Kuratate,et al.  Video-based face motion measurement , 2002, J. Phonetics.

[43]  C. W. Wightman ToBI Or Not ToBI ? , 2002 .

[44]  D. Massaro Multimodal Speech Perception: A Paradigm for Speech Science , 2002 .

[45]  J. Masefield,et al.  "Sagging transitions" between high pitch accents in English: experimental evidence , 2003, J. Phonetics.

[46]  K. Munhall,et al.  Gaze behavior in audiovisual speech perception: The influence of ocular fixations on the McGurk effect , 2003, Perception & psychophysics.

[47]  D. Massaro,et al.  Perceiving Prosody from the Face and Voice: Distinguishing Statements from Echoic Questions in English , 2003, Language and speech.

[48]  J. Schwartz,et al.  Seeing to hear better: evidence for early audio-visual interactions in speech identification , 2004, Cognition.

[49]  M. Swerts,et al.  MORE ABOUT BROWS A Cross-Linguistic Study via Analysis-by-Synthesis , 2004 .

[50]  Sharon M. Thomas,et al.  Contributions of oral and extraoral facial movement to visual and audiovisual speech perception. , 2004, Journal of experimental psychology. Human perception and performance.

[51]  Chiu-yu Tseng,et al.  Fluent speech prosody: Framework and modeling , 2005, Speech Commun..

[52]  Kevin G Munhall,et al.  Empirical modeling of human face kinematics during speech using motion clustering. , 2005, The Journal of the Acoustical Society of America.

[53]  Shinji Maeda,et al.  Face models based on a guided PCA of motion-capture data: Speaker dependent variability in /s/-/R/ contrast production , 2005 .

[54]  Yi Xu,et al.  Phonetic realization of focus in English declarative intonation , 2005, J. Phonetics.

[55]  Synnöve Carlson,et al.  Conveyance of emotional connotations by a single word in English , 2005, Speech Commun..

[56]  R A States,et al.  Precision and repeatability of the Optotrak 3020 motion measurement system , 2006, Journal of medical engineering & technology.

[57]  John C. Wells,et al.  English Intonation : An Introduction , 2006 .

[58]  D. Poeppel,et al.  Temporal window of integration in auditory-visual speech perception , 2007, Neuropsychologia.

[59]  M. Swerts,et al.  The Effects of Visual Beats on Prosodic Prominence: Acoustic Analyses, Auditory Perception and Visual Perception. , 2007 .

[60]  L. Maletsky,et al.  Accuracy of an optical active-marker system to track the relative motion of rigid bodies. , 2007, Journal of biomechanics.

[61]  Emiel Krahmer,et al.  Facial expression and prosodic prominence: Effects of modality and facial area , 2008, J. Phonetics.

[62]  Noah Silbert,et al.  Focus, prosodic context, and phonological feature specification: patterns of variation in fricative production. , 2008, The Journal of the Acoustical Society of America.

[63]  J. Vaissière Perception of Intonation , 2008 .

[64]  Mark G. Grotefend The Perception Is... , 2009 .

[65]  P. Keating,et al.  Optical Phonetics and Visual Perception of Lexical and Phrasal Stress in English , 2009, Language and speech.

[66]  M. Dohen,et al.  Pointing is 'special' , 2009 .

[67]  Paul L. Rosin,et al.  Quantitative analysis of facial movement - A review of three-dimensional imaging techniques , 2009, Comput. Medical Imaging Graph..

[68]  Devin R. Berg,et al.  Precision, repeatability and accuracy of Optotrak® optical motion tracking systems , 2009 .

[69]  Jeffrey B. Nyquist,et al.  Spatial and temporal limits of motion perception across variations in speed, eccentricity, and low vision. , 2009, Journal of vision.

[70]  Duane G. Watson The Many Roads to Prominence , 2010 .

[71]  Maureen Stone Laboratory Techniques for Investigating Speech Articulation , 2010 .

[72]  Duane G. Watson,et al.  Experimental and theoretical advances in prosody: A review , 2010, Language and cognitive processes.

[73]  Paul L. Rosin,et al.  A comparison of the reproducibility of verbal and nonverbal facial gestures using three-dimensional motion analysis , 2010, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[74]  Emiel Krahmer,et al.  Visual prosody of newsreaders: Effects of information structure, emotional content and intended audience on facial expressions , 2010, J. Phonetics.

[75]  B. McMurray,et al.  What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations. , 2011, Psychological review.

[76]  Yi Xu SPEECH PROSODY : A METHODOLOGICAL REVIEW , 2011 .

[77]  Cheyenne Munson,et al.  Features as an emergent product of computing perceptual cues relative to expectations , 2011 .