Production and Convergence of Multiscale Clustering in Speech

Language entails the scaling of variability across levels of measurement—small linguistic variations occur at the millisecond level, larger variations occur at the next level, and even larger variations occur over longer timescales. For acoustic onsets in speech signals, small temporal variations occur at the phonetic level, larger variations occur at the phrasal level, and even larger variations occur at the conversational level. Scaling across levels of measurement can be quantified in terms of power law distributions. In this article we review recent investigations into power law clustering of acoustic speech onsets. Studies demonstrate that the multiscale clustering in onsets reflects communicative aspects of speech in adult conversations as well as infant vocalizations. We also review evidence that multiscale clustering in the vocalizations of individuals converges during vocal interactions. We relate multiscale convergence to the notion of complexity matching, that is, the hypothesis that maximal information transfer occurs when the power laws of 2 interacting complex systems are matched. We conclude by discussing potential extensions of this work including estimating the multifractal structure of speech and testing the maximal information transfer prediction of complexity matching.

[1]  J. K. Bock Syntactic persistence in language production , 1986, Cognitive Psychology.

[2]  D. W. Allan,et al.  Statistics of atomic frequency standards , 1966 .

[3]  Damian G. Kelty-Stephen,et al.  Temporal correlations in postural sway moderate effects of stochastic resonance on postural stability. , 2013, Human movement science.

[4]  Eduardo G. Altmann,et al.  On the origin of long-range correlations in texts , 2012, Proceedings of the National Academy of Sciences.

[5]  P. Latham,et al.  References and Notes Supporting Online Material Materials and Methods Figs. S1 to S11 References Movie S1 Optimally Interacting Minds R�ports , 2022 .

[6]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[7]  D. Delignières,et al.  Strong anticipation: complexity matching in interpersonal coordination , 2012, Experimental Brain Research.

[8]  B. Vereijken,et al.  Multifractal formalisms of human behavior. , 2013, Human movement science.

[9]  Jennifer S. Pardo,et al.  Measuring phonetic convergence in speech production , 2013, Front. Psychol..

[10]  Manuel Varlet,et al.  Predicting the biological variability of environmental rhythms: Weak or strong anticipation for sensorimotor synchronization? , 2013, Brain and Cognition.

[11]  Michael T. Turvey,et al.  Strong anticipation: Sensitivity to long-range correlations in synchronization behavior , 2008 .

[12]  Sam Tilsen,et al.  Multitimescale Dynamical Interactions Between Speech Rhythm and Gesture , 2009, Cogn. Sci..

[13]  A. Opstal Dynamic Patterns: The Self-Organization of Brain and Behavior , 1995 .

[14]  Daniel S. Messinger,et al.  Simple mathematical law benchmarks human confrontations , 2013, Scientific Reports.

[15]  Jennifer S. Pardo,et al.  On phonetic convergence during conversational interaction. , 2006, The Journal of the Acoustical Society of America.

[16]  Christopher T. Kello,et al.  The Pervasiveness of 1/f Scaling in Speech Reflects the Metastable Basis of Cognition , 2008, Cogn. Sci..

[17]  James A. Dixon,et al.  Strong anticipation: Multifractal cascade dynamics modulate scaling in synchronization behaviors , 2011 .

[18]  Christopher T. Kello,et al.  Intrinsic and extrinsic contributions to heavy tails in visual foraging , 2014 .

[19]  Beatrix Vereijken,et al.  Interaction-dominant dynamics in human cognition: beyond 1/f(alpha) fluctuation. , 2010, Journal of experimental psychology. General.

[20]  Luis Alvarez,et al.  Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications , 2012, Lecture Notes in Computer Science.

[21]  J. Collins,et al.  Random walking during quiet standing. , 1994, Physical review letters.

[22]  Michael T. Turvey,et al.  On strong anticipation , 2010, Cognitive Systems Research.

[23]  Claudia Carello,et al.  Haptic perceptual intent in quiet standing affects multifractal scaling of postural fluctuations. , 2014, Journal of experimental psychology. Human perception and performance.

[24]  Christopher T. Kello,et al.  The emergent coordination of cognitive function. , 2007, Journal of experimental psychology. General.

[25]  H. Stanley,et al.  Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series. , 1995, Chaos.

[26]  Andreas Daffertshofer,et al.  ‘Human paced’ walking: Followers adopt stride time dynamics of leaders , 2014, Neuroscience Letters.

[27]  P. Jusczyk,et al.  Clauses are perceptual units for young infants , 1987, Cognition.

[28]  John G. Holden,et al.  The Self-Organization of a Spoken Word , 2012, Front. Psychology.

[29]  Bahador Bahrami,et al.  Conversation, Coupling and Complexity: Matching Scaling Laws Predict Performance in a Joint Decision Task , 2013, CogSci.

[30]  Damian G. Stephen,et al.  The role of fractality in perceptual learning: exploration in dynamic touch. , 2010, Journal of experimental psychology. Human perception and performance.

[31]  Gavan Lintern,et al.  Dynamic patterns: The self-organization of brain and behavior , 1997, Complex.

[32]  Damian G. Kelty-Stephen,et al.  Multiplicative-cascade dynamics in pole balancing. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Alexandra Paxton,et al.  Argument disrupts interpersonal synchrony , 2013, Quarterly journal of experimental psychology.

[34]  Diana Cristina González,et al.  Analysis of the Multifractal Nature of Speech Signals , 2012, CIARP.

[35]  Alexandra Paxton,et al.  Network Analysis of Multimodal, Multiscale Coordination in Dyadic Problem Solving , 2014, CogSci.

[36]  Bruce J. West,et al.  Transmission of information between complex systems: 1/f resonance. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Deborah J. Aks,et al.  Memory Across Eye-Movements: 1/f Dynamic in Visual Search , 2010 .

[38]  D. Oller The emergence of the speech capacity , 2000 .

[39]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[40]  G. V. van Orden,et al.  Self-organization of cognitive performance. , 2003, Journal of experimental psychology. General.

[41]  Maria N. Geffen,et al.  Category-Specific Processing of Scale-Invariant Sounds in Infancy , 2014, PloS one.

[42]  Daniel M. Dubois,et al.  Mathematical Foundations of Discrete and Functional Systems with Strong and Weak Anticipations , 2003, ABiALS.

[43]  Bruce J. West,et al.  Maximizing information exchange between complex networks , 2008 .

[44]  Justin M Fine,et al.  Emergent complexity matching in interpersonal coordination: Local dynamics and global variability. , 2015, Journal of experimental psychology. Human perception and performance.

[45]  M. Teich,et al.  The periodogram and Allan variance reveal fractal exponents greater than unity in auditory-nerve spike trains. , 1996, The Journal of the Acoustical Society of America.

[46]  Didier Delignières,et al.  Transition from Persistent to Anti-Persistent Correlations in Postural Sway Indicates Velocity-Based Control , 2011, PLoS Comput. Biol..

[47]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[48]  J. Cappella,et al.  Talk and Silence Sequences in Informal Conversations III: Interspeaker Influence , 1981 .

[49]  A. Woodward,et al.  Perception of acoustic correlates of major phrasal units by young infants , 1992, Cognitive Psychology.

[50]  Elena Patten,et al.  Vocal Patterns in Infants with Autism Spectrum Disorder: Canonical Babbling Status and Vocalization Frequency , 2014, Journal of Autism and Developmental Disorders.

[51]  Sebastian Wallot,et al.  Using nonlinear methods to quantify changes in infant limb movements and vocalizations , 2014, Front. Psychol..

[52]  Daniel Mirman,et al.  Interactions dominate the dynamics of visual cognition , 2010, Cognition.

[53]  C. Frith,et al.  Coming to Terms , 2012, Psychological science.

[54]  Christopher T. Kello,et al.  Complexity matching in dyadic conversation. , 2014, Journal of experimental psychology. General.

[55]  Hans G. Feichtinger,et al.  Analysis, Synthesis, and Estimation of Fractal-Rate Stochastic Point Processes , 1997, adap-org/9709006.

[56]  Espen A F Ihlen,et al.  Age-related changes in inter-joint coordination during walking. , 2014, Journal of applied physiology.

[57]  H E Stanley,et al.  Deviations from uniform power law scaling in nonstationary time series. , 1997, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[58]  Jeffrey M. Hausdorff,et al.  When human walking becomes random walking: fractal analysis and modeling of gait rhythm fluctuations. , 2001, Physica A.

[59]  Rick Dale,et al.  Behavior Matching in Multimodal Communication Is Synchronized , 2012, Cogn. Sci..

[60]  Alen Hajnal,et al.  Transfer of calibration between hand and foot: Functional equivalence and fractal fluctuations , 2011, Attention, perception & psychophysics.

[61]  Klaus J. Kohler,et al.  The Perception of Prominence Patterns , 2009, Phonetica.

[62]  Fred Hasselman,et al.  Classifying acoustic signals into phoneme categories: average and dyslexic readers make use of complex dynamical patterns and multifractal scaling properties of the speech signal , 2014, PeerJ.

[63]  Robert F. Port,et al.  Rhythmic constraints on stress timing in English , 1998 .

[64]  P. Kuhl,et al.  Acoustic determinants of infant preference for motherese speech , 1987 .

[65]  M. Pickering,et al.  Toward a mechanistic psychology of dialogue , 2004, Behavioral and Brain Sciences.

[66]  K. Mellanby How Nature works , 1978, Nature.

[67]  D. Oller,et al.  Phrasing in prelinguistic vocalizations. , 1995, Developmental psychobiology.

[68]  Michael J. Richardson,et al.  Recurrence Quantification as an Analysis of Temporal Coordination with Complex Signals , 2014 .