An adaptive neuro-fuzzy inference system for the qualitative study of perceptual prominence in linguistics

This paper explores the applications of fuzzy logic inference systems as an instrument to perform linguistic analysis in the domain of prosodic prominence. Understanding how acoustic features interact to make a linguistic unit be perceived as more relevant than the surrounding ones is generally needed to study the cognitive processes needed for speech understanding. It also has technological applications in the field of speech recognition and synthesis. We present a first experiment to show how fuzzy inference systems, being characterised by their capability to provide detailed insight about the models obtained through supervised learning can help investigate the complex relationships among acoustic features linked to prominence perception.

[1]  Anne Lacheret,et al.  A corpus-based learning method for prominence detection in spontaneous speech , 2009 .

[2]  A.C.M. Rietveld,et al.  On the relation between pitch excursion size and prominence , 1985 .

[3]  Antonio Origlia,et al.  On the Use of the Rhythmogram for Automatic Syllabic Prominence Detection , 2011, INTERSPEECH.

[4]  Anupam Shukla,et al.  Multilingual speaker recognition using ANFIS , 2010, 2010 2nd International Conference on Signal Processing Systems.

[5]  David House Differential perception of tonal contours through the syllable , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  G. Fant,et al.  Speech , Music and Hearing Quarterly Progress and Status Report Preliminaries to the study of Swedish prose reading and reading style , 2007 .

[7]  Hui Fang Adaptive Neurofuzzy Inference System in the Application of the Financial Crisis Forecast , .

[8]  Masato Akagi,et al.  A method for emotional speech synthesis based on the position of emotional state in Valence-Activation space , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[9]  Petra Wagner,et al.  Different parts of the same elephant: A roadmap to disentangle and connect different perspectives on prosodic prominence , 2015, ICPhS.

[10]  Okyay Kaynak,et al.  Adaptive neuro-fuzzy inference system based autonomous flight control of unmanned air vehicles , 2007, Expert Syst. Appl..

[11]  Antonio Origlia,et al.  A dynamic tonal perception model for optimal pitch stylization , 2013, Comput. Speech Lang..

[12]  Petra Wagner,et al.  Beat It! Gesture-based Prominence Annotation as a Window to Individual Prosody Processing Strategies , 2016 .

[13]  J. Terken Fundamental frequency and perceived prominence of accented syllables. , 1991, The Journal of the Acoustical Society of America.

[14]  Andrew Rosenberg,et al.  Cross-Language Prominence Detection , 2012 .

[15]  J. Sawusch,et al.  The processing of duration and intensity cues to prominence. , 1996, The Journal of the Acoustical Society of America.

[16]  Y. Chakrapani,et al.  Adaptive Neuro-Fuzzy Inference System based Fractal Image Compression , 2009 .

[17]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[18]  Pier Marco Bertinetto,et al.  Prosodic prominence detection in Italian continuous speech using probabilistic graphical models , 2014 .

[19]  Barbara Heuft,et al.  Eine prominenzbasierte Methode zur Prosodieanalyse und -synthese , 1999 .

[20]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[21]  Juraj Simko,et al.  Hierarchical representation and estimation of prosody using continuous wavelet transform , 2017, Comput. Speech Lang..

[22]  Giovanni Acampora,et al.  Interoperable neuro-fuzzy services for emotion-aware ambient intelligence , 2013, Neurocomputing.

[23]  Moses Ekpenyong,et al.  Adaptive Prosody Modelling for Improved Synthetic Speech Quality , 2013, LTC.

[24]  Saeed Setayeshi,et al.  Abnormal Red Blood Cells Detection Using Adaptive Neuro-fuzzy System , 2012, MMVR.

[25]  Antonio Origlia,et al.  Investigating syllabic prominence with Conditional Random Fields and Latent-Dynamic Conditional Random Fields , 2012, INTERSPEECH.