Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification

[1]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[2]  T.H. Crystal,et al.  Linear prediction of speech , 1977, Proceedings of the IEEE.

[3]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[4]  J. Laver The phonetic description of voice quality , 1980 .

[5]  Roger K. Moore Computer Speech and Language , 1986 .

[6]  D. Childers,et al.  Two-channel speech analysis , 1986, IEEE Trans. Acoust. Speech Signal Process..

[7]  John Vanderkooy,et al.  Transfer-Function Measurement with Maximum-Length Sequences , 1989 .

[8]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[9]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[10]  Christer Gobl,et al.  Acoustic characteristics of voice quality , 1992, Speech Commun..

[11]  Lou Boves,et al.  Fitting a LF-model to inverse filter signals , 1993, EUROSPEECH.

[12]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.

[13]  K. Stevens,et al.  Classification of glottal vibration from acoustic measurements , 1995 .

[14]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[15]  P. Ladefoged,et al.  The sounds of the world's languages , 1996 .

[16]  Mirjam Wester Automatic Classification of Voice Quality: Comparing Regression Models and Hidden Markov Models , 1998 .

[17]  M. Ng,et al.  Acoustic, aerodynamic, physiologic, and perceptual properties of modal and vocal fry registers. , 1998, The Journal of the Acoustical Society of America.

[18]  Jonathan Harrington,et al.  The Acoustic Theory of Speech Production , 1999 .

[19]  Kristin P. Bennett,et al.  Support vector machines: hype or hallelujah? , 2000, SKDD.

[20]  Christophe d'Alessandro,et al.  Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data , 2001, INTERSPEECH.

[21]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[22]  Ludmila I. Kuncheva,et al.  Using measures of similarity and inclusion for multiple classifier fusion by decision templates , 2001, Fuzzy Sets Syst..

[23]  Friedhelm Schwenker,et al.  Solving Multi-class Pattern Recognition Problems with Tree-Structured Support Vector Machines , 2001, DAGM-Symposium.

[24]  Richard Ogden Turn transition, creak and glottal stop in Finnish talk-in-interaction , 2001, Journal of the International Phonetic Association.

[25]  P. Alku,et al.  Normalized amplitude quotient for parametrization of the glottal flow. , 2002, The Journal of the Acoustical Society of America.

[26]  H. P. Huang,et al.  Fuzzy Support Vector Machines for Pattern Recognition and Data Mining , 2002 .

[27]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[28]  Ailbhe Ní Chasaide,et al.  The role of voice quality in communicating emotion, mood and attitude , 2003, Speech Commun..

[29]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[30]  C. Gobl,et al.  Amplitude-Based Source Parameters for Measur ing Voice Quality , 2003 .

[31]  C. Gobl The Voice Source in Speech Communication - Production and Perception Experiments Involving Inverse Filtering and Synthesis , 2003 .

[32]  Jerold A. Edmondson,et al.  The aryepiglottic folds and voice quality in the Yi and Bai languages : laryngoscopic case studies + , 2003 .

[33]  Nick Campbell Specifying Affect and Emotion for Expressive Speech Synthesis , 2004, CICLing.

[34]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[35]  N. Campbell,et al.  Voice Quality : the 4 th Prosodic Dimension , 2004 .

[36]  Mika Ito Politeness and Voice Quality – The Alternative Method to Measure Aspiration Noise , 2004 .

[37]  Jacqueline Walker,et al.  A Review of Glottal Waveform Analysis , 2005, WNSP.

[38]  Ailbhe Ní Chasaide,et al.  Voice quality and f0 cues for affect expression: implications for synthesis , 2005, INTERSPEECH.

[39]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[40]  J. Beck Perceptual analysis of voice quality: the place of Vocal Profile Analysis. , 2005 .

[41]  Günther Palm,et al.  Comparison of Multiclass SVM Decomposition Schemes for Visual Object Recognition , 2005, DAGM-Symposium.

[42]  M. Zusi,et al.  A fuzzy-input fuzzy-output SVM technique for classification of hyperspectral remote sensing images , 2006, Proceedings of the 7th Nordic Signal Processing Symposium - NORSIG 2006.

[43]  Bin Yang,et al.  Robust Estimation of Voice Quality Parameters Under Realworld Disturbances , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[44]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[45]  Nick Campbell,et al.  On the Use of NonVerbal Speech Sounds in Human Communication , 2007, COST 2102 Workshop.

[46]  Paavo Alku,et al.  Comparison of multiple voice source parameters in different phonation types , 2007, INTERSPEECH.

[47]  Diane M. Griffiths,et al.  THE REGENTS OF THE UNIVERSITY OF CALIFORNIA , 2007 .

[48]  Quarterly Progress and Status Report A preliminary study of acoustic voice quality correlates , 2007 .

[49]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[50]  Friedhelm Schwenker,et al.  Fuzzy-Input Fuzzy-Output One-Against-All Support Vector Machines , 2007, KES.

[51]  M. Lugger,et al.  Extracting voice quality contours using discrete hidden Markov models , 2008, Speech Prosody 2008.

[52]  Paavo Alku,et al.  HMM-based Finnish text-to-speech system utilizing glottal inverse filtering , 2008, INTERSPEECH.

[53]  Hiroshi Ishiguro,et al.  A Method for Automatic Detection of Vocal Fry , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[54]  Günther Palm,et al.  Comparison of Neural Classification Algorithms Applied to Land Cover Mapping , 2009, WIRN.

[55]  Junichi Yamagishi,et al.  Glottal spectral separation for parametric speech synthesis , 2008, INTERSPEECH.

[56]  Matti Airas,et al.  TKK Aparat: An environment for voice inverse filtering and parameterization , 2008, Logopedics, phoniatrics, vocology.

[57]  Bin Yang,et al.  Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[58]  Nicolas Sturmel,et al.  Glottal closure instant detection using Lines of Maximum Amplitudes (LOMA) of thewavelet transform , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[59]  P. Alku,et al.  Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering. , 2009, The Journal of the Acoustical Society of America.

[60]  Thierry Dutoit,et al.  Complex cepstrum-based decomposition of speech for glottal source estimation , 2009, INTERSPEECH.

[61]  Christian Thiel,et al.  Multiple Classifier Systems Incorporating Uncertainty , 2011 .

[62]  John Kane,et al.  A spectral LF model based approach to voice source parameterisation , 2010, INTERSPEECH.

[63]  Jody Kreiman,et al.  Integrated software for analysis and synthesis of voice quality , 2010, Behavior research methods.

[64]  John Kane,et al.  Identifying Regions of Non-Modal Phonation Using Features of the Wavelet Transform , 2011, INTERSPEECH.

[65]  M. Lugger,et al.  Classification of different speaking groups ITG Fachtagung Sprachkommunikation 2006 CLASSIFICATION OF DIFFERENT SPEAKING GROUPS BY MEANS OF VOICE QUALITY PARAMETERS , 2011 .

[66]  Haoxuan Li,et al.  LF model based glottal source parameter estimation by extended Kalman filtering , 2011 .

[67]  Günther Palm,et al.  A generic framework for the inference of user states in human computer interaction , 2012, Journal on Multimodal User Interfaces.

[68]  Haoxuan Li,et al.  Comparison of time- and frequency-domain based LF-model fitting methods for voice source parametrisation , 2012 .