GLOTTAL EXCITATION EXTRACTION OF VOICED SPEECH - JOINTLY PARAMETRIC AND NONPARAMETRIC APPROACHES

[1]  A. Gray,et al.  Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .

[2]  Jerry M. Mendel,et al.  Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications , 1991, Proc. IEEE.

[3]  Paavo Alku,et al.  HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Nupur Prakash,et al.  Evaluation of MFCC for emotion identification in Hindi speech , 2011, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[5]  Stephen J. Wright,et al.  Primal-Dual Interior-Point Methods , 1997 .

[6]  Paolo Dario,et al.  An Algorithm for the Least Square-Fitting of Ellipses , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[7]  Thierry Dutoit,et al.  Complex cepstrum-based decomposition of speech for glottal source estimation , 2009, INTERSPEECH.

[8]  James Demmel,et al.  Applied Numerical Linear Algebra , 1997 .

[9]  Zhou Yanbing,et al.  Higher order spectral analysis for vibration signals of the large steam turbine in slow-down process , 2011, 2011 2nd International Conference on Intelligent Control and Information Processing.

[10]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[11]  John R. Deller,et al.  On the time domain properties of the two-pole model of the glottal waveform and implications for LPC , 1983, Speech Commun..

[12]  Rafik A. Goubran,et al.  Robust voice activity detection using higher-order statistics in the LPC residual domain , 2001, IEEE Trans. Speech Audio Process..

[13]  N. C. Nigam Introduction to Random Vibrations , 1983 .

[14]  M. Matausek,et al.  A new approach to the determination of the glottal waveform , 1980 .

[15]  S. Biswas,et al.  Speaker Identification Using Cepstral Based Features and Discrete Hidden Markov Model , 2007, 2007 International Conference on Information and Communication Technology.

[16]  Sanjay Mehrotra,et al.  On the Implementation of a Primal-Dual Interior Point Method , 1992, SIAM J. Optim..

[17]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[18]  Paavo Alku An automatic method to estimate the time-based parameters of the glottal pulseform , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[20]  M. J. D. Powell,et al.  On the convergence of trust region algorithms for unconstrained minimization without derivatives , 2012, Comput. Optim. Appl..

[21]  L. N. Vicente,et al.  Trust-Region Interior-Point SQP Algorithms for a Class of Nonlinear Programming Problems , 1998 .

[22]  Thierry Dutoit,et al.  Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation , 2011, Speech Commun..

[23]  Sung Joon Ahn,et al.  Least Squares Orthogonal Distance Fitting of Curves and Surfaces in Space , 2004, Lecture Notes in Computer Science.

[24]  Evelyn Abberton,et al.  Laryngographic assessment of normal voice: A tutorial , 1989 .

[25]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[26]  Jorge J. Moré,et al.  Recent Developments in Algorithms and Software for Trust Region Methods , 1982, ISMP.

[27]  John E. Dennis,et al.  An Adaptive Nonlinear Least-Squares Algorithm , 1977, TOMS.

[28]  C. L. Nikias,et al.  Signal processing with higher-order spectra , 1993, IEEE Signal Processing Magazine.

[29]  Meng-Lin Ku,et al.  Higher-order statistics based sequential spectrum sensing for cognitive radio , 2011, 2011 11th International Conference on ITS Telecommunications.

[30]  John R. Wolberg,et al.  Data Analysis Using the Method of Least Squares: Extracting the Most Information from Experiments , 2005 .

[31]  Karen O. Egiazarian,et al.  Moving target classification in ground surveillance radar ATR system by using novel bicepstral-based information features , 2011, 2011 8th European Radar Conference.

[32]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[33]  P. Scott Minimax and L_{1} curve fitting in non-Gaussian MAP estimation , 1975 .

[34]  L. N. Vicente,et al.  Trust-Region Interior-Point Algorithms for Minimization Problems with Simple Bounds , 1996 .

[35]  Dimitris G. Manolakis,et al.  Statistical and Adaptive Signal Processing: Spectral Estimation, Signal Modeling, Adaptive Filtering and Array Processing , 1999 .

[36]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[37]  Leon S. Lasdon,et al.  Feature Article - Survey of Nonlinear Programming Applications , 1980, Oper. Res..

[38]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[39]  DeLiang Wang,et al.  Robust Speaker Recognition Using Binary Time-Frequency Masks , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[40]  Z Xiangsun A Self-adaptive Trust Region Method For Unconstrained Optimization , 2001 .

[41]  Anders Forsgren,et al.  Interior Methods for Nonlinear Optimization , 2002, SIAM Rev..

[42]  Philippe L. Toint,et al.  A retrospective trust-region method for unconstrained optimization , 2010, Math. Program..

[43]  Mike Brookes,et al.  Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[44]  C. Kelley Solving Nonlinear Equations with Newton's Method , 1987 .

[45]  Serge Gratton,et al.  Approximate Gauss-Newton Methods for Nonlinear Least Squares Problems , 2007, SIAM J. Optim..

[46]  Jorge Nocedal,et al.  A trust region method based on interior point techniques for nonlinear programming , 2000, Math. Program..

[47]  Tu Bao Ho,et al.  Temporal decomposition: a promising approach to VQ-based speaker identification , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[48]  Athina P. Petropulu,et al.  System reconstruction from higher order spectra slices , 1997, IEEE Trans. Signal Process..

[49]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[50]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[51]  Li-Zhi Liao,et al.  Convergence analysis of the Levenberg–Marquardt method , 2007, Optim. Methods Softw..

[52]  Diego P. Ruiz,et al.  Bispectrum estimation using AR‐modelling , 1999 .

[53]  Sanjoy Dasgupta,et al.  Random projection trees for vector quantization , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[54]  Narendra Karmarkar,et al.  A new polynomial-time algorithm for linear programming , 1984, Comb..

[55]  Tilo Strutz,et al.  Data Fitting and Uncertainty: A practical introduction to weighted least squares and beyond , 2010 .

[56]  Mike Brookes,et al.  Voice source cepstrum coefficients for speaker identification , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[57]  Georgios B. Giannakis,et al.  Bispectral analysis and model validation of texture images , 1995, IEEE Trans. Image Process..

[58]  Francisco Herrera,et al.  Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Florian A. Potra,et al.  Q-superlinear convergence of the iterates in primal-dual interior-point methods , 2001, Math. Program..

[60]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[61]  Byeong Gi Lee,et al.  Lossless pole-zero modeling of speech signals , 1993, IEEE Trans. Speech Audio Process..

[62]  Keiichi Funaki,et al.  Recursive ARMAX speech analysis based on a glottal source model with phase compensation , 1999, Signal Process..

[63]  Pietro Laface,et al.  Language Identification using Acoustic Models and Speaker Compensated Cepstral-Time Matrices , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[64]  Til T. Phan,et al.  Text-Independent Speaker Identification , 1999 .

[65]  Thomas F. Coleman,et al.  An Interior Trust Region Approach for Nonlinear Minimization Subject to Bounds , 1993, SIAM J. Optim..

[66]  John H. L. Hansen,et al.  Speaker Identification Within Whispered Speech Audio Streams , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[67]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[68]  Douglas A. Reynolds,et al.  A Gaussian mixture modeling approach to text-independent speaker identification , 1992 .

[69]  Pasi Fränti,et al.  Fast Agglomerative Clustering Using a k-Nearest Neighbor Graph , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70]  J. Flanagan Some properties of the glottal sound source. , 1958, Journal of speech and hearing research.

[71]  Warren P. Mason The approximate networks of acoustic filters , 1930 .

[72]  Nicholas I. M. Gould,et al.  Trust Region Methods , 2000, MOS-SIAM Series on Optimization.

[73]  S. Chandra,et al.  Experimental comparison between stationary and nonstationary formulations of linear prediction applied to voiced speech analysis , 1974 .

[74]  Carl Tim Kelley,et al.  Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[75]  Mark L. Nagurka,et al.  A vector quantization method for nearest neighbor classifier design , 2004, Pattern Recognit. Lett..

[76]  D. Gorinevsky An approach to parametric nonlinear least square optimization and application to task-level learning control , 1997, IEEE Trans. Autom. Control..

[77]  Yunjian Ge,et al.  One curve-fit method for the evaluation of the total distortion of sinusoidal signal , 2010, The 2010 IEEE International Conference on Information and Automation.

[78]  R. Miller Nature of the Vocal Cord Wave , 1956 .

[79]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[80]  Bayya Yegnanarayana,et al.  Determination of instants of significant excitation in speech using group delay function , 1995, IEEE Trans. Speech Audio Process..

[81]  Bayya Yegnanarayana,et al.  Robustness of group-delay-based method for extraction of significant instants of excitation from speech signals , 1999, IEEE Trans. Speech Audio Process..

[82]  Zheng Bao,et al.  Total least mean squares algorithm , 1998, IEEE Trans. Signal Process..

[83]  Patrick A. Naylor,et al.  The SIGMA Algorithm: A Glottal Activity Detector for Electroglottographic Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[84]  Chrysostomos L. Nikias,et al.  The complex cepstrum of higher order cumulants and nonminimum phase system identification , 1988, IEEE Trans. Acoust. Speech Signal Process..

[85]  Christophe d'Alessandro,et al.  The voice source as a causal/anticausal linear filter , 2003 .

[86]  Athina P. Petropulu,et al.  The complex cepstrum and bicepstrum: analytic performance evaluation in the presence of Gaussian noise , 1990, IEEE Trans. Acoust. Speech Signal Process..

[87]  Y. Venkataramani,et al.  Text Independent Composite Speaker Identification/Verification Using Multiple Features , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[88]  Arie Yeredor,et al.  The extended least squares criterion: minimization algorithms and applications , 2001, IEEE Trans. Signal Process..

[89]  Dah-Chung Chang,et al.  An automatic modulation classification technique using high-order statistics for multipath fading channels , 2011, 2011 11th International Conference on ITS Telecommunications.

[90]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[91]  J. Flanagan Note on the Design of “Terminal‐Analog” Speech Synthesizers , 1957 .

[92]  Christophe d'Alessandro,et al.  Zeros of Z-transform representation with application to source-filter separation in speech , 2005, IEEE Signal Processing Letters.

[93]  Douglas A. Reynolds,et al.  Modeling of the glottal flow derivative waveform with application to speaker identification , 1999, IEEE Trans. Speech Audio Process..