论文信息 - GLOTTAL EXCITATION EXTRACTION OF VOICED SPEECH - JOINTLY PARAMETRIC AND NONPARAMETRIC APPROACHES - 字舞流文

GLOTTAL EXCITATION EXTRACTION OF VOICED SPEECH - JOINTLY PARAMETRIC AND NONPARAMETRIC APPROACHES

Yiqiao Chen | Yiqiao Chen

[1] A. Gray,et al. Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .

[2] Jerry M. Mendel,et al. Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications , 1991, Proc. IEEE.

[3] Paavo Alku,et al. HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[4] Nupur Prakash,et al. Evaluation of MFCC for emotion identification in Hindi speech , 2011, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[5] Stephen J. Wright,et al. Primal-Dual Interior-Point Methods , 1997 .

[6] Paolo Dario,et al. An Algorithm for the Least Square-Fitting of Ellipses , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[7] Thierry Dutoit,et al. Complex cepstrum-based decomposition of speech for glottal source estimation , 2009, INTERSPEECH.

[8] James Demmel,et al. Applied Numerical Linear Algebra , 1997 .

[9] Zhou Yanbing,et al. Higher order spectral analysis for vibration signals of the large steam turbine in slow-down process , 2011, 2011 2nd International Conference on Intelligent Control and Information Processing.

[10] John G. Proakis,et al. Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[11] John R. Deller,et al. On the time domain properties of the two-pole model of the glottal waveform and implications for LPC , 1983, Speech Commun..

[12] Rafik A. Goubran,et al. Robust voice activity detection using higher-order statistics in the LPC residual domain , 2001, IEEE Trans. Speech Audio Process..

[13] N. C. Nigam. Introduction to Random Vibrations , 1983 .

[14] M. Matausek,et al. A new approach to the determination of the glottal waveform , 1980 .

[15] S. Biswas,et al. Speaker Identification Using Cepstral Based Features and Discrete Hidden Markov Model , 2007, 2007 International Conference on Information and Communication Technology.

[16] Sanjay Mehrotra,et al. On the Implementation of a Primal-Dual Interior Point Method , 1992, SIAM J. Optim..

[17] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[18] Paavo Alku. An automatic method to estimate the time-based parameters of the glottal pulseform , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19] J. Liljencrants,et al. Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[20] M. J. D. Powell,et al. On the convergence of trust region algorithms for unconstrained minimization without derivatives , 2012, Comput. Optim. Appl..

[21] L. N. Vicente,et al. Trust-Region Interior-Point SQP Algorithms for a Class of Nonlinear Programming Problems , 1998 .

[22] Thierry Dutoit,et al. Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation , 2011, Speech Commun..

[23] Sung Joon Ahn,et al. Least Squares Orthogonal Distance Fitting of Curves and Surfaces in Space , 2004, Lecture Notes in Computer Science.

[24] Evelyn Abberton,et al. Laryngographic assessment of normal voice: A tutorial , 1989 .

[25] J. Makhoul,et al. Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[26] Jorge J. Moré,et al. Recent Developments in Algorithms and Software for Trust Region Methods , 1982, ISMP.

[27] John E. Dennis,et al. An Adaptive Nonlinear Least-Squares Algorithm , 1977, TOMS.

[28] C. L. Nikias,et al. Signal processing with higher-order spectra , 1993, IEEE Signal Processing Magazine.

[29] Meng-Lin Ku,et al. Higher-order statistics based sequential spectrum sensing for cognitive radio , 2011, 2011 11th International Conference on ITS Telecommunications.

[30] John R. Wolberg,et al. Data Analysis Using the Method of Least Squares: Extracting the Most Information from Experiments , 2005 .

[31] Karen O. Egiazarian,et al. Moving target classification in ground surveillance radar ATR system by using novel bicepstral-based information features , 2011, 2011 8th European Radar Conference.

[32] R. Gray,et al. Vector quantization , 1984, IEEE ASSP Magazine.

[33] P. Scott. Minimax and L_{1} curve fitting in non-Gaussian MAP estimation , 1975 .

[34] L. N. Vicente,et al. Trust-Region Interior-Point Algorithms for Minimization Problems with Simple Bounds , 1996 .

[35] Dimitris G. Manolakis,et al. Statistical and Adaptive Signal Processing: Spectral Estimation, Signal Modeling, Adaptive Filtering and Array Processing , 1999 .

[36] D. Marquardt. An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[37] Leon S. Lasdon,et al. Feature Article - Survey of Nonlinear Programming Applications , 1980, Oper. Res..

[38] Paavo Alku,et al. Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[39] DeLiang Wang,et al. Robust Speaker Recognition Using Binary Time-Frequency Masks , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[40] Z Xiangsun. A Self-adaptive Trust Region Method For Unconstrained Optimization , 2001 .

[41] Anders Forsgren,et al. Interior Methods for Nonlinear Optimization , 2002, SIAM Rev..

[42] Philippe L. Toint,et al. A retrospective trust-region method for unconstrained optimization , 2010, Math. Program..

[43] Mike Brookes,et al. Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[44] C. Kelley. Solving Nonlinear Equations with Newton's Method , 1987 .

[45] Serge Gratton,et al. Approximate Gauss-Newton Methods for Nonlinear Least Squares Problems , 2007, SIAM J. Optim..

[46] Jorge Nocedal,et al. A trust region method based on interior point techniques for nonlinear programming , 2000, Math. Program..

[47] Tu Bao Ho,et al. Temporal decomposition: a promising approach to VQ-based speaker identification , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[48] Athina P. Petropulu,et al. System reconstruction from higher order spectra slices , 1997, IEEE Trans. Signal Process..

[49] A. W. M. van den Enden,et al. Discrete Time Signal Processing , 1989 .

[50] D. Klatt,et al. Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[51] Li-Zhi Liao,et al. Convergence analysis of the Levenberg–Marquardt method , 2007, Optim. Methods Softw..

[52] Diego P. Ruiz,et al. Bispectrum estimation using AR‐modelling , 1999 .

[53] Sanjoy Dasgupta,et al. Random projection trees for vector quantization , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[54] Narendra Karmarkar,et al. A new polynomial-time algorithm for linear programming , 1984, Comb..

[55] Tilo Strutz,et al. Data Fitting and Uncertainty: A practical introduction to weighted least squares and beyond , 2010 .

[56] Mike Brookes,et al. Voice source cepstrum coefficients for speaker identification , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[57] Georgios B. Giannakis,et al. Bispectral analysis and model validation of texture images , 1995, IEEE Trans. Image Process..

[58] Francisco Herrera,et al. Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59] Florian A. Potra,et al. Q-superlinear convergence of the iterates in primal-dual interior-point methods , 2001, Math. Program..

[60] John H. L. Hansen,et al. Discrete-Time Processing of Speech Signals , 1993 .

[61] Byeong Gi Lee,et al. Lossless pole-zero modeling of speech signals , 1993, IEEE Trans. Speech Audio Process..

[62] Keiichi Funaki,et al. Recursive ARMAX speech analysis based on a glottal source model with phase compensation , 1999, Signal Process..

[63] Pietro Laface,et al. Language Identification using Acoustic Models and Speaker Compensated Cepstral-Time Matrices , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[64] Til T. Phan,et al. Text-Independent Speaker Identification , 1999 .

[65] Thomas F. Coleman,et al. An Interior Trust Region Approach for Nonlinear Minimization Subject to Bounds , 1993, SIAM J. Optim..

[66] John H. L. Hansen,et al. Speaker Identification Within Whispered Speech Audio Streams , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[67] Gunnar Fant,et al. Acoustic Theory Of Speech Production , 1960 .

[68] Douglas A. Reynolds,et al. A Gaussian mixture modeling approach to text-independent speaker identification , 1992 .

[69] Pasi Fränti,et al. Fast Agglomerative Clustering Using a k-Nearest Neighbor Graph , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70] J. Flanagan. Some properties of the glottal sound source. , 1958, Journal of speech and hearing research.

[71] Warren P. Mason. The approximate networks of acoustic filters , 1930 .

[72] Nicholas I. M. Gould,et al. Trust Region Methods , 2000, MOS-SIAM Series on Optimization.

[73] S. Chandra,et al. Experimental comparison between stationary and nonstationary formulations of linear prediction applied to voiced speech analysis , 1974 .

[74] Carl Tim Kelley,et al. Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[75] Mark L. Nagurka,et al. A vector quantization method for nearest neighbor classifier design , 2004, Pattern Recognit. Lett..

[76] D. Gorinevsky. An approach to parametric nonlinear least square optimization and application to task-level learning control , 1997, IEEE Trans. Autom. Control..

[77] Yunjian Ge,et al. One curve-fit method for the evaluation of the total distortion of sinusoidal signal , 2010, The 2010 IEEE International Conference on Information and Automation.

[78] R. Miller. Nature of the Vocal Cord Wave , 1956 .

[79] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.

[80] Bayya Yegnanarayana,et al. Determination of instants of significant excitation in speech using group delay function , 1995, IEEE Trans. Speech Audio Process..

[81] Bayya Yegnanarayana,et al. Robustness of group-delay-based method for extraction of significant instants of excitation from speech signals , 1999, IEEE Trans. Speech Audio Process..

[82] Zheng Bao,et al. Total least mean squares algorithm , 1998, IEEE Trans. Signal Process..

[83] Patrick A. Naylor,et al. The SIGMA Algorithm: A Glottal Activity Detector for Electroglottographic Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[84] Chrysostomos L. Nikias,et al. The complex cepstrum of higher order cumulants and nonminimum phase system identification , 1988, IEEE Trans. Acoust. Speech Signal Process..

[85] Christophe d'Alessandro,et al. The voice source as a causal/anticausal linear filter , 2003 .

[86] Athina P. Petropulu,et al. The complex cepstrum and bicepstrum: analytic performance evaluation in the presence of Gaussian noise , 1990, IEEE Trans. Acoust. Speech Signal Process..

[87] Y. Venkataramani,et al. Text Independent Composite Speaker Identification/Verification Using Multiple Features , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[88] Arie Yeredor,et al. The extended least squares criterion: minimization algorithms and applications , 2001, IEEE Trans. Signal Process..

[89] Dah-Chung Chang,et al. An automatic modulation classification technique using high-order statistics for multipath fading channels , 2011, 2011 11th International Conference on ITS Telecommunications.

[90] A. Rosenberg. Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[91] J. Flanagan. Note on the Design of “Terminal‐Analog” Speech Synthesizers , 1957 .

[92] Christophe d'Alessandro,et al. Zeros of Z-transform representation with application to source-filter separation in speech , 2005, IEEE Signal Processing Letters.

[93] Douglas A. Reynolds,et al. Modeling of the glottal flow derivative waveform with application to speaker identification , 1999, IEEE Trans. Speech Audio Process..