Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching

In this paper a preliminary study on voice excitation modeling by single glottal shape parameter selection is presented. A strategy for direct model selection by matching derivative glottal source estimates with LF-based candidates driven by the Rd parameter is explored by means of two state-of-the-art similarity measures and a novel one considering spectral envelope information. An experimental study on synthetic singing-voice was carried out aiming to compare the performance of the different measures and to observe potential relations with respect to different voice characteristics (e.g. vocal effort, pitch range, amount of aperiodicities and aspiration noise). The results of this study allow us to claim competitive performance of the proposed strategy and suggest us preferable source modeling conditions for stationary singing-voice.

[1]  Julius O. Smith,et al.  Toward a high-quality singing synthesizer with vocal texture control , 2002 .

[2]  C. Gobl,et al.  Exploiting Time and Frequency Domain Measures for Precise Voice ParameterisationSource , 2012 .

[3]  Jody Kreiman,et al.  Perception of aperiodicity in pathological voice. , 2005, The Journal of the Acoustical Society of America.

[4]  Nathalie Henrich-Bernardoni Etude de la source glottique en voix parlee et chantee : modelisation et estimation, mesures acoustiques et electroglottographiques, perception , 2001 .

[5]  G. Fant Dept. for Speech, Music and Hearing Quarterly Progress and Status Report the Lf-model Revisited. Transformations and Frequency Domain Analysis the Lf-model Revisited. Transformations and Frequency Domain Analysis* , 2022 .

[6]  X. Rodet EFFICIENT SPECTRAL ENVELOPE ESTIMATION AND ITS APPLICATION TO PITCH SHIFTING AND ENVELOPE PRESERVATION , 2005 .

[7]  Axel Röbel,et al.  Joint estimate of shape and time-synchronization of a glottal source model by phase flatness , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Thierry Dutoit,et al.  Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation , 2011, Speech Commun..

[9]  Axel Röbel,et al.  Improving Lpc Spectral Envelope Extraction Of Voiced Speech By True-Envelope Estimation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..