Detection of the period of voice based on wavelet

The quality analysis of pathological voice is not only a kind of objective evaluation for the human voice, but also an objective evidence for the application of voice disease diagnosis and the judgment of therapy. In this paper, the pathological voice quality analysis system is composed of five parts, which are voice recording, voice signal processing, and extraction of voice characteristic parameters, choice of these features and evaluation of voice quality. Firstly, this paper gives a brief introduction to the pathological voice quality analysis's background and purpose. Besides, it also details some methods of voice signal processing. For detection of voice period, vague fundamental frequency of voice is extracted firstly by using Fourier transform, and then the accurate detection of the voice period through EGG and wavelet transformation is performed. This method of period detection is highly accurate and practicable. Based on the period of voice, many features of voice such as the fundamental frequency, frequency perturbation, and amplitude perturbation can be extracted. This database is built by collecting healthy and pathological voice samples from Shengjing Hospital and related references. After analyzing the collected voice signals, the result shows that the pathological voice quality analysis system has the well evaluation performance in the practical application. These evaluation results reflect the quality of voice signal and can be used to assistant judge the treatment of voice disease and the recovery condition of treatment.

[1]  N Yanagihara,et al.  Significance of harmonic changes and noise components in hoarseness. , 1967, Journal of speech and hearing research.

[2]  C Manfredi,et al.  A comparative analysis of fundamental frequency estimation methods with application to pathological voices. , 2000, Medical engineering & physics.

[3]  P. Wan,et al.  [Relationship between voice acoustic parameters and vocal quality]. , 2008, Lin chuang er bi yan hou tou jing wai ke za zhi = Journal of clinical otorhinolaryngology, head, and neck surgery.

[4]  Yu Hui-ling The research on characteristics extraction based on voice signal , 2006 .

[5]  M. Hirano Objective evaluation of the human voice: clinical aspects. , 1989, Folia phoniatrica.

[6]  W. Gould,et al.  Vocal Shimmer in Sustained Phonation of Normal and Pathologic Voice , 1976, The Annals of otology, rhinology, and laryngology.

[7]  R S McGowan,et al.  Acoustic measurements of men's and women's voices: a study of context effects and covariation. , 1990, Journal of speech and hearing research.

[8]  Baikun Wan,et al.  [A review on the applications of acoustic analysis in diagnosing disease]. , 2007, Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi.

[9]  Qiru Zhou,et al.  Robust endpoint detection and energy normalization for real-time speech and speaker recognition , 2002, IEEE Trans. Speech Audio Process..