An automatic singing voice rectifier design

This paper proposes a new approach to automatic singing voice rectification. The rectifier has two components: a recognizer based on dynamic time warping, and a synthesizer based on PSOLA (Pitch Synchronous Overlap and Add) for pitch shifting. The recognizer identifies the locations of the off-key parts of the user's acoustic input. Given the target music score, the synthesizer then corrects those off-key parts by shifting their pitch to match the score. We also conduct singing and listening experiments to evaluate the feasibility of the rectifier, and the results show satisfactory performance.
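The recognizer stage described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the sung input and the score have already been converted to per-frame pitch values in semitones (MIDI note numbers), uses textbook dynamic time warping with an absolute-difference cost, and flags frames with a hypothetical 0.5-semitone tolerance. The names `dtw_path` and `find_off_key` are illustrative. PSOLA itself is not implemented here; the sketch only computes the frequency ratio that a pitch shifter would need for each off-key frame.

```python
import math


def dtw_path(sung, target):
    """Align two pitch sequences (semitones) with classic DTW.

    Returns the warping path as a list of (sung_index, target_index) pairs.
    """
    n, m = len(sung), len(target)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(sung[i - 1] - target[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])

    # Backtrack from the end of both sequences to the start.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # diagonal
            (cost[i - 1][j], i - 1, j),          # advance sung only
            (cost[i][j - 1], i, j - 1),          # advance target only
        ]
        _, i, j = min(steps)
    path.reverse()
    return path


def find_off_key(sung, target, tolerance=0.5):
    """Return (frame_index, shift_ratio) for each off-key sung frame.

    tolerance is an assumed threshold in semitones. shift_ratio is the
    frequency ratio 2 ** (deviation / 12) that would move the frame onto
    the aligned target pitch.
    """
    deviation = {}
    for si, ti in dtw_path(sung, target):
        dev = target[ti] - sung[si]
        # If a frame aligns to several target frames, keep the closest one.
        if si not in deviation or abs(dev) < abs(deviation[si]):
            deviation[si] = dev
    return [
        (si, 2.0 ** (dev / 12.0))
        for si, dev in sorted(deviation.items())
        if abs(dev) > tolerance
    ]


# Hypothetical data: the singer is sharp on the middle note.
sung = [60.0, 60.1, 60.0, 62.8, 62.7, 64.0, 64.1]
target = [60, 60, 62, 62, 64, 64]
corrections = find_off_key(sung, target)
```

In this example the two frames sung near 62.8 are flagged, each with a ratio slightly below 1, meaning the pitch would be lowered. A real system would pass such ratios to a PSOLA-style synthesizer, and would likely work on note-level segments rather than individual frames.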
