Fundamental Frequency Estimation by Combinations of Various Methods

Several pitch estimation algorithms have been proposed over the decades, but they have tended to become more and more complex and cumbersome, some of them requiring much more computational power than a real-time application can afford. Rather than have one sophisticated algorithm, here we propose to combine the output of several conventional and relatively simple algorithms by various dedicated combination schemes. These combination methods perform a kind of weighted majority voting that helps find the correct solution when just a few of the basic algorithms go wrong. For testing purposes we compare the performance of the methods on a pitch-annotated corpora. The results show that with the combination schemes the amount of errors can be reduced by about 20-35% relative to the error of the best individual estimator

[1]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[2]  Fabrice Plante,et al.  A pitch extraction reference database , 1995, EUROSPEECH.

[3]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[4]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[6]  László Tóth,et al.  Fundamental frequency estimation by least-squares harmonic model fitting , 2005, INTERSPEECH.

[7]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.