Automatic meter classification in Persian poetries using support vector machines

In this paper, a meter classification system has been proposed for Persian poems based on features extracted from uttered poem. In the first stage, the utterance has been segmented into syllables using three features, pitch frequency and modified energy of each frame of the utterance and its temporal variations. In the second stage, each syllable is classified into long syllable and short syllable classes which is a historically convenient categorization in Persian literature. In this stage, the classifier is an SVM classifier with radial basis function kernel and employed features are the syllable temporal duration, zero crossing rate and PARCOR coefficients of each syllable. The sequence of extracted syllables classes is then compared with classic Persian meter styles using dynamic time warping, to make the system robust against syllables insertion, deletion or classification. The system has been evaluated on 136 poetries utterances from 12 Persian meter styles gathered from 8 speakers, using k-fold evaluation strategy. The results show 91% accuracy in three top meter style choices of the system.

[1]  Powsiri Klinkhachorn,et al.  Tone classification of syllable‐segmented Thai speech based on multilayer perception , 2002 .

[2]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[3]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[4]  Liang-Yan Li,et al.  SVM-Based Classification Method for Poetry Style , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[5]  Chih-Jen Lin,et al.  A tutorial on?-support vector machines , 2005 .

[6]  Ying Liu,et al.  Entering Tone Recognition in a Support Vector Machine Approach , 2008, 2008 Fourth International Conference on Natural Computation.

[7]  Hamid R. Tizhoosh,et al.  On poem recognition , 2006, Pattern Analysis and Applications.

[8]  P. Klinkhachorn,et al.  Tone classification of syllable-segmented Thai speech based on multilayer perceptron , 2003, Proceedings of the 35th Southeastern Symposium on System Theory, 2003..

[9]  Ronald W. Schafer,et al.  Real-time digital hardware pitch detector , 1976 .

[10]  Yong Yi,et al.  Advanced studies on traditional Chinese poetry style identification , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[11]  Bernhard Schölkopf,et al.  A tutorial on v-support vector machines , 2005 .