Paper information - Classification and clustering to identify spoken dialects in Indonesian

Classification and clustering to identify spoken dialects in Indonesian

This paper describes classification with Support Vector Machines (SVM) and clustering with K-means to identify eight spoken dialects of the Indonesian language. Dialect identification is important for building better Automatic Speech Recognition systems. The experiments compare models built on three sound features, Mel Frequency Cepstral Coefficients (MFCC), spectral flux, and spectral centroid, against models built on MFCC features only. Two methods, one-against-one and all-at-once, are also compared. The best result, 55%, comes from SVM one-against-one with the three features.
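As an illustration of two of the features named in the abstract, the sketch below computes spectral centroid and spectral flux from per-frame magnitude spectra. It follows commonly used definitions (centroid as the magnitude-weighted mean frequency, flux as the squared difference between successive normalized spectra); whether the paper uses exactly these formulas is an assumption. The function names, the sample rate, and the FFT size are hypothetical and chosen only for the example.

```python
def spectral_centroid(mag, sample_rate, n_fft):
    """Magnitude-weighted mean frequency (Hz) of one frame's magnitude spectrum.

    Assumes `mag` holds the magnitudes of bins 0..n_fft/2, so bin k
    corresponds to the frequency k * sample_rate / n_fft.
    """
    total = sum(mag)
    if total == 0:
        return 0.0  # silent frame: no meaningful centroid
    return sum(k * sample_rate / n_fft * m for k, m in enumerate(mag)) / total


def spectral_flux(prev_mag, mag):
    """Squared difference between two successive normalized magnitude spectra."""
    def normalize(v):
        s = sum(v)
        return [x / s for x in v] if s else list(v)

    prev_n, cur_n = normalize(prev_mag), normalize(mag)
    return sum((c - p) ** 2 for c, p in zip(cur_n, prev_n))


# Toy frames (assumed values): 8 kHz sample rate, 8-point FFT, 5 bins each.
frame_a = [0.0, 1.0, 0.0, 0.0, 0.0]  # all energy at 1000 Hz
frame_b = [0.0, 0.0, 0.0, 1.0, 0.0]  # all energy at 3000 Hz

centroid_a = spectral_centroid(frame_a, 8000, 8)
centroid_b = spectral_centroid(frame_b, 8000, 8)
flux_ab = spectral_flux(frame_a, frame_b)
```

In a pipeline like the one the abstract outlines, such per-frame values would typically be appended to the MFCC vector before training the classifier; the exact feature layout is not specified in the abstract.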

Dessi Puji Lestari | Jacqueline Ibrahim
