In this paper, a comparison between the automatic and manual approach of speech segmentation for the Arabic speech is conducted. In this approach, the automatic method, using the energy level measurement is compared to the manual segmentation procedure. The traditional zero crossing method commonly used for speech processing is also included in this work. The energy measurement method is based on dividing the uttered tokens into different levels. For instance, the Arabic language phonemes are divided into two energy regions: unvoiced phonemes which can be categorized as low energy include the sounds / س / (/s/) and / ه / (/h/). On the other hand, vowels and semi-vowels such as / / ( فتحه ) (/ ‘a /) and / و / (/w/) are labeled as high energy. Voiced fricatives, for instance, the sounds / ز / (/z/) and / /ع (/aa/) are classified as high energy phonemes.
[1]
Lamia Bouafif,et al.
Pitch detection and formant analysis of Arabic speech processing
,
2001
.
[2]
Ossama Emam,et al.
Language Model Based Arabic Word Segmentation
,
2003,
ACL.
[3]
Kris Demuynck,et al.
A Comparison of Different Approaches to Automatic Speech Segmentation
,
2002,
TSD.
[4]
Ossama Essa.
Using prosody in automatic segmentation of speech
,
1998,
ACM-SE 36.
[5]
T. M. Nazmy,et al.
A Novel Method for Arabic Consonant/Vowel Segmentation Using Wavelet Transform
,
2005,
Egypt. Comput. Sci. J..
[6]
Jan P. van Hemert,et al.
Automatic segmentation of speech
,
1991,
IEEE Trans. Signal Process..