The instructions under which raters quantify syllable prominence perception need to be simple in order to maintain immediate reactions. This leads to noise in the rating data that can be dealt with by normalization, e.g. setting central tendency = 0 and dispersion = 1 (as in Z-score normalization). Questions arise such as: Which parameter is adequate here to capture central tendency? Which reference distribution should the normalization be based on? In this paper 16 different normalization methods are evaluated. In a perception experiment using German read speech (prose and poetry), syllable prominence ratings were collected. From the rating data 16 complete “mirror” data-sets were computed according to the 16 methods. Each mirror data-set was correlated with the same set of measures from the underlying acoustic data, focusing on raw syllable duration which is seen as a rather straightforward acoustic aspect of syllable prominence. Correlation coefficients could be raised considerably by selected methods.
[1]
Hans Oesch,et al.
Historisch-kritische Gesamtausgabe der musikalischen Werke
,
1976
.
[2]
Paul Boersma,et al.
Praat: doing phonetics by computer
,
2003
.
[3]
Paul Boersma,et al.
Praat, a system for doing phonetics by computer
,
2002
.
[4]
C. Sappok,et al.
On the Normalization of Syllable Prominence Ratings
,
2012
.
[5]
G. Fant,et al.
Speech , Music and Hearing Quarterly Progress and Status Report Preliminaries to the study of Swedish prose reading and reading style
,
2007
.
[6]
Petra Wagner,et al.
Evaluating Different Rating Scales for Obtaining Judgments of Syllable Prominence from Naïve Listeners
,
2011,
ICPhS.