Asymmetries in the perception of synthesized speech

It was previously observed [1] that the order in which paired stimuli are presented influences the number of "different" responses in same-different tasks used to evaluate synthesized speech. This paper examines the phenomenon from the perspective of cognitive psychology and demonstrates that, as that literature suggests, there is an effect related to the prototypicality of the stimulus.

[1] Åke Hellström. Temporal asymmetry and “magnet effect” in similarity and discrimination of prototypical and nonprototypical stimuli: Consequences of differential sensation weighting, 2007.

[2] Emmanuel M. Pothos, et al. Formal Approaches in Categorization: Contents, 2011.

[3] J. D. Smith, et al. Formal Approaches in Categorization: Prototype models of categorization: basic formulation, predictions, and limitations, 2011.

[4] E. Rosch, et al. Family resemblances: Studies in the internal structure of categories, 1975, Cognitive Psychology.

[5] Thomas H. Rammsayer, et al. Effects of time-order, interstimulus interval, and feedback in duration discrimination of noise bursts in the 50- and 1000-ms ranges, 2004, Acta Psychologica.

[6] Robert A. J. Clark, et al. Native and non-native speaker judgements on the quality of synthesized speech, 2010, INTERSPEECH.

[7] E. Glenn Schellenberg, et al. Asymmetries in the Discrimination of Musical Intervals: Going Out-of-Tune Is More Noticeable Than Going In-Tune, 2001.

[8] E. Rosch. Cognitive reference points, 1975, Cognitive Psychology.

[9] Åke Hellström. Anatomy of stimulus comparison, 2005.

[10] J. Edward Jackson, et al. The User's Guide to Multidimensional Scaling, 1985.

[11] Anna C. Janska. Further Investigation of MDS as a Tool for Evaluation of Speech Quality of Synthesized Speech, 2009.

[12] Simon King, et al. The Blizzard Challenge 2008, 2008.

[13] Robert A. J. Clark, et al. Further exploration of the possibilities and pitfalls of multidimensional scaling as a tool for the evaluation of the quality of synthesized speech, 2010, SSW.