Development and analysis of an International Speech Test Signal (ISTS)

Abstract

Analysing how a hearing instrument processes speech requires a standard test signal that allows reproducible measurement conditions and has as many of the most relevant properties of natural speech as possible, e.g. the average speech spectrum, the modulation spectrum, the variation of the fundamental frequency together with its harmonics, and the comodulation in different frequency bands. Existing artificial signals do not adequately fulfil these requirements. Moreover, recordings from natural speakers represent only one language and are therefore not internationally acceptable. For this reason, an International Speech Test Signal (ISTS) was developed. It is based on natural recordings but is largely unintelligible because of segmentation and remixing. When the signal is used for hearing aid measurements, the gain of a device can be described at different percentiles of the speech level distribution. The primary intention is to include this test signal with a new measurement method in a new hearing aid standard (IEC 60118-15).
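The idea of describing gain at different percentiles of the speech level distribution can be illustrated with a minimal sketch. It is not the procedure of IEC 60118-15 and not the authors' analysis: the function names, the 125 ms non-overlapping window, the broadband (rather than band-wise) analysis, the percentile choice of 30, 65 and 99, and the definition of gain as the difference between output and input levels at the same percentile are all assumptions made for illustration.

```python
import math


def short_term_levels_db(samples, sample_rate, window_s=0.125):
    """RMS level in dB (re full scale 1.0) of consecutive, non-overlapping windows.

    The window length is an assumed value; silent windows are skipped
    because their level is undefined on a dB scale.
    """
    n = int(round(window_s * sample_rate))
    levels = []
    for start in range(0, len(samples) - n + 1, n):
        block = samples[start:start + n]
        mean_square = sum(x * x for x in block) / n
        if mean_square > 0.0:
            levels.append(10.0 * math.log10(mean_square))
    return levels


def percentile(values, p):
    """Percentile with linear interpolation between order statistics, p in [0, 100]."""
    if not values:
        raise ValueError("percentile of an empty sequence")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100.0
    lo = math.floor(rank)
    hi = math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def percentile_gain_db(input_signal, output_signal, sample_rate,
                       percentiles=(30, 65, 99)):
    """Hypothetical percentile gain: output level minus input level at each percentile."""
    in_levels = short_term_levels_db(input_signal, sample_rate)
    out_levels = short_term_levels_db(output_signal, sample_rate)
    return {p: percentile(out_levels, p) - percentile(in_levels, p)
            for p in percentiles}
```

Under these assumptions, a linear device yields the same gain at every percentile, whereas a compressive device would be expected to show less gain at the high percentiles (speech peaks) than at the low ones (soft segments). Calling `percentile_gain_db(recorded_input, recorded_output, sample_rate)` on two aligned sample sequences returns a dictionary keyed by percentile.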
