SRA: A Web-based Research Tool for Spectral and Roughness Analysis of Sound Signals

SRA is a web-based tool that performs Spectral and Roughness Analysis on user-submitted sound files (.wav and .aif formats). Spectral analysis incorporates an improved Short-Time Fourier Transform (STFT) algorithm (1-2) and automates spectral peak-picking using Loris open- source C++ class library components. Users can set three spectral analysis/peak-picking parameters: analysis bandwidth, spectral-amplitude normalization, and spectral- amplitude threshold. These are described in detail within the tool, including suggestions on settings appropriate to the submitted files and research questions of interest. The spectral values obtained from the analysis enter a roughness calculation model (3-4), outputting roughness values at user- specified points within a file or roughness profiles at user- specified time intervals. The tool offers research background on spectral analysis, auditory roughness, and the algorithms used, including links to relevant publications. Spectral and roughness analysis of sound signals finds applications in music cognition, musical analysis, speech processing, and music teaching research, as well as in medicine and other areas. Presentation of the spectral analysis technique, the roughness estimation model, and the online tool is followed by a discussion of research studies employing the tool and an outline of future possible applications.

[1]  A. M. Mimpen,et al.  The ear as a frequency analyzer. II. , 1964, The Journal of the Acoustical Society of America.

[2]  Clive A. Greated,et al.  The Musician's Guide to Acoustics , 1987 .

[3]  Lippold Haken,et al.  Cell-Utes and Flutter-Tongued Cats: Sound Morphing Using Loris and the Reassigned Bandwidth-Enhanced Model , 2003, Computer Music Journal.

[4]  Nicola Dibben The Perception of Structural Stability in Atonal Music: The Influence of Salience, Stability, Horizontal Motion, Pitch Commonality, and Dissonance , 1999 .

[5]  Murray Campbell,et al.  The Musician's Guide to Acoustics , 1990 .

[6]  Pantelis N. Vassilakis AM depth versus degree of amplitude fluctuation: Implementation error, adjustment, and implications , 2000 .

[7]  Kelly Fitz,et al.  Algorithms for computing the time-corrected instantaneous frequency (reassigned) spectrogram, with applications. , 2004, The Journal of the Acoustical Society of America.

[8]  Sean A. Fulop,et al.  A Spectrogram for the Twenty-First Century , 2006 .

[9]  William M. Hartmann,et al.  Roughness and the critical bandwidth at low frequency , 1995 .

[10]  F. Lerdahl,et al.  Perception of musical tension in short chord sequences: The influence of harmonic function, sensory dissonance, horizontal motion, and musical training , 1996, Perception & psychophysics.

[11]  William Hutchinson,et al.  The acoustic component of western consonance , 1978 .

[12]  Kelly Fitz,et al.  Separation of components from impulses in reassigned spectrograms. , 2007, The Journal of the Acoustical Society of America.

[13]  R. Plomp,et al.  Tonal consonance and critical bandwidth. , 1965, The Journal of the Acoustical Society of America.

[14]  E. Zwicker,et al.  Subdivision of the audible frequency range into critical bands , 1961 .

[15]  Pantelis N. Vassilakis An improvisation on the Middle‐Eastern mijwiz; auditory roughness profiles and tension/release patterns , 2005 .

[16]  J. Vos Purity Ratings of Tempered Fifths and Major Thirds , 1986 .

[17]  E. Terhardt On the perception of periodic sound fluctuations (roughness) , 1974 .

[18]  H. Helmholtz,et al.  On the Sensations of Tone as a Physiological Basis for the Theory of Music , 2005 .

[19]  P. Coleman,et al.  Experiments in hearing , 1961 .

[20]  Patrick Flandrin,et al.  Improving the readability of time-frequency and time-scale representations by the reassignment method , 1995, IEEE Trans. Signal Process..

[21]  A. Kameoka,et al.  Consonance theory part I: consonance of dyads. , 1969, The Journal of the Acoustical Society of America.

[22]  A. Kameoka,et al.  Consonance theory part II: consonance of complex tones and its calculation method. , 1969, The Journal of the Acoustical Society of America.

[23]  Kelly Fitz,et al.  Correction to: 'On the Use of Time/Frequency Reassignment in Additive Sound Modeling' , 2002 .

[24]  Vangelis Lympouridis,et al.  Proceedings SMC'07, 4th Sound and Music Computing Conference , 2007 .

[25]  P. Vassilakis,et al.  Auditory roughness as means of musical expression , 2005 .

[26]  S McAdams,et al.  Two phase effects in roughness perception. , 1999, The Journal of the Acoustical Society of America.

[27]  William Hsu,et al.  Managing Gesture and Timbre for Analysis and Instrument Control in an Interactive Environment , 2006, NIME.

[28]  Niall J. L. Griffith,et al.  Multidimensional timbre analysis of shakuhachi honkyoku , 2005 .