Pattern Discovery Techniques for Music Audio

Human listeners are able to recognize structure in music through the perception of repetition and other relationships within a piece of music. This work aims to automate the task of music analysis. Music is “explained” in terms of embedded relationships, especially repetition of segments or phrases. The steps in this process are the transcription of audio into a representation with a similarity or distance metric, the search for similar segments, forming clusters of similar segments, and explaining music in terms of these clusters. Several pre-existing signal analysis methods have been used: monophonic pitch estimation, chroma (spectral) representation, and polyphonic transcription followed by harmonic analysis. Also, several algorithms that search for similar segments are described. Experience with these various approaches suggests that there are many ways to recover structure from music audio. Examples are offered using classical, jazz, and rock music.

[1]  Anssi Klapuri,et al.  AUTOMATIC TRANSCRIPTION OF MUSIC , 2003 .

[2]  Mark Goldstein,et al.  On Finding Rhythmic Patterns in Musical Lines , 1985, ICMC.

[3]  Simon Dixon,et al.  A Lightweight Multi-agent Musical Beat Tracking System , 2000, PRICAI.

[4]  David Sankoff,et al.  Comparison of musical sequences , 1990, Comput. Humanit..

[5]  Yoichi Muraoka,et al.  Musical understanding at the beat level: real-time beat tracking for audio signals , 1998 .

[6]  E. Narmour Music Expectation by Cognitive Rule-Mapping , 2000 .

[7]  Darrell Conklin,et al.  Representation and Discovery of Multiple Viewpoint Patterns , 2001, ICMC.

[8]  Mark Sandler,et al.  Segmentation of Musical Signals Using Hidden Markov Models. , 2001 .

[9]  Jonathan Foote,et al.  Visualizing Musical Structure and Rhythm via Self-Similarity , 2001, ICMC.

[10]  Mark Sandler,et al.  Finding Repeating Patterns in Acoustic Musical Signals : Applications for Audio Thumbnailing , 2002 .

[11]  Shlomo Dubnov,et al.  Automatic Modeling of Musical Style , 2001, ICMC.

[12]  Stephen Schwanauer,et al.  Pattern in Music , 1993 .

[13]  Bruce W. Pennycook,et al.  Real-time Recognition of Melodic Fragments Using the Dynamic Timewarp Algorithm , 1993, ICMC.

[14]  Masataka Goto A predominant-F/sub 0/ estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[15]  Eleanor Selfridge-Field,et al.  Melodic Similarity : concepts, procedures, and applications , 1998 .

[16]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[17]  Ichiro Fujinaga,et al.  Realtime Recognition of Orchestral Instruments , 2000, International Conference on Mathematics and Computing.

[18]  Shane S. Sturrock,et al.  Time Warps, String Edits, and Macromolecules – The Theory and Practice of Sequence Comparison . David Sankoff and Joseph Kruskal. ISBN 1-57586-217-4. Price £13.95 (US$22·95). , 2000 .

[19]  Roger B. Dannenberg,et al.  Pattern Discovery Techniques for Music Audio , 2002 .

[20]  Ning Hu,et al.  Discovering Musical Structure in Audio Recordings , 2002, ICMAI.

[21]  G. H. Wakefield,et al.  To catch a chorus: using chroma-based representations for audio thumbnailing , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[22]  William P. Birmingham,et al.  Algorithms for Chordal Analysis , 2002, Computer Music Journal.

[23]  Takuya Fujishima,et al.  Realtime Chord Recognition of Musical Sound: a System Using Common Lisp Music , 1999, ICMC.

[24]  David Cope,et al.  Experiments In Musical Intelligence , 1996 .

[25]  Colin Meek,et al.  Thematic Extractor , 2001, ISMIR.

[26]  Ilya Shmulevich,et al.  Graph-based smoothing of class data with applications in musical key finding , 1999, NSIP.

[27]  Ning Hu,et al.  A comparison of melodic database retrieval techniques using sung queries , 2002, JCDL '02.

[28]  Davide Rocchesso,et al.  SEPARATION OF SPEECH SIGNAL FROM COMPLEX AUDITORY SCENES , 2001 .

[29]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[30]  R. Cattell,et al.  Formal representation of human judgment , 1968 .

[31]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[32]  Jean-Gabriel Ganascia,et al.  Musical Pattern Extraction and Similarity Assessment , 2000, Readings in Music and Artificial Intelligence.

[33]  Sudha Rani Narasimhan Computer identification of musical instruments , 1992 .

[34]  William P. Birmingham,et al.  MUSART: Music Retrieval Via Aural Queries , 2001, ISMIR.

[35]  Roger B. Dannenberg,et al.  An On-Line Algorithm for Real-Time Accompaniment , 1984, ICMC.

[36]  Matija Marolt Transcription of polyphonic piano music with neural networks , 2000, 2000 10th Mediterranean Electrotechnical Conference. Information Technology and Electrotechnology for the Mediterranean Countries. Proceedings. MeleCon 2000 (Cat. No.00CH37099).