A Memory-Based Model for Music Analysis: Challenging the Gestalt Principles

We argue for a memory-based approach to music analysis which works with concrete musical experiences rather than with abstract rules or principles. New pieces of music are analyzed by combining fragments from structures of previously encountered pieces. The occurrence-frequencies of the fragments are used to determine the preferred analysis of a piece. We test some instances of this approach against a set of 1,000 manually annotated folksongs from the Essen Folksong Collection, yielding up to 85.9% phrase accuracy. A qualitative analysis of our results indicates that there are grouping phenomena that challenge the commonly accepted Gestalt principles of proximity, similarity and parallelism. These grouping phenomena can neither be explained by other musical factors, such as meter and harmony. We argue that music perception may be much more memory-based than previously assumed.

[1]  L. Polansky,et al.  Temporal Gestalt Perception in Music , 1980 .

[2]  Jean-Gabriel Ganascia,et al.  Musical Pattern Extraction and Similarity Assessment , 2000, Readings in Music and Artificial Intelligence.

[3]  Thomas H. Stoffer,et al.  Representation of Phrase Structure in the Perception of Music , 1985 .

[4]  Stephanie Seneff,et al.  TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[5]  Rajeev Raman,et al.  String-Matching techniques for musical similarity and melodic recognition , 1998 .

[6]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[7]  Eugene Narmour,et al.  The Analysis and Cognition of Basic Melodic Structures: The Implication-Realization Model , 1990 .

[8]  C. S. Wetherell,et al.  Probabilistic Languages: A Review and Some Open Questions , 1980, CSUR.

[9]  S. Handel Listening As Introduction to the Perception of Auditory Events , 1989 .

[10]  William H. Press,et al.  Numerical recipes in C , 2002 .

[11]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[12]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[13]  Emilios Cambouropoulos,et al.  Musical Rhythm: A Formal Model for Determining Local Boundaries, Accents and Metre in a Melodic Surface , 1996, Joint International Conference on Cognitive and Systematic Musicology.

[14]  Dominik Hörnel,et al.  Investigating the Influence of Representations and Algorithms in Music Classification , 2001, Comput. Humanit..

[15]  Kenneth Ward Church,et al.  A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams , 1991 .

[16]  Max Wertheimer,et al.  Untersuchungen zur Lehre von der Gestalt , .

[17]  Eugene Charniak,et al.  Tree-Bank Grammars , 1996, AAAI/IAAI, Vol. 2.

[18]  Jenny R Saffran,et al.  Infant memory for musical experiences , 2000, Cognition.

[19]  D. Deutsch,et al.  The Internal Representation of Pitch Sequences in Tonal Music , 1981 .

[20]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[21]  Rens Bod,et al.  Using an Annotated Language Corpus as a Virtual Stochastic Grammar , 1993, AAAI.

[22]  E. Narmour The Analysis and Cognition of Melodic Complexity: The Implication-Realization Model , 1992 .

[23]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[24]  Taylor L. Booth,et al.  Probabilistic Representation of Formal Languages , 1969, SWAT.

[25]  David Cope Pattern Matching as an Engine for the Computer Simulation of Musical Style , 1990, ICMC.

[26]  Udo Mattusch,et al.  Emulating Gestalt Mechanisms by Combining Symbolic and Subsymbolic Information Processing Procedures , 1996, Joint International Conference on Cognitive and Systematic Musicology.

[27]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[28]  Rens Bod,et al.  Beyond Grammar: An Experience-Based Theory of Language , 1998 .

[29]  M. Wertheimer Untersuchungen zur Lehre von der Gestalt. II , 1923 .

[30]  Nick Chater,et al.  Distributional Bootstrapping: From Word Class to Proto-Sentence , 2019, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.

[31]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[32]  Ralph Grishman,et al.  A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars , 1991, HLT.

[33]  H. C. Longuet-Higgins,et al.  Perception of melodies , 1976, Nature.

[34]  E. Cambouropoulos A Formal Theory for the Discovery of Local Boundaries in a Melodic Surface. , 1996 .

[35]  Menno van Zaanen Bootstrapping Syntax and Recursion using Alginment-Based Learning , 2000, ICML.