Discovering simple rules in complex data: A meta-learning algorithm and some surprising musical discoveries

This article presents a new rule discovery algorithm named PLCG that can find simple, robust partial rule models (sets of classification rules) in complex data where it is difficult or impossible to find models that completely account for all the phenomena of interest. Technically speaking, PLCG is an ensemble learning method that learns multiple models via some standard rule learning algorithm, and then combines these into one final rule set via clustering, generalization, and heuristic rule selection. The algorithm was developed in the context of an interdisciplinary research project that aims at discovering fundamental principles of expressive music performance from large amounts of complex real-world data (specifically, measurements of actual performances by concert pianists). It will be shown that PLCG succeeds in finding some surprisingly simple and robust performance principles, some of which represent truly novel and musically meaningful discoveries. A set of more systematic experiments shows that PLCG usually discovers significantly simpler theories than more direct approaches to rule learning (including the state-of-the-art learning algorithm RIPPER), while striking a compromise between coverage and precision. The experiments also show how easy it is to use PLCG as a meta-learning strategy to explore different parts of the space of rule models.

[1]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[2]  Raúl E. Valdés-Pérez,et al.  Machine Discovery in Chemistry: New Results , 1995, Artif. Intell..

[3]  Raúl E. Valdés-Pérez,et al.  Principles of Human Computer Collaboration for Knowledge Discovery in Science , 1999, Artif. Intell..

[4]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[5]  William W. Cohen Efficient Pruning Methods for Separate-and-Conquer Rule Learning Systems , 1993, IJCAI.

[6]  Johannes Fürnkranz,et al.  Incremental Reduced Error Pruning , 1994, ICML.

[7]  Gerhard Widmer,et al.  Large-scale Induction of Expressive Performance Rules: First Quantitative Results , 2000, ICMC.

[8]  Gerhard Widmer Combining Knowledge-Based and Instance-Based Learning to Exploit Qualitative Knowledge , 1993, Informatica.

[9]  Simon Dixon,et al.  Beat Tracking with Musical Knowledge , 2000, ECAI.

[10]  Pedro M. Domingos Knowledge Discovery Via Multiple Models , 1998, Intell. Data Anal..

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Emilios Cambouropoulos,et al.  From MIDI to Traditional Musical Notation , 2000 .

[13]  S. Muggleton,et al.  Protein secondary structure prediction using logic-based machine learning. , 1992, Protein engineering.

[14]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[15]  Raúl E. Valdés-Pérez,et al.  A New Theorem in Particle Physics Enabled by Machine Discovery , 1996, Artif. Intell..

[16]  JOHANNES FÜRNKRANZ,et al.  Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[17]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[18]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[19]  Gerhard Widmer,et al.  Using AI and machine learning to study expressive music performance: project survey and first report , 2001, AI Commun..

[20]  Emilios Cambouropoulos Automatic Pitch Spelling: From Numbers to Sharps and Flats , 2001 .

[21]  E. Vald Principles of human-computer collaboration for knowledge discovery in science , 1999 .

[22]  Tom Fawcett,et al.  Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions , 1997, KDD.

[23]  Richard A. Lewis,et al.  Drug design by machine learning: the use of inductive logic programming to model the structure-activity relationships of trimethoprim analogues binding to dihydrofolate reductase. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Gerhard Widmer,et al.  Machine Discoveries: A Few Simple, Robust Local Expression Principles , 2002 .

[25]  Sholom M. Weiss,et al.  Rule-based Machine Learning Methods for Functional Prediction , 1995, J. Artif. Intell. Res..

[26]  Pedro M. Domingos Unifying Instance-Based and Rule-Based Induction , 1996, Machine Learning.

[27]  Johan Sundberg,et al.  How can music be expressive? , 1993, Speech Commun..

[28]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[29]  Lawrence Hunter,et al.  Artificial Intelligence and Molecular Biology , 1992, AI Mag..

[30]  J. Ross Quinlan,et al.  Learning logical definitions from relations , 1990, Machine Learning.

[31]  Rá Ul,et al.  A New Theorem in Particle Physics Enabled by Machine Discovery , 2022 .

[32]  Gerhard Widmer,et al.  The Musical Expression Project: A Challenge for Machine Learning and Knowledge Discovery , 2001, PKDD.

[33]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[34]  Rá Ul,et al.  Machine Discovery in Chemistry: New Results , 1995 .

[35]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[36]  Gerhard Widmer,et al.  In Search of the Horowitz Factor: Interim Report on a Musical Discovery Project , 2002, ALT.

[37]  Jude Shavlik,et al.  Using neural networks to refine existing biological knowledge , 1992 .