Patterns Based Classifiers

Data mining is one of the most important areas in the 21 century for its applications are wide ranging. This includes medicine, finance, commerce and engineering, to name a few. Pattern mining is amongst the most important and challenging techniques employed in data mining. Patterns are collections of items which satisfy certain properties. Emerging Patterns are those whose frequencies change significantly from one dataset to another. They represent strong contrast knowledge and have been shown very successful for constructing accurate and robust classifiers. In this paper, we examine various kinds of patterns. We also investigate efficient pattern mining techniques and discuss how to exploit patterns to construct effective classifiers.

[1]  Kotagiri Ramamohanarao,et al.  Using emerging patterns and decision trees in rare-class classification , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[2]  James Bailey,et al.  A fast algorithm for computing hypergraph transversals and its application in mining emerging patterns , 2003, Third IEEE International Conference on Data Mining.

[3]  Huiqing Liu,et al.  Simple rules underlying gene expression profiles of more than six subtypes of acute lymphoblastic leukemia (ALL) patients , 2003, Bioinform..

[4]  Dimitrios Gunopulos,et al.  Data mining, hypergraph transversals, and machine learning (extended abstract) , 1997, PODS '97.

[5]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[6]  Nils J. Nilsson,et al.  MLC++, A Machine Learning Library in C++. , 1995 .

[7]  Kotagiri Ramamohanarao,et al.  Incremental Maintenance on the Border of the Space of Emerging Patterns , 2004, Data Mining and Knowledge Discovery.

[8]  Kotagiri Ramamohanarao,et al.  Instance-Based Classification by Emerging Patterns , 2000, PKDD.

[9]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[10]  Dimitrios Gunopulos,et al.  Data mining, hypergraph transversals, and machine learning (extended abstract) , 1997, PODS.

[11]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[12]  Huiqing Liu,et al.  Discovery of significant rules for classifying cancer diagnosis data , 2003, ECCB.

[13]  Kotagiri Ramamohanarao,et al.  The Space of Jumping Emerging Patterns and Its Incremental Maintenance Algorithms , 2000, ICML.

[14]  Jinyan Li,et al.  Identifying good diagnostic gene groups from gene expression profiles using the concept of emerging patterns. , 2002 .

[15]  James Bailey,et al.  Fast Algorithms for Mining Emerging Patterns , 2002, PKDD.

[16]  Ronald Christensen,et al.  Log-Linear Models and Logistic Regression , 1997 .

[17]  Kotagiri Ramamohanarao,et al.  A Bayesian Approach to Use Emerging Patterns for Classification , 2003, ADC.

[18]  Evangelos Simoudis,et al.  Mining business databases , 1996, CACM.

[19]  Kotagiri Ramamohanarao,et al.  Exploring constraints to efficiently mine emerging patterns from large high-dimensional datasets , 2000, KDD '00.

[20]  Vipin Kumar,et al.  Mining needle in a haystack: classifying rare classes via two-phase rule induction , 2001, SIGMOD '01.

[21]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[22]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[23]  Jinyan Li,et al.  CAEP: Classification by Aggregating Emerging Patterns , 1999, Discovery Science.

[24]  Zhou Wang,et al.  Exploiting Maximal Emerging Patterns for Classification , 2004, Australian Conference on Artificial Intelligence.

[25]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[26]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[27]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[28]  Kotagiri Ramamohanarao,et al.  The Application of Emerging Patterns for Improving the Quality of Rare-Class Classification , 2004, PAKDD.

[29]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[30]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[31]  Michèle Sebag,et al.  Delaying the Choice of Bias: A Disjunctive Version Space Approach , 1996, ICML.

[32]  J. Hoffman Numerical Methods for Engineers and Scientists , 2018 .

[33]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[34]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[35]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[36]  Kotagiri Ramamohanarao,et al.  An Efficient Single-Scan Algorithm for Mining Essential Jumping Emerging Patterns for Classification , 2002, PAKDD.

[37]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[38]  Dr. Alex A. Freitas Data Mining and Knowledge Discovery with Evolutionary Algorithms , 2002, Natural Computing Series.

[39]  James Bailey,et al.  Classification Using Constrained Emerging Patterns , 2003, WAIM.

[40]  Kotagiri Ramamohanarao,et al.  Fast discovery and the generalization of strong jumping emerging patterns for building compact and accurate classifiers , 2006, IEEE Transactions on Knowledge and Data Engineering.

[41]  Richard J. Cleary,et al.  Statistical Methods for Engineers , 1999 .

[42]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.

[43]  Kotagiri Ramamohanarao,et al.  Combining the Strength of Pattern Frequency and Distance for Classification , 2001, PAKDD.

[44]  R. Kotagiri,et al.  Expanding the Training Data Space Using Emerging Patterns and Genetic Methods , 2005 .

[45]  R. M. Bethea,et al.  Statistical Methods for Engineers and Scientists. , 1985 .

[46]  Stephen D. Bay,et al.  Detecting Group Differences: Mining Contrast Sets , 2001, Data Mining and Knowledge Discovery.

[47]  Kotagiri Ramamohanarao,et al.  DeEPs: A New Instance-Based Lazy Discovery and Classification System , 2004, Machine Learning.

[48]  Kotagiri Ramamohanarao,et al.  Efficiently Mining Interesting Emerging Patterns , 2003, WAIM.

[49]  Kotagiri Ramamohanarao,et al.  A weighting scheme based on emerging patterns for weighted support vector machines , 2005, 2005 IEEE International Conference on Granular Computing.

[50]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[51]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[52]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[53]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[54]  Roberto J. Bayardo,et al.  Efficiently mining long patterns from databases , 1998, SIGMOD '98.

[55]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[56]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[57]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..