Analogy-based classifiers for nominal or numerical data

Abstract Introduced a decade ago, analogy-based classification methods constitute a noticeable addition to the set of instance-based learning techniques. They provide valuable results in terms of accuracy on many classical datasets. They rely on the notion of analogical proportions which are statements of the form “A is to B as C is to D”. Analogical proportions have been in particular formalized in Boolean and numerical settings. In both cases, one of the four components of the proportion can be computed from the three others, when the proportion holds. Analogical classifiers look for all triples of examples in the sample set that are in analogical proportion with the item to be classified on a maximal number of attributes and for which the corresponding analogical proportion equation on the class has a solution. In this paper when classifying a new item, we specially emphasize an approach where the whole set of triples that can be built from the sample set is not considered. We just focus on a small part of the candidate triples. Namely, in order to restrict the scope of the search, we first look for examples that are as similar as possible to the new item to be classified. We then only consider the pairs of examples presenting the same dissimilarity as between the new item and one of its closest neighbors. In this way, we implicitly build triples that are in analogical proportion on all attributes with the new item. Then the classification is made on the basis of an additive aggregation of the truth values corresponding to the pairs that can be analogically associated with the pairs made of the target item and one of its nearest neighbors. We then only deal with pairs leading to a solvable analogical equation for the class. This new algorithm provides results as good as previous analogical classifiers with a lower average complexity, both in nominal and numerical cases.

[1]  Gilles Richard,et al.  Multiple-valued extensions of analogical proportions , 2016, Fuzzy Sets Syst..

[2]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[3]  Henri Prade,et al.  Handling Analogical Proportions in Classical Logic and Fuzzy Logics Settings , 2009, ECSQARU.

[4]  François Yvon,et al.  Du quatrième de proportion comme principe inductif : une proposition et son application à l’apprentissage de la morphologie [Inference with formal analogical proportions: application to the automatic learning of morphology] , 2006, TAL.

[5]  Gilles Richard,et al.  Analogical classification: A new way to deal with examples , 2014, ECAI.

[6]  Philippe Langlais Moranapho: Un Système Multilingue D’analyse Morphologique Basé Sur L’analogie Formelle , 2011 .

[7]  Eyke Hüllermeier,et al.  Combining Instance-Based Learning and Logistic Regression for Multilabel Classification , 2009, ECML/PKDD.

[8]  D. McSherry Case-based reasoning techniques for estimation , 1993 .

[9]  Gilles Richard,et al.  From Analogical Proportion to Logical Proportions , 2013, Logica Universalis.

[10]  P. Langley,et al.  Average-case analysis of a nearest neighbor algorthim , 1993, IJCAI 1993.

[11]  Gilles Richard,et al.  Analogical Classification: Handling Numerical Data , 2014, SUM.

[12]  François Yvon,et al.  An Analogical Learner for Morphological Analysis , 2005, CoNLL.

[13]  Laurent Miclet,et al.  Analogical Proportions in a Lattice of Sets of Alignments Built on the Common Subwords in a Finite Language , 2012, Computational Approaches to Analogical Reasoning.

[14]  Gilles Richard,et al.  Trying to Understand How Analogical Classifiers Work , 2012, SUM.

[15]  D. Rumelhart,et al.  A model for analogical reasoning. , 1973 .

[16]  Marco Ragni,et al.  Analyzing Raven's Intelligence Test: Cognitive Model, Demand, and Complexity , 2014, Computational Approaches to Analogical Reasoning.

[17]  Nobuhiro Yugami,et al.  Theoretical Analysis of the Nearest Neighbor Classifier in Noisy Domains , 1996, ICML.

[18]  Henri Prade,et al.  Oddness/evenness-based classifiers for Boolean or numerical data , 2017, Int. J. Approx. Reason..

[19]  Gilles Richard,et al.  Evenness-Based Reasoning with Logical Proportions Applied to Classification , 2015, SUM.

[20]  Laurent Miclet,et al.  Analogical Dissimilarity: Definition, Algorithms and Two Experiments in Machine Learning , 2008, J. Artif. Intell. Res..

[21]  Gergonne Analise. Application de la méthode des moindres quarrés à l'interpolation des suites , 1816 .

[22]  Gilles Richard,et al.  Homogeneous Logical Proportions: Their Uniqueness and Their Role in Similarity-Based Prediction , 2012, KR.

[23]  Laurent Miclet,et al.  Learning by Analogy: A Classification Rule for Binary and Nominal Data , 2007, IJCAI.

[24]  Susan Craw,et al.  Using Case-Base Data to Learn Adaptation Knowledge for Design , 2001, IJCAI.

[25]  Gilles Richard,et al.  Classification Based on Homogeneous Logical Proportions , 2013, SGAI Conf..

[26]  Stuart J. Russell,et al.  A Logical Approach to Reasoning by Analogy , 1987, IJCAI.

[27]  Yves Lepage,et al.  Analogy and Formal Languages , 2004, FGMOL.

[28]  Gilles Richard,et al.  A New View of Conformity and Its Application to Classification , 2015, ECSQARU.

[29]  Gilles Richard,et al.  Analogical Proportions and Multiple-Valued Logics , 2013, ECSQARU.

[30]  Gilles Richard,et al.  Analogical Classification: A Rule-Based View , 2014, IPMU.

[31]  Mary B. Hesse V — On Defining Analogy , 1960 .

[32]  Gilles Richard,et al.  When intelligence is just a matter of copying , 2012, ECAI.

[33]  Gilles Richard,et al.  Reasoning with Logical Proportions , 2010, KR.

[34]  Gilles Richard,et al.  Analogical Classifiers: A Theoretical Perspective , 2016, ECAI.

[35]  François Yvon,et al.  Formal Models of Analogical Proportions , 2007 .

[36]  Gilles Richard,et al.  Homogenous and Heterogeneous Logical Proportions , 2014, FLAP.

[37]  Laurent Miclet,et al.  Relation d'analogie et distance sur un alphabet défini par des traits , 2004 .

[38]  Henri Prade,et al.  From analogical proportions in lattices to proportional analogies in formal concepts , 2014, ECAI.

[39]  Gilles Richard,et al.  Oddness-Based Classifiers for Boolean or Numerical Data , 2015, KI.

[40]  Gilles Richard,et al.  Multiple-Valued Logic Interpretations of Analogical, Reverse Analogical, and Paralogical Proportions , 2010, 2010 40th IEEE International Symposium on Multiple-Valued Logic.

[41]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .

[42]  Gilles Richard,et al.  Analogy-preserving functions: A way to extend Boolean samples , 2017, IJCAI.