Learning and Generalization of Abstract Semantic Relations: Preliminary Investigation of Bayesian Approaches

Learning and Generalization of Abstract Semantic Relations: Preliminary Investigation of Bayesian Approaches Dawn Chen (sdchen@ucla.edu) Department of Psychology Hongjing Lu (hongjing@ucla.edu) Departments of Psychology and Statistics Keith J. Holyoak (holyoak@lifesci.ucla.edu) Department of Psychology University of California, Los Angeles Los Angeles, CA 90095 USA lakes, where the former sentence type contains an antonymous pair (Glass, Holyoak & Kossan, 1977), suggesting that some sense of antonymy is available prior to any formal instruction about this concept. Abstract A deep problem in cognitive science is to explain the acquisition of abstract semantic relations, such as antonymy and synonymy. Are such relations necessarily part of an innate representational endowment provided to humans? Or, is it possible for a learning system to acquire abstract relations from non-relational inputs of realistic complexity (avoiding hand-coding)? We present a series of computational experiments using Bayesian methods in an effort to learn and generalize abstract semantic relations, using as inputs pairs of specific concepts represented by feature vectors created by Latent Semantic Analysis. The Problem of Relation Learning Regardless of whether abstract relations are learned or mature over the course of development, there is no doubt that adults can distinguish among instances of relations such as antonymy versus synonymy. In the present paper we pose the following computational problem: Given as inputs a modest number of pairs of concepts that instantiate an abstract relation (e.g., day-night and hot-cold, which instantiate antonymy), is it possible to extract a representation of the abstract relation that may then be used to accurately classify novel instantiations (e.g., valley- mountain)? Most recent connectionist models of relation learning (e.g., Rogers & McClelland, 2008) have focused on the acquisition of small numbers of specific input-output pairs (e.g., “canary” + “can” → “fly”), but have not demonstrated the capacity to generalize to novel inputs dissimilar to the training items. In contrast, achieving such generalization is the central aim of our project. Moreover, an important constraint we imposed is that inputs to the learning system could not be hand-coded, as has been commonplace in the literature on computational models of analogy and relation learning. For example, Doumas, Hummel, and Sandhofer (2008) showed how structured relations corresponding to relative adjectives such as bigger-than can be extracted by bottom-up mechanisms given inputs consisting of unstructured feature vectors of objects. However, the modelers ensured that “size” features were present among the relatively small feature set defining the inputs, setting the stage for selecting these size features to form a part of the to-be-learned relational predicate. While perceptual relations may indeed be derived from the perceptual features of objects, this assumption is unwarranted for more abstract relations, for which hand-coding of features is even more problematic. In addition, realistic semantic representations would seem to require very large numbers of features, raising all the difficulties associated with search in a large Keywords: Bayesian inference; induction; generalization; abstract relations; machine learning; LSA Introduction An intelligent human adult can recognize that the concepts day and night are related in much the same way as hot and cold, but not in the same way as day and hour. This ability to appreciate abstract semantic relations is fundamental to analogical reasoning, and is arguably a core component of what is special about the human mind (Penn, Holyoak & Povinelli, 2007). But how are such abstract relations acquired? If they are learned, how this could be achieved is far from obvious. On the face of it, no perceptual or other features seem to be available to represent such abstract relations as antonymy, synonymy, or superordination. Almost by default, it might be assumed that abstract relations must be innate (Fodor, 1975). Research on cognitive development has clearly established the phenomenon of a relational shift (Gentner & Rattermann, 1991), such that children process relations more effectively with increasing age. In particular, children move from a focus on global similarities of objects to similarities defined by specific dimensions, such as size or color (Smith, 1989; Smith & Sera, 1992). Less is known about the development of abstract relations that seem yet further divorced from perceptual similarity (see Halford, 1993). Analyses of corpora of child speech have identified systematic use of antonyms by children aged 2-5 years (Jones & Murphy, 2005). Children aged 6-7 years are more accurate in detecting the falsity of sentences such as Some valleys are mountains as compared to Some valleys are

[1]  G. Halford Children's Understanding: The Development of Mental Models , 1993 .

[2]  Keith J. Holyoak,et al.  Children's Ability to Detect Semantic Contradictions , 1977 .

[3]  D. McDermott LANGUAGE OF THOUGHT , 2012 .

[4]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[5]  Leonidas A A Doumas,et al.  A theory of the discovery and predication of relational concepts. , 2008, Psychological review.

[6]  J. Quesada,et al.  Analogy-making as Predication Using Relational Information and LSA Vectors , 2004 .

[7]  Michael I. Jordan,et al.  Bayesian parameter estimation via variational methods , 2000, Stat. Comput..

[8]  Linda B. Smith,et al.  A developmental analysis of the polar structure of dimensions , 1992, Cognitive Psychology.

[9]  M. Lynne Murphy,et al.  Using Corpora to Investigate Antonym Acquisition , 2005 .

[10]  Ricardo Silva,et al.  Small sets of interacting proteins suggest latent linkage mechanisms through analogical reasoning , 2007 .

[11]  Darwin ’ s mistake : Explaining the discontinuity between human and nonhuman minds , .

[12]  Stella Vosniadou,et al.  Similarity and analogical reasoning: Similarity and Analogical Reasoning , 1989 .

[13]  D. Gentner,et al.  Language and the career of similarity. , 1991 .

[14]  Michael B. W. Wolfe,et al.  Use of latent semantic analysis for predicting psychological phenomena: Two issues and proposed solutions , 2003, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[15]  Charles Kemp,et al.  The discovery of structural form , 2008, Proceedings of the National Academy of Sciences.

[16]  James L. McClelland,et al.  Précis of Semantic Cognition: A Parallel Distributed Processing Approach , 2008, Behavioral and Brain Sciences.

[17]  Michael L. Littman,et al.  Corpus-based Learning of Analogies and Semantic Relations , 2005, Machine Learning.

[18]  Linda B. Smith From global similarities to kinds of similarities: the construction of dimensions in development , 1989 .

[19]  Derek C. Penn,et al.  Darwin's mistake: Explaining the discontinuity between human and nonhuman minds , 2008, Behavioral and Brain Sciences.

[20]  Katherine A. Heller,et al.  Analogical Reasoning with Relational Bayesian Sets , 2007, AISTATS.