Hierarchical Linear Disentanglement of Data-Driven Conceptual Spaces

Conceptual spaces are geometric meaning representations in which similar entities are represented by similar vectors. They are widely used in cognitive science, but there has been relatively little work on learning such representations from data. In particular, while standard representation learning methods can be used to induce vector space embeddings from text corpora, these differ from conceptual spaces in two crucial ways. First, the dimensions of a conceptual space correspond to salient semantic features, known as quality dimensions, whereas the dimensions of learned embeddings typically lack any clear interpretation. This has been partially addressed in previous work, which has shown that it is possible to identify directions in learned vector spaces which capture semantic features. Second, conceptual spaces are normally organised into a set of domains, each of which is associated with a separate vector space. In contrast, learned embeddings represent all entities in a single vector space. Our hypothesis in this paper is that such single-space representations are sub-optimal for learning quality dimensions, due to the fact that semantic features are often only relevant to a subset of the entities. We show that this issue can be mitigated by identifying features in a hierarchical fashion. Intuitively, the top-level features split the vector space into domains,allowing us to subsequently identify domain-specific quality dimensions.

[1]  Steven Schockaert,et al.  MEmbER: Max-Margin Based Embeddings for Entity Retrieval , 2017, SIGIR.

[2]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[3]  Phil Blunsom,et al.  Neural Variational Inference for Text Processing , 2015, ICML.

[4]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[5]  Gemma Boleda,et al.  Distributional vectors encode referential attributes , 2015, EMNLP.

[6]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[7]  Pieter Abbeel,et al.  InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[8]  Roberto Navigli,et al.  Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities , 2016, Artif. Intell..

[9]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.

[10]  Marie-Catherine de Marneffe,et al.  Deriving Adjectival Scales from Continuous Space Word Representations , 2013, EMNLP.

[11]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[12]  Bernhard Schölkopf,et al.  Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations , 2018, ICML.

[13]  Samy Bengio,et al.  Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.

[14]  Steven Schockaert,et al.  Inducing semantic relations from conceptual spaces: A data-driven approach to plausible reasoning , 2015, Artif. Intell..

[15]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[16]  Matthew Purver,et al.  Conceptualizing Creativity: From Distributional Semantics to Conceptual Spaces , 2015, ICCC.

[17]  Steven Schockaert,et al.  Modelling Salient Features as Directions in Fine-Tuned Semantic Spaces , 2018, CoNLL.

[18]  John G. Breslin,et al.  A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis , 2016, EMNLP.

[19]  R. Shepard Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space , 1957 .

[20]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[21]  Byron C. Wallace,et al.  Learning Disentangled Representations of Texts with Application to Biomedical Abstracts , 2018, EMNLP.

[22]  Alex McLean,et al.  Unifying Conceptual Spaces: Concept Formation in Musical Creative Systems , 2010, Minds and Machines.

[23]  Wallace Koehler,et al.  Information science as "Little Science":The implications of a bibliometric analysis of theJournal of the American Society for Information Science , 2001, Scientometrics.

[24]  N. Foo Conceptual Spaces—The Geometry of Thought , 2022 .