A corpus-based toy model for DisCoCat

The categorical compositional distributional (DisCoCat) model of meaning rigorously connects distributional semantics and pregroup grammars, and has found a variety of applications in computational linguistics. From a more abstract standpoint, the DisCoCat paradigm predicates the construction of a mapping from syntax to categorical semantics. In this work we present a concrete construction of one such mapping, from a toy model of syntax for corpora annotated with constituent structure trees, to categorical semantics taking place in a category of free R-semimodules over an involutive commutative semiring R.

[1]  Dimitri Kartsaklis,et al.  Reasoning about Meaning in Natural Language with Compact Closed Categories and Frobenius Algebras , 2014, ArXiv.

[2]  Zellig S. Harris,et al.  Mathematical structures of language , 1968, Interscience tracts in pure and applied mathematics.

[3]  Joachim Lambek,et al.  Type Grammar Revisited , 1997, LACL.

[4]  Mark Steedman,et al.  The syntactic process , 2004, Language, speech, and communication.

[5]  Stephen Clark,et al.  The Frobenius anatomy of word meanings II: possessive relative pronouns , 2014, J. Log. Comput..

[6]  J. Firth,et al.  Papers in linguistics, 1934-1951 , 1957 .

[7]  Peter Selinger,et al.  Dagger Compact Closed Categories and Completely Positive Maps: (Extended Abstract) , 2007, QPL.

[8]  Dimitri Kartsaklis,et al.  A Frobenius Model of Information Structure in Categorical Compositional Distributional Semantics , 2015, ArXiv.

[9]  Stephen Clark,et al.  Mathematical Foundations for a Compositional Distributional Model of Meaning , 2010, ArXiv.

[10]  Dimitri Kartsaklis,et al.  Open System Categorical Quantum Semantics in Natural Language Processing , 2015, CALCO.

[11]  J. Zwart The Minimalist Program , 1998, Journal of Linguistics.

[12]  J. Firth Papers in linguistics , 1958 .

[13]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[14]  J. Lambek The Mathematics of Sentence Structure , 1958 .

[15]  Mehrnoosh Sadrzadeh,et al.  Distributional Sentence Entailment Using Density Matrices , 2015, TTCS.

[16]  Samson Abramsky,et al.  Categorical quantum mechanics , 2008, 0808.1023.

[17]  Stephen Clark,et al.  The Frobenius anatomy of word meanings I: subject and object relative pronouns , 2013, J. Log. Comput..

[18]  Dov M. Gabbay,et al.  Handbook of Quantum Logic and Quantum Structures: Quantum Logic , 2009 .