We present a modular architecture for processing informal mathematical language as found in textbooks and mathematical publica- tions. We point at its properties relevant in addressing three aspects of informal mathematical discourse: (i) the interleaved symbolic and natu- ral language, (ii) the linguistic, domain, and notational context, and (iii) the imprecision of the informal language. The objective in the modular approach is to enable parameterisation of the system with respect to the natural language of the text and the mathematical domain of discourse. Informal mathematical discourse in textbooks and mathematical publications is partly written in natural language and partly in a symbolic notation—even within a single utterance. Be it information retrieval or text mining mathematical documents, flexible human-oriented mathematical user interfaces, or automated verification of informal proofs crucially rely on automated analysis of the informal language. In (8,9,2) we presented methods of, respectively: parsing, lexical analysis, and domain-specific interpretation of informal mathematical proofs, and introduced linguistic resources necessary for processing. In this paper, we present a modular architecture of a system for processing mathematical language based on those resources and emphasise three core aspects of the informal mathematical discourse it addresses: the interleaved symbolic and natural language, the linguistic, domain, and notational context, and the imprecision of the informal language. The objective in the modular approach is to enable parameterisation of the system with respect to the natural language of the text in question, the mathematical domain of the discourse, and the mathematical notation. We first briefly discuss the above-mentioned aspects of the mathematical language based on example utterances, then we present our processing architecture, and finally outline the related work on mathematical discourse and our further work.
[1]
Magdalena Wolska,et al.
Modeling anaphora in informal mathematical dialogue
,
2006
.
[2]
Magdalena Wolska,et al.
Interpreting semi-formal utterances in dialogs about mathematical proofs
,
2006,
Data Knowl. Eng..
[3]
Ivana Kruijff-Korbayová,et al.
Analysis of Mixed Natural and Symbolic Input in Mathematical Dialogs
,
2004,
ACL.
[4]
Fairouz Kamareddine,et al.
Restoring Natural Language as a Computerised Mathematics Input Method
,
2007,
Calculemus/MKM.
[5]
Sabina Jeschke,et al.
Natural Language Processing Methods for Extracting Information from Mathematical Texts
,
2008
.
[6]
Uwe Reyle,et al.
From Discourse to Logic - Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory
,
1993,
Studies in linguistics and philosophy.
[7]
Claus Zinn,et al.
Supporting the formal verification of mathematical texts
,
2006,
J. Appl. Log..
[8]
Uwe Reyle,et al.
From discourse to logic
,
1993
.
[9]
G. Lakoff,et al.
Where Mathematics Comes From
,
2000
.
[10]
Magdalena Wolska,et al.
Lexical-semantic interpretation of language input in mathematical dialogs
,
2004
.
[11]
Petr Sgall,et al.
The Meaning Of The Sentence In Its Semantic And Pragmatic Aspects
,
1986
.
[12]
G. Lakoff,et al.
Where mathematics comes from : how the embodied mind brings mathematics into being
,
2002
.
[13]
Varol Akman,et al.
Book Review -- Hans Kamp and Uwe Reyle, From Discourse to Logic: Introduction to Model-theoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory
,
1995
.