A factuality profiler for eventualities in text

Event factuality is the level of information expressing the factual status of eventualities mentioned in text. That is, it conveys whether eventualities are characterized as corresponding to facts, to possibilities, or to situations that do not hold in the world. As such, it touches on two categories more standardly assumed in the literature: modality and evidentiality. They both have been widely discussed in linguistics and philosophy, but it is not until recently that have started to receive some attention within the area of NLP. Factuality is a necessary component for reasoning about eventualities in discourse. Inferences derived from events that have not happened, or that are possible, are different from those derived from events judged as factual. It is also essential for any task involving temporal ordering. The creation of event timelines needs to be aware of the different status of eventualities presented as uncertain or counterfactual. My dissertation aims at designing and developing a factuality profiler, namely a tool devoted to the identification of the factuality degree associated to eventualities mentioned in discourse. Event factuality cannot be conceived independently from language users, who are understood here as the sources of factuality information. Their inclusion in the model is fundamental. Two sources can assign different factuality values to the same event. Because of that, the factuality profiler must be capable of representing different and possibly contradictory information about the factuality nature of any event. De Facto, the tool I am presenting here, is grounded on the linguistic strategies we speakers employ to signal degrees of factuality in discourse. These involve information at different levels: lexical, syntactic, and rhetoric. De Facto implements an algorithm based on the grammatical structuring of factuality in languages like English, and is informed with a set of linguistic resources compiled from a data-driven approach. For evaluating De Facto, I created FactBank, a corpus annotated with factuality information. The interannotation agreement score for the task of assigning factuality values to events is kcohen = 0.81. Running De Facto against the gold standard results in F1=0.74 (macro-averaging), F1=0.85 (micro-averaging) and, in terms of interannotation agreement, kcohen =0.72.

[1]  Vasileios Hatzivassiloglou,et al.  Domain -independent detection, extraction, and labeling of Atomic Events , 2003 .

[2]  Gerda Hassler,et al.  7. Evidentiality and reported speech in Romance languages , 2002 .

[3]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[4]  James Pustejovsky,et al.  From structure to interpretation : A double-layered annotation for event factuality , 2008 .

[5]  John McCarthy Modality, Si! Modal Logic, No! , 1997, Stud Logica.

[6]  A. Kratzer The Notional Category of Modality , 2008 .

[7]  C. Condoravdi,et al.  Computing relative polarity for textual inference , 2006 .

[8]  Ralph Grishman,et al.  Message Understanding Conference- 6: A Brief History , 1996, COLING.

[9]  P. L. Peterson Fact Proposition Event , 1997 .

[10]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[11]  Randy J. LaPolla,et al.  Syntax: Structure, Meaning, and Function , 1999 .

[12]  Talmy Givón,et al.  The Binding Hierarchy and the Typology of Complements , 1980 .

[13]  J. van der Auwera,et al.  Modality’s semantic map , 1998 .

[14]  D. Biber,et al.  Styles of stance in English: Lexical and grammatical marking of evidentiality and affect , 1989 .

[15]  Helena Calsamiglia,et al.  Role and Position of Scientific Voices: Reported Speech in the Media , 2003 .

[16]  R. M. Hare Meaning and Speech Acts , 1970 .

[17]  Otto Jespersen,et al.  The Philosophy of Grammar , 1924 .

[18]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[19]  Carlota Smith,et al.  Modes of discourse , 2003 .

[20]  Dan I. Moldovan,et al.  A Semantic Approach to Recognizing Textual Entailment , 2005, HLT.

[21]  Dan Jurafsky,et al.  Automatic Extraction of Opinion Propositions and their Holders , 2004 .

[22]  R. Jakobson Shifters, Verbal Categories, and the Russian Verb , 1971 .

[23]  Peter Aldhous,et al.  Before and after , 2002, Nature.

[24]  Sabine Bergler,et al.  Evidential analysis of reported speech , 1992 .

[25]  F. Boas Handbook of American Indian languages. Part 1 , 1911 .

[26]  Alan Lee,et al.  Attribution and its annotation in the Penn Discourse TreeBank , 2006, Trait. Autom. des Langues.

[27]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[28]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[29]  Christopher D. Manning,et al.  Learning to recognize features of valid textual entailments , 2006, NAACL.

[30]  Gilbert Lazard,et al.  On the grammaticalization of evidentiality , 2001 .

[31]  Donald Nute,et al.  Counterfactuals , 1975, Notre Dame J. Formal Log..

[32]  K. Bach,et al.  Linguistic Communication and Speech Acts , 1983 .

[33]  Andrew Hickl,et al.  A Discourse Commitment-Based Framework for Recognizing Textual Entailment , 2007, ACL-PASCAL@ACL.

[34]  B. Geurts Presuppositions and Anaphors in Attitude Contexts , 1998 .

[35]  L. T. F. Gamut Logic, language, and meaning , 1991 .

[36]  Eduard Hovy,et al.  A question/answer typology with surface text patterns , 2002 .

[37]  James Pustejovsky,et al.  Annotating and Recognizing Event Modality in Text , 2006, FLAIRS.

[38]  Lauri Karttunen,et al.  Some observations on factivity , 1971 .

[39]  English Grammar,et al.  An English Grammar , 1904, Nature.

[40]  F. D. Haan The relation between modality and evidentiality , 2001 .

[41]  Noriko Kando,et al.  Certainty Identification in Texts: Categorization Model and Manual Tagging Results , 2023 .

[42]  T. Givón Evidentiality and Epistemic Space , 1982 .

[43]  Suzanne Eggins,et al.  An Introduction to Systemic Functional Linguistics , 1994 .

[44]  Linda R. Waugh Reported speech in journalistic discourse: The relation of function and text , 1995 .

[45]  Padmini Srinivasan,et al.  The Language of Bioscience: Facts, Speculations, and Statements In Between , 2004, HLT-NAACL 2004.

[46]  Gerald Gazdar,et al.  Pragmatics: Implicature, Presupposition, and Logical Form , 1978 .

[47]  James Pustejovsky,et al.  Temporal and Event Information in Natural Language Text , 2005, Lang. Resour. Evaluation.

[48]  I. Sag,et al.  Interrogative Investigations , 2001 .

[49]  Karl Erich Heidolph,et al.  Progress in linguistics : a collection of papers , 1970 .

[50]  R. L. Trask Key concepts in language and linguistics , 2000 .

[51]  P. Dendale,et al.  Introduction: evidentiality and related notions , 2001 .

[52]  James Pustejovsky,et al.  SlinkET: A Partial Modal Parser for Events , 2006, LREC.

[53]  Michael Halliday,et al.  An Introduction to Functional Grammar , 1985 .

[54]  A. Wierzbicka English Speech Act Verbs: A Semantic Dictionary , 1987 .

[55]  LAURI KARTTUNEN,et al.  PRESUPPOSITION AND LINGUISTIC CONTEXT , 1974 .

[56]  Martin M. Soubbotin,et al.  Use of Patterns for Detection of Likely Answer Strings: A Systematic Approach , 2002, TREC.

[57]  James Pustejovsky,et al.  Determining Modality and Factuality for Text Entailment , 2007, International Conference on Semantic Computing (ICSC 2007).

[58]  Frans Zwarts,et al.  Polarity, veridicality and temporal connectives , 1993 .

[59]  Terry Nadasdi,et al.  The expression of evidentiality in French-English bilingual discourse , 1999, Language in Society.

[60]  J. Coates The semantics of the modal auxiliaries , 1983 .

[61]  Irene Heim,et al.  Presupposition Projection and the Semantics of Attitude Verbs , 1992, J. Semant..

[62]  James Pustejovsky,et al.  Automating Temporal Annotation with TARSQI , 2005, ACL.

[63]  Arul Menezes,et al.  Effectively Using Syntax for Recognizing False Entailment , 2006, NAACL.

[64]  J. P. Thorne,et al.  The semantics of modal verbs , 1969, Journal of Linguistics.

[65]  Scott DeLancey,et al.  The mirative and evidentiality , 2001 .

[66]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[67]  Xavier Carreras,et al.  Introduction to the CoNLL-2005 Shared Task: Semantic Role Labeling , 2005, CoNLL.

[68]  James Pustejovsky,et al.  Evita: A Robust Event Recognizer For QA Systems , 2005, HLT.

[69]  James Pustejovsky,et al.  Determining Modality and Factuality for Text Entailment , 2007 .

[70]  J. Hooper On Assertive Predicates , 1975 .

[71]  Eduard Hovy,et al.  Identifying Opinion Holders for Question Answering in Opinion Texts , 2005 .

[72]  Ilana Mushin Evidentiality and epistemological stance , 2001 .

[73]  Steven Bethard,et al.  Finding event, temporal and causal structure in text: a machine learning approach , 2007 .

[74]  Daniel Dor,et al.  Representations, attitudes and factivity evaluations : an epistemically-based analysis of lexical selection , 1995 .

[75]  Patrick Dendale,et al.  Pouvoir, un marqueur d'évidentialité , 1994 .

[76]  J. Austin How to do things with words , 1962 .

[77]  F. Palmer,et al.  Mood and modality , 1986 .

[78]  Peter W Culicover,et al.  The Semantic Basis of Control in English , 2003 .

[79]  Charles N. Li,et al.  Direct speech and indirect speech: A functional study , 1986 .

[80]  G. P. Henderson,et al.  An Essay in Modal Logic. , 1953 .

[81]  木村 和夫 Pragmatics , 1997, Language Teaching.

[82]  Claire Cardie,et al.  Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns , 2005, HLT.

[83]  Rashmi Prasad,et al.  The Penn Discourse Treebank , 2004, LREC.

[84]  安平鎬,et al.  Evidentiality , 2018, A Grammar of Nganasan.

[85]  Robert Stalnaker,et al.  Presuppositions of Compound Sentences , 2008 .

[86]  Christopher D. Manning,et al.  Learning to distinguish valid textual entailments , 2006 .

[87]  James H. Martin,et al.  Identification of Event Mentions and their Semantic Class , 2006, EMNLP.

[88]  S. Thompson “Object complements” and conversation towards a realistic account , 2002 .

[89]  Toshiyuki Ogihara,et al.  Non-Factual "Before" and Adverbs of Quantification , 1995 .

[90]  T. Givón,et al.  English grammar : a function-based introduction , 1995 .

[91]  Orvokki Tellervo Heinämäki,et al.  Semantics of English temporal connectives , 1974 .

[92]  Susan T. Dumais,et al.  An Analysis of the AskMSR Question-Answering System , 2002, EMNLP.

[93]  Daniel G. Bobrow,et al.  Preventing existence , 2001, FOIS.

[94]  Wick R. Miller,et al.  Acoma grammar and texts , 1965 .

[95]  Barbara Di Eugenio,et al.  Squibs and Discussions: The Kappa Statistic: A Second Look , 2004, CL.

[96]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[97]  Max J. Cresswell,et al.  A New Introduction to Modal Logic , 1998 .

[98]  David I. Beaver,et al.  A Uniform Analysis of 'Before' and 'After' , 2003 .

[99]  Michael Glanzberg Felicity and Presupposition Triggers , 2003 .

[100]  Sabine Bergler,et al.  The Semantics of Collocational Patterns for Reporting Verbs , 1991, EACL.

[101]  Jean-Pierre Koenig,et al.  Sublexical Modality And The Structure Of Lexical Semantic Representations , 2001 .

[102]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[103]  Margaret Field,et al.  The role of factive predicates in the indexicalization of stance: A discourse perspective☆ , 1997 .

[104]  R. Huddleston Introduction to the Grammar of English: Verbs, nouns and adjectives: the boundaries between them , 1984 .

[105]  Barbara A. Fox Evidentiality: Authority, Responsibility, and Entitlement in English Conversation , 2008 .

[106]  Johanna Nichols,et al.  Evidentiality: The Linguistic Coding of Epistemology , 1986 .

[107]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[108]  Victoria L. Rubin Stating with Certainty or Stating with Doubt: Intercoder Reliability Results for Manual Annotation of Epistemically Modalized Statements , 2007, NAACL.

[109]  Martin M. Soubbotin,et al.  Use of Patterns for Detection of Answer Strings: a Systematic Approach Essentials of the Approach , 2022 .

[110]  Wayne H. Ward,et al.  Towards Robust Semantic Role Labeling , 2007, CL.

[111]  Sanda M. Harabagiu,et al.  An Answer Bank for Temporal Inference , 2006, LREC.

[112]  Laurence R. Horn,et al.  On the semantic properties of logical operators in english' reproduced by the indiana university lin , 1972 .

[113]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[114]  Laurence R. Horn A Natural History of Negation , 1989 .

[115]  Anaïd Donabédian,et al.  Towards a semasiological account of evidentials : An enunciative approach of -er in Modern Western Armenian , 2001 .

[116]  N. Cocchiarella,et al.  Situations and Attitudes. , 1986 .

[117]  Ray Cattell NEGATIVE TRANSPORTATION AND TAG QUESTIONS , 1973 .

[118]  Z. Vendler Linguistics in Philosophy , 1967 .

[119]  M. Ehrman THE MEANING OF THE MODALS IN PRESENT-DAY AMERICAN ENGLISH , 1966 .