Domain Adaptation for Automatic Detection of Speculative Sentences

The use of speculative, uncertain or vague sentences is common in both spoken and written language. Automatic detection of such sentences plays significant role in natural language processing and social sciences as it could enhance information extraction systems and lead to faster and more reliable analyses in social sciences. However, this problem has only been addressed in biomedical and encyclopedic domains. In this paper, we address automatic speculation detection (as a binary sentence classification task) in monetary policy domain, and for the first time, on the transcripts of spoken language. We build two new speculation detection datasets and a dictionary of speculation triggers using expert annotations, and benchmark the performance of automatic speculation detection systems in this new domain.

[1]  János Csirik,et al.  The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.

[2]  K. Hyland,et al.  Hedging in scientific research articles , 1998 .

[3]  Alex Cukierman,et al.  The Politics of Ambiguity , 1987 .

[4]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[5]  György Szarvas,et al.  Hedge Classification in Biomedical Texts with a Weakly Supervised Selection of Keywords , 2008, ACL.

[6]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[7]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[8]  Ellen Riloff,et al.  Learning subjective nouns using extraction pattern bootstrapping , 2003, CoNLL.

[9]  Maria Georgescul,et al.  A Hedgehop over a Max-Margin Framework Using Hedge Cues , 2010, CoNLL Shared Task.

[10]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[11]  Gary James Jason Hedging as a Fallacy of Language , 1988 .

[12]  Ted Briscoe,et al.  Weakly Supervised Learning for Hedge Classification in Scientific Literature , 2007, ACL.

[13]  Michael Strube,et al.  Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features , 2009, ACL.

[14]  Eileen Fitzpatrick,et al.  Verification and Implementation of Language-Based Deception Indicators in Civil and Criminal Narratives , 2008, COLING.

[15]  Padmini Srinivasan,et al.  The Language of Bioscience: Facts, Speculations, and Statements In Between , 2004, HLT-NAACL 2004.

[16]  Adam Meirowitz,et al.  Informational Party Primaries and Strategic Ambiguity , 2005 .

[17]  D. Stasavage,et al.  Does it Pay to Be Transparent? International Evidence from Central Bank Forecasts , 2001 .

[18]  Georg Vanberg,et al.  The Value of Vagueness: Delegation, Defiance, and Judicial Opinions , 2008 .

[19]  K. Hyland,et al.  Hedging in academic writing and EAF textbooks , 1994 .