What is disputed on the web?

We present a method for automatically acquiring a corpus of disputed claims from the web. We consider a factual claim to be disputed if a page on the web suggests both that the claim is false and that other people say it is true. Our tool extracts disputed claims by searching the web for patterns such as "falsely claimed that X" and then using a statistical classifier to select text that appears to be making a disputed claim. We argue that such a corpus of disputed claims is useful for a wide range of applications related to information credibility on the web, and we report what our current corpus reveals about what is being disputed on the web.
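
The pattern-matching step described above can be illustrated with a minimal sketch. It is not the authors' tool: only the pattern "falsely claimed that" comes from the text, the other patterns are hypothetical, the function names are invented for illustration, and a crude word-count filter stands in for the statistical classifier.

```python
import re

# Only "falsely claimed that" appears in the abstract; the other
# prefixes are hypothetical examples of dispute-signalling patterns.
DISPUTE_PATTERNS = [
    "falsely claimed that",
    "the myth that",
    "the misconception that",
]

# Match a dispute pattern followed by the claim text X, where X is
# taken to run up to the next sentence-ending punctuation mark.
_PATTERN_RE = re.compile(
    r"\b(?P<pattern>"
    + "|".join(re.escape(p) for p in DISPUTE_PATTERNS)
    + r")\s+(?P<claim>[^.!?]+)",
    re.IGNORECASE,
)


def extract_candidates(text):
    """Return (pattern, claim) pairs for every pattern match in text."""
    return [
        (m.group("pattern").lower(), m.group("claim").strip())
        for m in _PATTERN_RE.finditer(text)
    ]


def looks_like_claim(claim, min_words=3):
    """Stand-in for the statistical classifier (assumption):
    keep only candidates with at least min_words words."""
    return len(claim.split()) >= min_words


if __name__ == "__main__":
    page = "The senator falsely claimed that the bill raises taxes on everyone."
    for pattern, claim in extract_candidates(page):
        if looks_like_claim(claim):
            print(pattern, "->", claim)
```

In a fuller pipeline, `looks_like_claim` would be replaced by a trained classifier that decides whether the text following the pattern is in fact a disputed claim.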
