Get out the vote: Determining support or opposition from Congressional floor-debate transcripts

We investigate whether one can determine from the transcripts of U.S. Congressional floor debates whether the speeches represent support of or opposition to proposed legislation. To address this problem, we exploit the fact that these speeches occur as part of a discussion; this allows us to use sources of information regarding relationships between discourse segments, such as whether a given utterance indicates agreement with the opinion expressed by another. We find that the incorporation of such information yields substantial improvements over classifying speeches in isolation.

[1]  Janyce Wiebe,et al.  A Computational Theory of Perspective and Reference in Narrative , 1988, ACL.

[2]  Marti A. Hearst Direction-based text interpretation as an information access refinement , 1992 .

[3]  Warren Sack,et al.  On the Computation of Point of View , 1994, AAAI.

[4]  Janyce Wiebe,et al.  Tracking Point of View in Narrative , 1994, Comput. Linguistics.

[5]  Steven S. Smith,et al.  The American Congress , 1995 .

[6]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[7]  The Theory and Practice of Discourse Parsing and Summarization , 2000 .

[8]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[9]  Avrim Blum,et al.  Learning from Labeled and Unlabeled Data using Graph Mincuts , 2001, ICML.

[10]  Ben Taskar,et al.  Learning Probabilistic Models of Relational Structure , 2001, ICML.

[11]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[12]  Peter D. Turney Thumbs Up, Thumbs Down , 2013, Journal of Cell Science.

[13]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[14]  Marc Moens,et al.  Articles Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status , 2002, CL.

[15]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[16]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[17]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[18]  Ben Taskar,et al.  Learning Probabilistic Models of Link Structure , 2003, J. Mach. Learn. Res..

[19]  Walter Daelemans,et al.  Evaluation of Machine Learning Methods for Natural Language Processing Tasks , 2002, LREC.

[20]  Ben Taskar,et al.  Max-Margin Markov Networks , 2003, NIPS.

[21]  Mari Ostendorf,et al.  Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data , 2003, NAACL.

[22]  Ramakrishnan Srikant,et al.  Mining newsgroups using networks arising from social behavior , 2003, WWW '03.

[23]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[24]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[25]  M. Laver,et al.  Extracting Policy Positions from Political Texts Using Words as Data , 2003, American Political Science Review.

[26]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[27]  Julia Hirschberg,et al.  Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies , 2004, ACL.

[28]  Miles James Efron Cultural Orientation: Classifying Subjective Documents by Cociation Analysis , 2004, AAAI Technical Report.

[29]  Ben Taskar,et al.  Learning associative Markov networks , 2004, ICML.

[30]  Gregory Grefenstette,et al.  Coupling Niche Browsers and Affect Analysis for an Opinion Mining Application , 2004, RIAO.

[31]  Andrew McCallum,et al.  Conditional Models of Identity Uncertainty with Application to Noun Coreference , 2004, NIPS.

[32]  Alekh Agarwal Sentiment Analysis : A New Approach for Effective Use of Linguistic Knowledge and Exploiting Similarities in a Set of Documents to be Classified . , 2005 .

[33]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[34]  Jenefer Robinson A Sentimental Education , 2005 .

[35]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[36]  William W. Cohen,et al.  On the collective classification of email "speech acts" , 2005, SIGIR '05.

[37]  Mirella Lapata,et al.  Collective Content Selection for Concept-to-Text Generation , 2005, HLT.

[38]  Claire Cardie,et al.  Optimizing to Arbitrary NLP Metrics using Ensemble Selection , 2005, HLT.

[39]  James P. Callan,et al.  Language processing technologies for electronic rulemaking: a project highlight , 2005, DG.O.

[40]  Grace Hui Yang,et al.  Near-duplicate detection for eRulemaking , 2005, DG.O.

[41]  Namhee Kwon,et al.  Multidimensional text analysis for eRulemaking , 2006, DG.O.

[42]  Xiaojin Zhu,et al.  Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .

[43]  Dustin Hillard,et al.  Automated classification of congressional legislation , 2006, DG.O.

[44]  Rob Malouf,et al.  A Preliminary Investigation into Sentiment Analysis of Informal Political Discourse , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[45]  Claire Cardie,et al.  Using natural language processing to improve eRulemaking: project highlight , 2006, DG.O.