A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic

A corpus of micro-blogs on the 2011 UK referendum on the Alternative Vote has been analysed jointly by text miners and social scientists. To facilitate the collaboration, the corpus and its analysis are managed in a Web-accessible framework that allows users to upload their own textual data for analysis and to manage the text annotation resources used in that analysis. The framework also allows annotations to be searched and the analysis to be re-run after the analysis resources have been amended. The corpus is also doubly human-annotated, recording for each tweet both whether its overall sentiment is positive or negative and whether it is for or against the referendum proposition.
