Natural Language Processing for Social Media

This book presents the state-of-the-art in research and empirical studies in the field of Natural Language Processing (NLP) for the semantic analysis of social media data. Over the past few years, online social networking sites have revolutionized the way we communicate with individuals, groups and communities, and altered everyday practices. The unprecedented volume and variety of user-generated content and the user interaction network constitute new opportunities for understanding social behavior and building socially intelligent systems. Much research work on social networks and the mining of the social web is based on graph theory. That is apt because a social structure is made up of a set of social actors and a set of the dyadic ties between these actors. We believe that the graph-mining methods for structure, information diffusion or influence spread in social networks needs to combined with the content analysis of social media. This provides the opportunity for new applications that use the information publicly available as a result of social interactions. The intended audience of this book is researchers who are interested in developing tools and applications for automatic analysis of social media texts. We assume that the readers have basic knowledge in the area of natural language processing and machine learning. This book will help the readers better understand computational linguistics and social media analysis, in particular text-mining techniques and NLP applications (such as summarization, localization detection, sentiment and emotion analysis, topic detection and machine translation) designed specifically for social media texts.

[1]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[2]  Joel D. Martin,et al.  Sentiment, emotion, purpose, and style in electoral tweets , 2015, Inf. Process. Manag..

[3]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[4]  Saif Mohammad,et al.  Using Hashtags to Capture Fine Emotion Categories from Tweets , 2015, Comput. Intell..

[5]  Fatiha Sadat,et al.  Automatic Identification of Arabic Language Varieties and Dialects in Social Media , 2014, SocialNLP@COLING.

[6]  S. Ozdemir,et al.  Analysis of the relation between Turkish twitter messages and stock market index , 2012, 2012 6th International Conference on Application of Information and Communication Technologies (AICT).

[7]  Wen-Tai Hsieh,et al.  Predicting TV Audience Rating with Social Media , 2013, SocialNLP@IJCNLP.

[8]  Adrian Popescu,et al.  Mining User Home Location and Gender from Flickr Tags , 2010, ICWSM.

[9]  Mike Thelwall,et al.  Sentiment in Twitter events , 2011, J. Assoc. Inf. Sci. Technol..

[10]  Laura Elisabeth Jehl,et al.  Machine Translation for Twitter , 2010 .

[11]  Timothy Baldwin,et al.  langid.py: An Off-the-shelf Language Identification Tool , 2012, ACL.

[12]  Stavros Valsamidis,et al.  A Framework for Opinion Mining in Blogs for Agriculture , 2013 .

[13]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[14]  Philip J. Stone,et al.  The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information , 2007 .

[15]  José M. Molina López,et al.  Combining Machine Learning Techniques and Natural Language Processing to Infer Emotions Using Spanish Twitter Corpus , 2013, PAAMS.

[16]  Cyrus Shahabi,et al.  MediaQ: mobile multimedia management system , 2014, MMSys '14.

[17]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[18]  James Caverlee,et al.  Summarizing User-Contributed Comments , 2011, ICWSM.

[19]  Kazutoshi Sumiya,et al.  Measuring geographical regularities of crowd behaviors for Twitter-based geo-social event detection , 2010, LBSN '10.

[20]  Richard Colbaugh,et al.  Estimating sentiment orientation in social media for intelligence monitoring and analysis , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[21]  Paul Barford,et al.  A Learning-Based Approach for IP Geolocation , 2010, PAM.

[22]  Georgios Zervas,et al.  Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud , 2015, Manag. Sci..

[23]  Timothy Baldwin,et al.  Text-Based Twitter User Geolocation Prediction , 2014, J. Artif. Intell. Res..

[24]  Bo Pang,et al.  The effect of wording on message propagation: Topic- and author-controlled natural experiments on Twitter , 2014, ACL.

[25]  Jason Baldridge,et al.  Hierarchical Discriminative Classification for Text-Based Geolocation , 2014, EMNLP.

[26]  Chris Callison-Burch,et al.  Arabic Dialect Identification , 2014, CL.

[27]  Ralf D. Brown,et al.  Selecting and Weighting N-Grams to Identify 1100 Languages , 2013, TSD.

[28]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[29]  Kareem Darwish,et al.  Using Twitter to Collect a Multi-Dialectal Corpus of Arabic , 2014, ANLP@EMNLP.

[30]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[31]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[32]  Xiaojie Yuan,et al.  Exploiting Social Media for Stock Market Prediction with Factorization Machine , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[33]  Jens Grossklags,et al.  An online experiment of privacy authorization dialogues for social applications , 2013, CSCW.

[34]  Brendan T. O'Connor,et al.  TweetMotif: Exploratory Search and Topic Summarization for Twitter , 2010, ICWSM.

[35]  Fred Popowich,et al.  Opinion Polarity Identification through Adjectives , 2010, ArXiv.

[36]  Fernando Diaz,et al.  Extracting information nuggets from disaster- Related messages in social media , 2013, ISCRAM.

[37]  Haofen Wang,et al.  Towards Effective Event Detection, Tracking and Summarization on Microblog Data , 2011, WAIM.

[38]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[39]  Wenyi Huang,et al.  Inferring nationalities of Twitter users and studying inter-national linking , 2014, HT.

[40]  M. de Rijke,et al.  Credibility-inspired ranking for blog post retrieval , 2012, Information Retrieval.

[41]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[42]  Kalina Bontcheva,et al.  Where's @wally?: a classification approach to geolocating users based on their social ties , 2013, HT.

[43]  Timothy Baldwin,et al.  Automatic Identification of Locative Expressions from Social Media Text: A Comparative Analysis , 2014, LocWeb '14.

[44]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[45]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[46]  Hila Becker,et al.  Identifying content for planned events across social media sites , 2012, WSDM '12.

[47]  Sheila Kinsella,et al.  "I'm eating a sandwich in Glasgow": modeling locations with tweets , 2011, SMUC '11.

[48]  Shannon Vallor,et al.  Social Networking and Ethics , 2012 .

[49]  T. Murata,et al.  Breaking News Detection and Tracking in Twitter , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[50]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[51]  Nanyun Peng,et al.  Learning Polylingual Topic Models from Code-Switched Social Media Documents , 2014, ACL.

[52]  Timothy Baldwin,et al.  Accurate Language Identification of Twitter Messages , 2014 .

[53]  Michael J. Paul,et al.  Summarizing Contrastive Viewpoints in Opinionated Text , 2010, EMNLP.

[54]  Gerhard Weikum,et al.  KORE: keyphrase overlap relatedness for entity disambiguation , 2012, CIKM.

[55]  Ming Zhou,et al.  Joint Inference of Named Entity Recognition and Normalization for Tweets , 2012, ACL.

[56]  Uwe F. Mayer Bootstrapped language identification for multi-site internet domains , 2012, KDD.

[57]  Chris Callison-Burch,et al.  Machine Translation of Arabic Dialects , 2012, NAACL.

[58]  Alan R. Dennis,et al.  Trading on Twitter: The Financial Information Content of Emotion in Social Media , 2014, 2014 47th Hawaii International Conference on System Sciences.

[59]  Eric N. Forsyth Improving automated lexical and discourse analysis of online chat dialog , 2007 .

[60]  Liang Zhou,et al.  On the Summarization of Dynamically Introduced Information: Online Discussions and Blogs , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[61]  Svitlana Volkova,et al.  Inferring User Political Preferences from Streaming Communications , 2014, ACL.

[62]  Kalina Bontcheva,et al.  Microblog-genre noise and impact on semantic annotation accuracy , 2013, HT.

[63]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[64]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[65]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[66]  Peter Ford Dominey,et al.  Learning to talk about events from narrated video in a construction grammar framework , 2005, Artif. Intell..

[67]  Ana-Maria Popescu,et al.  Detecting controversial events from twitter , 2010, CIKM.

[68]  Kate Starbird,et al.  Designing for the deluge: understanding & supporting the distributed, collaborative work of crisis volunteers , 2014, CSCW.

[69]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[70]  J. Mikels,et al.  Characterization of the Affective Norms for English Words by discrete emotional categories , 2007, Behavior research methods.

[71]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[72]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[73]  Li Shang,et al.  ETree: Effective and Efficient Event Modeling for Real-Time Online Social Media Networks , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[74]  Elizabeth D. Liddy,et al.  Discerning Emotions in Texts , 2004, AAAI 2004.

[75]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[76]  Eduard H. Hovy,et al.  Structured Event Retrieval over Microblog Archives , 2012, NAACL.

[77]  P. Ekman An argument for basic emotions , 1992 .

[78]  Hendrik Blockeel,et al.  SCCQL: A constraint-based clustering system , 2013 .

[79]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[80]  Eduard H. Hovy,et al.  Weakly Supervised User Profile Extraction from Twitter , 2014, ACL.

[81]  Chien Chin Chen,et al.  TSCAN: a novel method for topic summarization and content anatomy , 2008, SIGIR '08.

[82]  David Yarowsky,et al.  Hierarchical Bayesian Models for Latent Attribute Detection in Social Media , 2011, ICWSM.

[83]  Cristian Danescu-Niculescu-Mizil,et al.  Brighter than Gold: Figurative Language in User Generated Comparisons , 2014, EMNLP.

[84]  Bu-Sung Lee,et al.  TwiNER: named entity recognition in targeted twitter stream , 2012, SIGIR '12.

[85]  Zheng Chen,et al.  Study of Stock Prediction Based on Social Network , 2013, 2013 International Conference on Social Computing.

[86]  Steve Uhlig,et al.  IP geolocation databases: unreliable? , 2011, CCRV.

[87]  Alexander Porshnev,et al.  Machine Learning in Prediction of Stock Market Indicators Based on Historical Data and Data from Twitter Sentiment Analysis , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[88]  Danah Boyd,et al.  Social Network Sites: Definition, History, and Scholarship , 2007, J. Comput. Mediat. Commun..

[89]  Ed H. Chi,et al.  Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles , 2011, CHI.

[90]  Kamalakar Karlapalem,et al.  TEA: Episode Analytics on Short Messages , 2014, #MSM.

[91]  Stephan Vogel,et al.  Language Model Adaptation for Statistical Machine Translation via Structured Query Models , 2004, COLING.

[92]  Dale Schuurmans,et al.  Combining Naive Bayes and n-Gram Language Models for Text Classification , 2003, ECIR.

[93]  A. Culotta,et al.  Using County Demographics to Infer Attributes of Twitter Users , 2014 .

[94]  David Ratcliffe,et al.  Finding Fires with Twitter , 2013, ALTA.

[95]  Defeng Guo,et al.  Enhanced stock prediction using social network and statistical model , 2014, 2014 IEEE Workshop on Advanced Research and Technology in Industry Applications (WARTIA).

[96]  Gabriel Doyle,et al.  Mapping Dialectal Variation by Querying Social Media , 2014, EACL.

[97]  Liam Peyton,et al.  Personal Health Information Leak Prevention in Heterogeneous Texts , 2009 .

[98]  Benoît Sagot,et al.  The French Social Media Bank: a Treebank of Noisy User Generated Content , 2012, COLING.

[99]  Theodoros Tzouramanis,et al.  A robust gender inference model for online social networks and its application to LinkedIn and Twitter , 2014, First Monday.

[100]  Ellen Riloff,et al.  Sarcasm as Contrast between a Positive Sentiment and Negative Situation , 2013, EMNLP.

[101]  Hui Lin,et al.  Graph-based submodular selection for extractive summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[102]  Sung-Hyon Myaeng,et al.  A hybrid mood classification approach for blog text , 2006 .

[103]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[104]  Arjun Mukherjee,et al.  Detecting Campaign Promoters on Twitter Using Markov Random Fields , 2014, 2014 IEEE International Conference on Data Mining.

[105]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[106]  Diana Inkpen,et al.  A hierarchical approach to mood classification in blogs , 2011, Natural Language Engineering.

[107]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[108]  Jie Yin,et al.  Using Social Media to Enhance Emergency Situation Awareness , 2012, IEEE Intelligent Systems.

[109]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[110]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[111]  Sabine Buchholz,et al.  Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[112]  Noah A. Smith,et al.  A Dependency Parser for Tweets , 2014, EMNLP.

[113]  Cecilia Ovesdotter Alm,et al.  Emotions from Text: Machine Learning for Text-based Emotion Prediction , 2005, HLT.

[114]  Robert G. Capra,et al.  Factors mediating disclosure in social network sites , 2011, Comput. Hum. Behav..

[115]  Yitong Li,et al.  Graph-Based Multi-Tweet Summarization using Social Signals , 2012, COLING.

[116]  Xiaohua Hu,et al.  Text Mining the Biomedical Literature for Identification of Potential Virus/Bacterium as Bio-Terrorism Weapons , 2008, Terrorism Informatics.

[117]  Lluís F. Hurtado,et al.  Political Tendency Identification in Twitter using Sentiment Analysis Techniques , 2014, COLING.

[118]  Johan Bos,et al.  Predicting the 2011 Dutch Senate Election Results with Twitter , 2012 .

[119]  Michael C. Frank,et al.  A Robust Framework for Estimating Linguistic Alignment in Twitter Conversations , 2016, WWW.

[120]  Ee-Peng Lim,et al.  Comments-oriented blog summarization by sentence extraction , 2007, CIKM '07.

[121]  Geoffrey I. Webb,et al.  PRICAI 2006: Trends in Artificial Intelligence, 9th Pacific Rim International Conference on Artificial Intelligence, Guilin, China, August 7-11, 2006, Proceedings , 2006, PRICAI.

[122]  Aleem Hossain,et al.  Crowded: a crowd-sourced perspective of events as they happen , 2013, Defense, Security, and Sensing.

[123]  Julio Gonzalo,et al.  Towards real-time summarization of scheduled events from twitter streams , 2012, HT '12.

[124]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[125]  Diana Inkpen,et al.  Textual risk mining for maritime situational awareness , 2014, 2014 IEEE International Inter-Disciplinary Conference on Cognitive Methods in Situation Awareness and Decision Support (CogSIMA).

[126]  Timothy Baldwin,et al.  Classifying Dialogue Acts in One-on-One Live Chats , 2010, EMNLP.

[127]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[128]  ChengXiang Zhai,et al.  Generating comparative summaries of contradictory opinions in text , 2009, CIKM.

[129]  Peter Szolovits,et al.  Evaluating the state-of-the-art in automatic de-identification. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[130]  Deepa Paranjpe,et al.  Learning document aboutness from implicit user feedback and document structure , 2009, CIKM.

[131]  Claire Cardie,et al.  Sentiment analysis on evolving social streams: how self-report imbalances can help , 2014, WSDM.

[132]  Hila Becker,et al.  Hip and trendy: Characterizing emerging trends on Twitter , 2011, J. Assoc. Inf. Sci. Technol..

[133]  Patrick Paroubek,et al.  Twitter Based System: Using Twitter for Disambiguating Sentiment Ambiguous Adjectives , 2010, *SEMEVAL.

[134]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[135]  Diana Inkpen,et al.  General Topic Annotation in Social Networks: A Latent Dirichlet Allocation Approach , 2013, Canadian Conference on AI.

[136]  Robert Power,et al.  A sensitive Twitter earthquake detector , 2013, WWW.

[137]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[138]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[139]  Brendan T. O'Connor,et al.  Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters , 2013, NAACL.

[140]  Nathanael Chambers,et al.  Learning for Microblogs with Distant Supervision: Political Forecasting with Twitter , 2012, EACL.

[141]  Xun Wang,et al.  Update Summarization using a Multi-level Hierarchical Dirichlet Process Model , 2012, COLING.

[142]  Henry A. Kautz,et al.  Modeling the impact of lifestyle on health at scale , 2013, WSDM.

[143]  Yiming Yang,et al.  Topic-conditioned novelty detection , 2002, KDD.

[144]  Fred Popowich,et al.  Domain Adaptation Techniques for Machine Translation and Their Evaluation in a Real-World Setting , 2012, Canadian Conference on AI.

[145]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[146]  Aron Culotta,et al.  Inferring the origin locations of tweets with quantitative confidence , 2013, CSCW.

[147]  Mitsuru Ishizuka,et al.  Compositionality Principle in Recognition of Fine-Grained Emotions from Text , 2009, ICWSM.

[148]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[149]  Shamima Mithun,et al.  Exploiting Rhetorical Relations in Blog Summarization , 2010, Canadian Conference on AI.

[150]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..

[151]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[152]  Alan Ritter,et al.  Unsupervised Modeling of Twitter Conversations , 2010, NAACL.

[153]  Xiaodi Zhu,et al.  An Empirical Study of the Financial Community Network on Twitter , 2013 .

[154]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[155]  Fatiha Sadat,et al.  Automatic identification of arabic dialects in social media , 2014, SoMeRA@SIGIR.

[156]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[157]  Susan T. Dumais,et al.  Similarity Measures for Short Segments of Text , 2007, ECIR.

[158]  Marilyn A. Walker,et al.  Really? Well. Apparently Bootstrapping Improves the Performance of Sarcasm and Nastiness Classifiers for Online Dialogue , 2013, ArXiv.

[159]  Jugal K. Kalita,et al.  Experiments in Microblog Summarization , 2010, 2010 IEEE Second International Conference on Social Computing.

[160]  Claire Grover,et al.  Re-using an Argument Corpus to Aid in the Curation of Social Media Collections , 2014, LREC.

[161]  Amit Kumar Agrawal,et al.  Sentiment Analysis of Wimbledon Tweets , 2014, #MSM.

[162]  Vincent Ng,et al.  Vote Prediction on Comments in Social Polls , 2014, EMNLP.

[163]  David Yarowsky,et al.  Classifying latent user attributes in twitter , 2010, SMUC '10.

[164]  Stan Szpakowicz,et al.  Identifying Expressions of Emotion in Text , 2007, TSD.

[165]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[166]  R. Plutchik,et al.  Emotion: Theory, Research, and Experience. Vol. 1. Theories of Emotion , 1981 .

[167]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[168]  Jacob Eisenstein Systematic patterning in phonologically‐motivated orthographic variation , 2015 .

[169]  Ramesh Nallapati,et al.  Event threading within news topics , 2004, CIKM '04.

[170]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[171]  Carlo Strapparava,et al.  SemEval-2007 Task 14: Affective Text , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[172]  Claire Cardie,et al.  Major Life Event Extraction from Twitter based on Congratulations/Condolences Speech Acts , 2014, EMNLP.

[173]  Rahma Sellami,et al.  Collaboratively Constructed Linguistic Resources for Language Variants and their Exploitation in NLP Application - the case of Tunisian Arabic and the Social Media , 2014, LG-LP@COLING.

[174]  Jeffrey Nichols,et al.  Home Location Identification of Twitter Users , 2014, TIST.

[175]  Ning Zhou,et al.  A Hybrid Probabilistic Model for Unified Collaborative and Content-Based Image Tagging , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[176]  Dirk Thorleuchter,et al.  Protecting research and technology from espionage , 2013, Expert Syst. Appl..

[177]  Jon Oberlander,et al.  Whose Thumb Is It Anyway? Classifying Author Personality from Weblog Text , 2006, ACL.

[178]  Lars Backstrom,et al.  Find me if you can: improving geographical prediction with social and spatial proximity , 2010, WWW '10.

[179]  Yaser Al-Onaizan,et al.  Improved Sentence-Level Arabic Dialect Classification , 2014, VarDial@COLING.

[180]  Mirella Lapata,et al.  Tweet Recommendation with Graph Co-Ranking , 2012, ACL.

[181]  Dieter Fox,et al.  Bayesian Filtering for Location Estimation , 2003, IEEE Pervasive Comput..

[182]  Jacob Eisenstein,et al.  Phonological Factors in Social Media Writing , 2013 .

[183]  Ani Nahapetian,et al.  Tweet analysis for user health monitoring , 2014, 2014 4th International Conference on Wireless Mobile Communication and Healthcare - Transforming Healthcare Through Innovations in Mobile and Wireless Technologies (MOBIHEALTH).

[184]  Trevor Cohn,et al.  Predicting and Characterising User Impact on Twitter , 2014, EACL.

[185]  Lilja Øvrelid,et al.  Lexical Categories for Improved Parsing of Web Data , 2012, COLING.

[186]  Hailiang Chen,et al.  Sentiment revealed in social media and its effect on the stock market , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[187]  Diana Inkpen,et al.  Prior and contextual emotion of words in sentential context , 2014, Comput. Speech Lang..

[188]  Victoria Bobicev,et al.  Learning Sentiments from Tweets with Personal Health Information , 2012, Canadian Conference on AI.

[189]  Ana-Maria Popescu,et al.  Extracting events and event descriptions from Twitter , 2011, WWW.

[190]  Mike Thelwall,et al.  Twitter, MySpace, Digg: Unsupervised Sentiment Analysis in Social Media , 2012, TIST.

[191]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[192]  Edward N. Zalta,et al.  Stanford encyclopedia of philosophy: a dynamic reference work , 2001, JCDL '01.

[193]  Craig MacDonald,et al.  On choosing an effective automatic evaluation metric for microblog summarisation , 2014, IIiX.

[194]  M. de Rijke,et al.  Incorporating Query Expansion and Quality Indicators in Searching Microblog Posts , 2011, ECIR.

[195]  Uta Lösch,et al.  Mapping microblog posts to encyclopedia articles , 2011, GI-Jahrestagung.

[196]  M. de Rijke,et al.  Adding semantics to microblog posts , 2012, WSDM '12.

[197]  Li Bing,et al.  Public Sentiment Analysis in Twitter Data for Prediction of a Company's Stock Price Movements , 2014, 2014 IEEE 11th International Conference on e-Business Engineering.

[198]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[199]  Vincent Martin Predicting the French Stock Market Using Social Media Analysis , 2013, SMAP.

[200]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[201]  Susan T. Dumais,et al.  Mark my words!: linguistic style accommodation in social media , 2011, WWW.

[202]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[203]  Markus Dickinson,et al.  Does Size Matter? Text and Grammar Revision for Parsing Social Media Data , 2013 .

[204]  Michael S. Bernstein,et al.  Twitinfo: aggregating and visualizing microblogs for event exploration , 2011, CHI.

[205]  Farnoush Banaei-Kashani,et al.  GeoDec: A framework to visualize and query geospatial data for decision-making , 2010, IEEE MultiMedia.

[206]  Xerxes P. Kotval,et al.  Visualization of entities within social media: Toward understanding users' needs , 2013, Bell Labs Technical Journal.