Predicting the Occurrence of Life Events from User's Tweet History

This paper addresses the new problem of predicting the occurrence of Twitter user's life events using word occurrence tendencies in user's tweet histories. Many previous studies have addressed public event prediction and life event extraction on Twitter. Most of the methods for these two problems use tweets that explicitly refer to event occurrence. However, users who will experience a life event are unlikely to post tweets that reference the occurrence of the life event explicitly; thus, existing methods to find tweets that refer to event occurrence explicitly are not applicable to life event prediction. Therefore, we assume that users who will experience a life event tend to post tweets that refer to life event occurrence implicitly and propose a method to identify such a tendency to predict life events. First, we extract users who experienced a specific life event and collect their past tweets to identify features that implicitly indicate the life event occurrence. We use the word occurrence tendency in such tweets as training data to construct a life event prediction model. We chose five life events, that is, "Giving birth," "Getting a job offer," "Leaving the hospital," "Pregnancy," and "Marriage," and assessed the prediction performance of the proposed method for each event. Experimental results demonstrate that the proposed method outperformed a baseline method for all selected life events except "Leaving the hospital" and achieved the highest prediction accuracy for "Giving birth." We suppose that this event resulted in the highest prediction accuracy because all users who gave birth experienced pregnancy and common features appeared in their tweets over a long period.

[1]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[2]  Kira Radinsky,et al.  Learning causality for news events prediction , 2012, WWW.

[3]  Claire Cardie,et al.  Major Life Event Extraction from Twitter based on Congratulations/Condolences Speech Acts , 2014, EMNLP.

[4]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[5]  Adam Jatowt,et al.  Product Purchase Prediction Based on Time Series Data Analysis in Social Media , 2015, 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT).

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Antal van den Bosch,et al.  Estimating Time to Event from Tweets Using Temporal Expressions , 2014 .

[8]  Harith Alani,et al.  Personal Life Event Detection from Social Media , 2014, HT.

[9]  Claire Cardie,et al.  Early Stage Influenza Detection from Twitter , 2013, ArXiv.

[10]  Paul Mulholland,et al.  Identifying Prominent Life Events on Twitter , 2015, K-CAP.

[11]  Adam Jatowt,et al.  ChronoSeeker: search engine for future and past events , 2010, ICUIMC '10.

[12]  Tat-Seng Chua,et al.  From Tweets to Wellness: Wellness Event Detection from Twitter Streams , 2016, AAAI.

[13]  Tat-Seng Chua,et al.  What Does Social Media Say about Your Stress? , 2016, IJCAI.

[14]  F. A. Kunneman,et al.  Predicting time-to-event from Twitter messages , 2013 .

[15]  Hila Becker,et al.  Identifying content for planned events across social media sites , 2012, WSDM '12.