Identifying Important Life Events from Twitter Using Semantic and Syntactic Patterns

Identifying global events from social media has been the focus of much research in recent years. However, the identification of personal life events poses new requirements and challenges that have received relatively little research attention. In this paper we explore a new approach for life event identification, where we expand social media posts into both semantic, and syntactic networks of content. Frequent graph patterns are mined from these networks and used as features to enrich life-event classifiers. Results show that our approach significantly outperforms the best performing baseline in accuracy (by 4.48% points) and F-measure (by 4.54% points) when used to identify five major life events identified from the psychology literature: Getting Married, Having Children, Death of a Parent, Starting School, and Falling in Love. In addition, our results show that, while semantic graphs are effective at discriminating the theme of the post (e.g. the topic of marriage), syntactic graphs help identify whether the post describes a personal event (e.g. someone getting married).

[1]  Nick E. Green,et al.  Detecting Life Events in Feeds from Twitter , 2013, 2013 IEEE Seventh International Conference on Semantic Computing.

[2]  Paul Mulholland,et al.  Identifying Prominent Life Events on Twitter , 2015, K-CAP.

[3]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[4]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[5]  Mohammad Al Hasan,et al.  FSM-H: Frequent Subgraph Mining Algorithm in Hadoop , 2014, 2014 IEEE International Congress on Big Data.

[6]  Harith Alani,et al.  Personal Life Event Detection from Social Media , 2014, HT.

[7]  Shirui Pan,et al.  Finding the best not the most: regularized loss minimization subgraph selection for graph classification , 2015, Pattern Recognit..

[8]  Lawrence B. Holder,et al.  Substucture Discovery in the SUBDUE System , 1994, KDD Workshop.

[9]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[10]  Claire Cardie,et al.  Major Life Event Extraction from Twitter based on Congratulations/Condolences Speech Acts , 2014, EMNLP.

[11]  D. Rubin,et al.  Age Effects in Cultural Life Scripts. , 2011, Applied cognitive psychology.

[12]  Wei Wang,et al.  Graph classification based on pattern co-occurrence , 2009, CIKM.

[13]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[14]  Brendan T. O'Connor,et al.  Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters , 2013, NAACL.

[15]  Matthew Hurst,et al.  Event Detection and Tracking in Social Streams , 2009, ICWSM.

[16]  Takashi Washio,et al.  Pruning Strategies Based on the Upper Bound of Information Gain for Discriminative Subgraph Mining , 2009, PKAW.

[17]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[18]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[19]  Raphaël Troncy,et al.  Using social media to identify events , 2011, WSM '11.

[20]  Philip S. Yu,et al.  Mining Brain Networks Using Multiple Side Views for Neurological Disorder Identification , 2015, 2015 IEEE International Conference on Data Mining.

[21]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[22]  Charles L. Wayne Topic detection and tracking in English and Chinese , 2000, IRAL '00.