Acquisition of Knowledge with Time Information from Twitter

In this paper, we propose a knowledge acquisition method for non-task-oriented dialogue systems. Such dialogue systems need a wide variety of knowledge for generating appropriate and sophisticated responses. However, constructing such knowledge is costly. To solve this problem, we focus on a relation about each tweet and the posted time. First, we extract event words, such as verbs, from tweets. Second, we generate frequency distribution for five different time divisions: e.g., a monthly basis. Then, we remove burst words on the basis of variance for obtaining refined distributions. We checked high ranked words in each time division. As a result, we obtained not only common sense things such as “sleep” in night but also interesting activities such as “recruit” in April and May (April is the beginning of the recruitment process for the new year in Japan.) and “raise the spirits/plow into” around 9 AM for inspiring oneself at the beginning of his/her work of the day. In addition, the knowledge that our method extracts probably contributes to not only dialogue systems but also text mining and behavior analysis of data on social media and so on.

[1]  Joelle Pineau,et al.  A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[2]  Daisuke Kawahara,et al.  Design of Word Association Games using Dialog Systems for Acquisition of Word Association Knowledge , 2016, AKBC@NAACL-HLT.

[3]  Antoine Bordes,et al.  Training Millions of Personalized Dialogue Agents , 2018, EMNLP.

[4]  Gerhard Weikum,et al.  Knowlywood: Mining Activity Knowledge From Hollywood Narratives , 2015, CIKM.

[5]  James Pustejovsky,et al.  FactBank: a corpus annotated with event factuality , 2009, Lang. Resour. Evaluation.

[6]  Yotaro Watanabe,et al.  Is a 204 cm Man Tall or Small ? Acquisition of Numerical Common Sense from the Web , 2013, ACL.

[7]  Tatsuya Kawahara,et al.  Conversational system for information navigation based on POMDP with user focus tracking , 2015, Comput. Speech Lang..

[8]  Wenlin Yao,et al.  Temporal Event Knowledge Acquisition via Identifying Narratives , 2018, ACL.

[9]  Jianfeng Gao,et al.  Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models , 2017, IJCNLP.

[10]  Ming Zhou,et al.  EventWiki: A Knowledge Base of Major Events , 2018, LREC.

[11]  Joelle Pineau,et al.  Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus , 2017, Dialogue Discourse.

[12]  Erik Cambria,et al.  Augmenting End-to-End Dialogue Systems With Commonsense Knowledge , 2018, AAAI.

[13]  Joelle Pineau,et al.  The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems , 2015, SIGDIAL Conference.

[14]  Ryuichiro Higashinaka,et al.  What Information Should a Dialogue System Understand?: Collection and Analysis of Perceived Information in Chat-Oriented Dialogue , 2018, IWSDS.

[15]  Kazutaka Shimada,et al.  Trivia Score and Ranking Estimation Using Support Vector Regression and RankNet , 2018, PACLIC.

[16]  Daisuke Kawahara,et al.  Large-Scale Acquisition of Commonsense Knowledge via a Quiz Game on a Dialogue System , 2016 .

[17]  Oliver Lemon,et al.  Alana v2: Entertaining and Informative Open-domain Social Dialogue using Ontologies and Entity Linking , 2018 .

[18]  Jianfeng Gao,et al.  A Persona-Based Neural Conversation Model , 2016, ACL.

[19]  Ryuichiro Higashinaka,et al.  On the difficulty of improving hand-crafted rules in chat-oriented dialogue systems , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[20]  Masaru Kitsuregawa,et al.  Modeling Situations in Neural Chat Bots , 2017, ACL.

[21]  Christopher Potts,et al.  Did It Happen? The Pragmatic Complexity of Veridicality Assessment , 2012, CL.