Information Retrieval Technology

The extensive use of social media for communication has made it a sought-after resource for tasks that depend on human behavior, such as gauging product popularity and public polling, and more recently for public health surveillance of lifestyle-associated diseases and mental health. In this paper, we use Twitter data to detect pregnancy cases and use tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer more comprehensive health care in real time. Using a Twitter-based corpus, we developed an ensemble network representation model that combines a Long Short-Term Memory (LSTM) recurrent neural network (RNN) with a convolutional neural network (CNN) to learn legitimate pregnancy cases discussed online. These ensemble representations were used to train an SVM classifier, which achieves an F1-score of 95% in predicting pregnancy accounts discussed in tweets. We further investigate the words most commonly associated with the physical disease symptom class ‘Distress’ and the negative emotion class ‘Annoyed’. Results from our sentiment analysis study are encouraging, identifying more accurate triggers for pregnancy sentiment classes.
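For readers unfamiliar with the reported metric, the F1-score is the harmonic mean of precision and recall on the positive class. The sketch below is only an illustration of that definition: the function name `f1_score`, the label encoding (1 for a genuine pregnancy account, 0 otherwise), and the toy labels are all assumptions made here, not data or code from the paper.

```python
def f1_score(y_true, y_pred, positive=1):
    """F1 for the positive class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels: 1 = genuine pregnancy account, 0 = any other tweet.
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 1, 0, 0, 1, 1, 0, 1]
score = f1_score(y_true, y_pred)
print(round(score, 3))
```

In this toy case there are 4 true positives, 1 false positive, and 1 false negative, so precision and recall are both 0.8. Because F1 ignores true negatives, it is a common choice when the positive class is rare among the collected tweets.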
