An ensemble classification approach for prediction of user’s next location based on Twitter data

The study presents a machine learning approach to predict a user’s next location of visit based on their previous tweets and personality traits. The future behavior of the people is reflected by their profile and past behaviors; this motivates the work presented in this study. Nowadays, people are fond of sharing their experiences related to travel, a visit to a restaurants or hotel or some historic places etc. on the social media platforms. Twitter, Instagram and Facebook are one of the popular social media platforms where this sharing of information can be found. This study used Twitter data for the analysis and to develop a model using machine learning (ML) to predict a user’s next visiting location. ML enables a computer to learn from the historical records and use this knowledge for prediction and decision making for new data. Prediction accuracy is one of the important and required parameter for any prediction model. If prediction model does not provide good prediction accuracy then the model cannot be accepted as reliable. Considering this phenomena, this study used an ensemble classification approach (ESA) to develop a prediction model for the problem under study. ESA trains different classifiers on the same data and use voting method to select the most accurate prediction. This study used ESA for first time to perform such type of study and results reveal that ESA certainly enhances the prediction accuracy of the model which is highly desirable.

[1]  Durga Toshniwal,et al.  Prediction of places of visit using tweets , 2016, Knowledge and Information Systems.

[2]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[3]  Henry A. Kautz,et al.  nEmesis: Which Restaurants Should You Avoid Today? , 2013, HCOMP.

[4]  Krishna P. Gummadi,et al.  Inferring user interests in the Twitter social network , 2014, RecSys '14.

[5]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[6]  Jiawei Han,et al.  Geographical topic discovery and comparison , 2011, WWW.

[7]  Jeffrey Nichols,et al.  Recommending targeted strangers from whom to solicit information on social media , 2013, IUI '13.

[8]  Michael E. Theologou,et al.  Providing recommendations on location-based social networks , 2016, J. Ambient Intell. Humaniz. Comput..

[9]  Qi Gao,et al.  Twitter-Based User Modeling for News Recommendations , 2013, IJCAI.

[10]  Aristides Gionis,et al.  From chatter to headlines: harnessing the real-time web for personalized news recommendation , 2012, WSDM '12.

[11]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[12]  Jeffrey Nichols,et al.  Who Will Retweet This? Detecting Strangers from Twitter to Retweet Information , 2015, ACM Trans. Intell. Syst. Technol..

[13]  Susan Gauch,et al.  Personalized News Recommendation Using Twitter , 2013, 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[14]  Sachin Kumar,et al.  Severity analysis of powered two wheeler traffic accidents in Uttarakhand, India , 2017 .

[15]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[16]  Mudhakar Srivatsa,et al.  When twitter meets foursquare: tweet location prediction using foursquare , 2014, MobiQuitous.

[17]  Toly Chen,et al.  An innovative fuzzy and artificial neural network approach for forecasting yield under an uncertain learning environment , 2018, J. Ambient Intell. Humaniz. Comput..

[18]  Padhraic Smyth,et al.  Modeling human location data with mixtures of kernel densities , 2014, KDD.

[19]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[20]  Jan Pedersen,et al.  Inferring User Interests From Microblogs , 2014 .

[21]  Scott A. Golder,et al.  Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures , 2011 .

[22]  He Ma,et al.  A positive and unlabeled learning framework based on extreme learning machine for drug-drug interactions discovery , 2018, Journal of Ambient Intelligence and Humanized Computing.

[23]  Bruno Martins,et al.  Predicting future locations with hidden Markov models , 2012, UbiComp.

[24]  Eric S Walsh,et al.  A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system , 2017, PloS one.

[25]  Cindy K. Chung,et al.  The development and psychometric properties of LIWC2007 , 2007 .

[26]  Durga Toshniwal,et al.  Predicting future place of visit using user's personality profile , 2016, 2016 International Conference on Computational Techniques in Information and Communication Technologies (ICCTICT).

[27]  Margaret L. Kern,et al.  Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[28]  Jennifer Golbeck,et al.  Predicting Personality from Twitter , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[29]  Chris Tseng,et al.  Classifying twitter data with Naïve Bayes Classifier , 2012, 2012 IEEE International Conference on Granular Computing.

[30]  Jeffrey Nichols,et al.  Understanding individuals' personal values from social media word use , 2014, CSCW.