Topic mining of tourist attractions based on a seasonal context aware LDA model

With the rise of personalized travel recommendation in recent years, automatic analysis and summary of the tourist attraction is of great importance in decision making for both tourists and tour operators. To this end, many probabilistic topic models have been proposed for feature extraction of tourist attraction. However, existing state-of-the-art probabilistic topic models overlook the fact that tourist attractions tend to have distinct characteristics with respect to specific seasonal context. In this article, we contribute the innovative idea of using seasonal contextual information to refine the characteristics of tourist attractions. Along this line, we first propose STLDA, a season topic model based on latent Dirichlet allocation which can capture meaningful topics corresponding to various seasonal contexts for each attraction. Then, an inference algorithm using Gibbs sampling is put forward to learn the model parameters of our proposed model. In order to verify the effectiveness of STLDA model, we present a detailed experimental study using collected real-world textual data of tourist attractions. The experimental analysis results show that the superiority of STLDA over the basic LDA model in providing a representative and comprehensive summarization related to each tourist attraction. More importantly, it has great significance for improving the level of personalized attraction recommendation.

[1]  Antonio Moreno,et al.  Intelligent tourism recommender systems: A survey , 2014, Expert Syst. Appl..

[2]  David B. Dunson,et al.  Probabilistic topic models , 2012, Commun. ACM.

[3]  Hsin-Min Lu,et al.  Detecting short-term cyclical topic dynamics in the user-generated content and news , 2015, Decis. Support Syst..

[4]  Yan-Ying Chen,et al.  Travel Recommendation by Mining People Attributes and Travel Group Types From Community-Contributed Photos , 2013, IEEE Transactions on Multimedia.

[5]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[6]  Kuan-Yu Chen,et al.  Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling , 2007, IEEE Transactions on Knowledge and Data Engineering.

[7]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[8]  Olatz Arbelaitz,et al.  Web usage and content mining to extract knowledge for modelling the users of the Bidasoa Turismo website and to adapt it , 2013, Expert Syst. Appl..

[9]  Tao Mei,et al.  Travel Recommendation via Author Topic Model Based Collaborative Filtering , 2015, MMM.

[10]  Ryota Tomioka,et al.  Discovering Emerging Topics in Social Streams via Link-Anomaly Detection , 2014, IEEE Transactions on Knowledge and Data Engineering.

[11]  A. Leask Progress in visitor attraction research: Towards more effective management , 2010 .

[12]  Hui Xiong,et al.  A Cocktail Approach for Travel Package Recommendation , 2014, IEEE Transactions on Knowledge and Data Engineering.

[13]  Thomas L. Griffiths,et al.  The Author-Topic Model for Authors and Documents , 2004, UAI.

[14]  Lise Getoor,et al.  Topic Modeling for Wikipedia Link Disambiguation , 2014, ACM Trans. Inf. Syst..

[15]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[16]  Stefan M. Rüger,et al.  Weakly Supervised Joint Sentiment-Topic Detection from Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[17]  Ling Chen,et al.  LCARS , 2014, ACM Trans. Inf. Syst..

[18]  Seyed Reza Shahamiri,et al.  A systematic review of scholar context-aware recommender systems , 2015, Expert Syst. Appl..

[19]  Helena Strömberg,et al.  Trying on change – Trialability as a change moderator for sustainable travel behaviour , 2016 .

[20]  Charalampos Konstantopoulos,et al.  Mobile recommender systems in tourism , 2014, J. Netw. Comput. Appl..

[21]  Jiang-Ming Yang,et al.  Generating location overviews with images and tags by mining user-generated travelogues , 2009, ACM Multimedia.

[22]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[23]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[24]  Richard Butler,et al.  Seasonality in tourism: issues and implications , 1998 .

[25]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[26]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[27]  Michael J. Paul,et al.  A Two-Dimensional Topic-Aspect Model for Discovering Multi-Faceted Topics , 2010, AAAI.

[28]  Xinbo Gao,et al.  Attraction recommendation: Towards personalized tourism via collective intelligence , 2016, Neurocomputing.

[29]  Changhu Wang,et al.  Equip tourists with knowledge mined from travelogues , 2010, WWW '10.

[30]  Yan Liu,et al.  Topic-link LDA: joint models of topic and author community , 2009, ICML '09.

[31]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[32]  Tanji Hu,et al.  Summarizing tourist destinations by mining user-generated travelogues and photos , 2011, Comput. Vis. Image Underst..

[33]  Li Zhang,et al.  Intelligent facial emotion recognition and semantic-based topic detection for a humanoid robot , 2013, Expert Syst. Appl..

[34]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[35]  Ching-Hsue Cheng,et al.  Recommendation system for popular tourist attractions in Taiwan using Delphi panel and repertory grid techniques. , 2015 .

[36]  Byung-Won On,et al.  LDA topics: Representation and evaluation , 2015, J. Inf. Sci..

[37]  Wei Wang,et al.  Recommender system application developments: A survey , 2015, Decis. Support Syst..

[38]  J. Mixter Fast , 2012 .

[39]  Konstantin Vorontsov,et al.  Additive regularization of topic models , 2015, Machine Learning.

[40]  R. Glynn,et al.  The Wilcoxon Signed Rank Test for Paired Comparisons of Clustered Data , 2006, Biometrics.

[41]  Jonathan Corcoran,et al.  Examining the spatial–temporal dynamics of bus passenger travel behaviour using smart card data and the flow-comap , 2014 .

[42]  Younghoon Kim,et al.  TWILITE: A recommendation system for Twitter using a probabilistic model based on latent Dirichlet allocation , 2014, Inf. Syst..

[43]  Ping Jiang,et al.  Displacement prediction of landslide based on generalized regression neural networks with K-fold cross-validation , 2016, Neurocomputing.

[44]  Juan-Zi Li,et al.  Knowledge discovery through directed probabilistic topic models: a survey , 2010, Frontiers of Computer Science in China.

[45]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[46]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[47]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Nargis Pervin,et al.  Fast, Scalable, and Context-Sensitive Detection of Trending Topics in Microblog Post Streams , 2013, TMIS.