Web Usage Mining in Tourism - A Query Term Analysis and Clustering Approach

According to current research, one of the most promising applications for web usage mining (WUM) is in identifying homogenous user subgroups (Liu, 2008). This paper presents a prototypical workflow and tools for analyzing user sessions to extract business intelligence hidden in web log data. By considering a leading Swedish destination gateway, we demonstrate how query term analysis in combination with session clustering can be utilized to effectively explore the information needs of website users. The system thus overcomes many of the limitations of typical web site analysis tools that only offer general statistics and ignore the opportunities offered by unsupervised learning techniques.

[1]  A. Tjoa,et al.  Information and Communication Technologies in Tourism , 1996, Springer Vienna.

[2]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[3]  Wilfried Grossmann,et al.  Using Data Mining in Analysing Local Tourism Patterns , 1999 .

[4]  Dimitrios Buhalis,et al.  Information and Communication Technologies in Tourism 1999 , 1999 .

[5]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.

[6]  Charles F. Hofacker,et al.  Website-generated market-research data Tracing the tracks left behind by visitors , 2001 .

[7]  Ignacio Olmeda,et al.  Data Mining Techniques and Applications for Tourism Internet Marketing , 2002 .

[8]  Vincent Cho,et al.  Towards Using Knowledge Discovery Techniques in Database Marketing for the Tourism Industry , 2002 .

[9]  Rick L. Andrews,et al.  A Comparison of Segment Retention Criteria for Finite Mixture Logit Models , 2003 .

[10]  Arno Scharl,et al.  An Integrated Approach to Measure Web Site Effectiveness in the European Hotel Industry , 2003, J. Inf. Technol. Tour..

[11]  Francesco Ricci,et al.  E-commerce and tourism , 2004, CACM.

[12]  Abhijit Biswas,et al.  The Internet's impact on marketing: Introduction to the JBR special issue on "Marketing on the web--behavioral, strategy and practices and public policy" , 2004 .

[13]  Amit Bhatnagar,et al.  Segmenting consumers based on the benefits and risks of Internet shopping , 2004 .

[14]  A. Frew Information and Communication Technologies in Tourism 2005 , 2005 .

[15]  Donald K. Wedding,et al.  Discovering Knowledge in Data, an Introduction to Data Mining , 2005, Inf. Process. Manag..

[16]  Jamie Murphy,et al.  Using Log File Analysis and Website Assessment to Improve Hospitality Websites , 2005, ENTER.

[17]  Masahito Yamamoto,et al.  Automatic Classification of Websites based on Keyword Extraction of Nouns , 2006, ENTER.

[18]  Jamie Murphy,et al.  Information and Communication Technologies in Tourism 2006 , 2006 .

[19]  José G. Dias,et al.  Latent class modeling of website users’ search patterns: Implications for online market segmentation , 2007 .

[20]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[21]  Olfa Nasraoui,et al.  Web data mining: exploring hyperlinks, contents, and usage data , 2008, SKDD.

[22]  Karl Wöber,et al.  A Comparative Study of Information Needs of City Travelers in Europe , 2008, J. Inf. Technol. Tour..

[23]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[24]  Olfa Nasraoui,et al.  Web Usage Mining , 2011 .

[25]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[26]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.