Tourist behavior analysis through geotagged photographies: A method to identify the country of origin

Much information can be extracted from geotagged photographies posted on online image databases like Flickr or Panoramio. Recent works have demonstrated that some treatment of this data can provide a good estimation of tourism behavior. Tourism represents today and for several years an important factor in the regional economy. Understanding and analyzing the tourist behavior corresponds to a significant demand from institutions. For this purpose, many studies have been launched. Many specialists of tourism need to separate tourists according to their place of residence. In the context of two projects supported by territorial collectivities, this paper introduces a new paradigm to estimate photographer's country of residence. Each user will be described by his photographic timeline. This timeline allows to compute intermediate properties: travel time at a destination, number of trips, number of visited countries... This generation of symbolic data is essential and allows to synthesize the richness of the timeline in front of the recognition task to achieve. Classification algorithms will then be introduced, some sets with experts of science of tourism, others using data clustering and supervised learning techniques. We compared these methods for two distinct questions: firstly we classify photographers into two categories (French/non-French for example); secondly we find the country of residence of each user. It demonstrates that, using learning algorithms or expert-defined rules permits to identify users residence efficiently. We are thus able to meet the request of experts in tourism and refine even more the analysis of tourist behavior.

[1]  Mor Naaman,et al.  How flickr helps us make sense of the world: context and content in community-contributed media collections , 2007, ACM Multimedia.

[2]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[3]  Noel B. Salazar Imaged or Imagined? Cultural Representations and the ‘Tourismification’ of Peoples and Places , 2007 .

[4]  Katrin Grünfeld Integrating spatio-temporal information in environmental monitoring data--a visualization approach applied to moss data. , 2005, The Science of the total environment.

[5]  Vassilis Kostakos,et al.  Instrumenting the City: Developing Methods for Observing and Understanding the Digital Cityscape , 2006, UbiComp.

[6]  Mor Naaman,et al.  Generating summaries and visualization for large collections of geo-referenced photographs , 2006, MIR '06.

[7]  Silvia Sussmann,et al.  Does nationality affect tourist behavior , 1995 .

[8]  R. Chalfen Photography's role in tourism; some unexplored relationships. , 1979 .

[9]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[10]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[11]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[12]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[13]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .