Semantic similarity analysis of user-generated content for theme-based route planning

Several technical approaches to a touristic tour planning, which connect popular points and routes of interest or provide locations related to specific themes, have been published in recent years. Hereby, points of interest are found and evaluated on the basis of user-generated web content. However, no approach exists to the author's knowledge, which allows truly individual theme route planning. Individual means, that a user flexibly defines start point and destination and receives an optimised route, which will guide him through a townscape/landscape with most interesting features being situated along the proposed way. We introduce two methods to find such an individual theme route based on user-generated content. The basis for both methods is the determination of semantic similarity between a selected Wikipedia concept (e.g. a specific architectural style) and other geo-referenced Wikipedia concepts (e.g. a building). The first method has been termed the continuum method: it uses semantic similarity measures together with a density distribution from theme-related, geo-tagged photos in the web, in order to create a continuous ‘surface of attractiveness’. Such a conceptual continuum can – together with the static geometric length of network features – form the basis of an assignment of impedance values to a navigation graph. The second method has been termed the spot sequence method: it models the theme route as a specific version of the travelling salesman problem. A route is composed by sequentially adding visit points to a navigation graph from the start to the end point. Priorities are derived from the ranked semantic similarity values. The achieved results have been compared and evaluated on a basis of a user survey.

[1]  Changhu Wang,et al.  Equip tourists with knowledge mined from travelogues , 2010, WWW '10.

[2]  Ross Purves,et al.  Exploring place through user-generated content: Using Flickr tags to describe city cores , 2010, J. Spatial Inf. Sci..

[3]  Carsten Keßler,et al.  Bottom-Up Gazetteers: Learning from the Implicit Semantics of Geotags , 2009, GeoS.

[4]  Gerhard Weber,et al.  RouteCheckr: personalized multicriteria routing for mobility impaired pedestrians , 2008, Assets '08.

[5]  Tanji Hu,et al.  Summarizing tourist destinations by mining user-generated travelogues and photos , 2011, Comput. Vis. Image Underst..

[7]  L. Puczkó,et al.  Trailing Goethe, Humbert and Ulysses: cultural routes in tourism , 2007 .

[8]  L. Tiina Sarjakoski,et al.  Variable-Scale Map For Small-display Cartography , 2002 .

[9]  Xing Xie,et al.  On theme location discovery for travelogue services , 2011, SIGIR.

[10]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[11]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[12]  Saral Jain,et al.  Antourage: mining distance-constrained trips from flickr , 2010, WWW '10.

[13]  Benjamin Adams,et al.  Inferring Thematic Places from Spatially Referenced Natural Language Descriptions , 2013 .

[14]  Adrian Popescu,et al.  Mining tourist information from user-supplied collections , 2009, CIKM.

[15]  Atsuyuki Okabe,et al.  A kernel density estimation method for networks, its computational method and a GIS‐based tool , 2009, Int. J. Geogr. Inf. Sci..

[16]  Changhu Wang,et al.  Photo2Trip: generating travel routes from geo-tagged photos for trip planning , 2010, ACM Multimedia.

[17]  Jérôme Gensel,et al.  Content zooming and exploration for mobile maps , 2012, GIScience 2012.

[18]  Darren Gergle,et al.  The tower of Babel meets web 2.0: user-generated content and its applications in a multilingual context , 2010, CHI.

[19]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[20]  Chei Sian Lee,et al.  Determining Services for the Mobile Tourist , 2010, J. Comput. Inf. Syst..

[21]  Jochen Schaab,et al.  Automated Footprint Generation from Geotags with Kernel Density Estimation and Support Vector Machines , 2009, Spatial Cogn. Comput..

[22]  H. Holone,et al.  Users Are Doing It For Themselves: Pedestrian Navigation With User Generated Content , 2007, The 2007 International Conference on Next Generation Mobile Applications, Services and Technologies (NGMAST 2007).

[23]  Simone Paolo Ponzetto,et al.  WikiRelate! Computing Semantic Relatedness Using Wikipedia , 2006, AAAI.

[24]  Takahiro Hara,et al.  Semantic relatedness measurement based on Wikipedia link co-occurrence analysis , 2011, Int. J. Web Inf. Syst..

[25]  Herman Arnold Engelbrecht,et al.  Measuring Conceptual Similarity by Spreading Activation over Wikipedia's Hyperlink Structure , 2010, PWNLP@COLING.

[26]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[27]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[28]  Jinyoung Kim,et al.  TripTip: a trip planning service with tag-based recommendation , 2009, CHI Extended Abstracts.

[29]  Eneko Agirre,et al.  WikiWalk: Random walks on Wikipedia for Semantic Relatedness , 2009, Graph-based Methods for Natural Language Processing.