Mining checkins from location-sharing services for client-independent IP geolocation

Accurately determining the geographic location of an Internet host is important for location-aware applications such as location-based advertising and network diagnostics. Despite their fast response time, widely used database-driven geolocation approaches provide only inaccurate locations. Delay measurement based approaches improve the estimation accuracy but still suffer from a limited precision (about 10 km) and a long response time (tens of seconds) to localize a single PC, which cannot meet the demand of precise and real-time geolocation for location-aware applications. In this paper, we propose a new geolocation approach, Checkin-Geo, which exploits geolocation resources fundamentally different from existing database-driven (using DNS, Whois, etc.) or network delay measurement based approaches. In particular, we leverage the location data that users are willing to share in location-sharing services and logs of user logins from PCs for real-time and accurate geolocation. Experimental results show that compared to existing geolocation techniques, Checkin-Geo achieves 1) a median estimation error of 799 meters (an order of magnitude smaller than existing approaches), and 2) a negligible response time, which are promising for accurate location-aware applications.

[1]  David Wetherall,et al.  Towards IP geolocation using delay and topology measurements , 2006, IMC '06.

[2]  Kent Larson,et al.  Using a Live-In Laboratory for Ubiquitous Computing Research , 2006, Pervasive.

[3]  Chun-Ta Lu,et al.  A Framework of Mining Semantic Regions from Trajectories , 2011, DASFAA.

[4]  Roger M. Needham,et al.  Denial of service , 1993, CCS '93.

[5]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[6]  Emin Gün Sirer,et al.  Octant: A Comprehensive Framework for the Geolocalization of Internet Hosts , 2007, NSDI.

[7]  D. Defays,et al.  An Efficient Algorithm for a Complete Link Method , 1977, Comput. J..

[8]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[9]  Helen J. Wang,et al.  Mining the Web and the Internet for Accurate IP Address Geolocations , 2009, IEEE INFOCOM 2009.

[10]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[11]  Aleksandar Kuzmanovic,et al.  Towards Street-Level Client-Independent IP Geolocation , 2011, NSDI.

[12]  Margaret Martonosi,et al.  Identifying Important Places in People's Lives from Cellular Network Data , 2011, Pervasive.

[13]  Steve Uhlig,et al.  IP geolocation databases: unreliable? , 2011, CCRV.

[14]  Lakshminarayanan Subramanian,et al.  An investigation of geographic mapping techniques for internet hosts , 2001, SIGCOMM.

[15]  Naranker Dulay,et al.  Predicting human behaviour from selected mobile phone data points , 2010, UbiComp.

[16]  Nick Feamster,et al.  Geographic locality of IP prefixes , 2005, IMC '05.

[17]  Serge Fdida,et al.  Constraint-Based Geolocation of Internet Hosts , 2004, IEEE/ACM Transactions on Networking.

[18]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[19]  Martín Abadi,et al.  How to tell an airport from a home: techniques and applications , 2010, Hotnets-IX.

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.