Discovering geographic locations in web pages using urban addresses

This paper presents an approach that helps to discover geographic locations from the recognition, extraction, and geocoding of urban addresses found in Web pages. Experiments that evaluate the presence and incidence of urban addresses in Web pages are described. Experimental results, based on a collection of over 4 million documents from the Brazilian Web, show the feasibility and effectiveness of the proposed method.

[1]  Berthier A. Ribeiro-Neto,et al.  A brief survey of web data extraction tools , 2002, SGMD.

[2]  Mário J. Silva,et al.  Adding geographic scopes to web resources , 2006, Comput. Environ. Urban Syst..

[3]  Alberto H. F. Laender,et al.  Semantic Expansion of Geographic Web Queries Based on Natural Language Positioning Expressions , 2007, Trans. GIS.

[4]  Alberto H. F. Laender,et al.  The role of gazetteers in geographic knowledge discovery on the Web , 2005, Third Latin American Web Congress (LA-WEB'2005).

[5]  Kevin S. McCurley,et al.  Geospatial mapping and navigation of the web , 2001, WWW '01.

[6]  Mark Sanderson,et al.  Spatio-textual Indexing for Geographical Search on the Web , 2005, SSTD.

[7]  M. Sanderson,et al.  Analyzing geographic queries , 2004 .

[8]  Xing Xie,et al.  Detecting geographic locations from web resources , 2005, GIR '05.

[9]  Marty Himmelstein Local Search: The Internet Is the Yellow Pages , 2005, Computer.

[10]  Claudia Bauzer Medeiros,et al.  The Web as a Data Source for Spatial Databases , 2003, GeoInfo.

[11]  Ray R. Larson,et al.  Geographic information retrieval and spatial browsing , 1996 .

[12]  David W. Embley,et al.  Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages , 1999, Data Knowl. Eng..

[13]  Dan Wu,et al.  On assigning place names to geography related web pages , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[14]  Robert Weibel,et al.  Spatial information retrieval and geographical ontologies an overview of the SPIRIT project , 2002, SIGIR '02.

[15]  Alia I. Abdelmoty,et al.  Building a Geographical Ontology for Intelligent Spatial Search on the Web , 2005, Databases and Applications.

[16]  Frederico T. Fonseca,et al.  Assessing the Certainty of Locations Produced by an Address Geocoding System , 2007, GeoInformatica.

[17]  Ron Sivan,et al.  Web-a-where: geotagging web content , 2004, SIGIR '04.