We describe a method and system design for improved data discovery in an integrated network of open geospatial data that supports collaborative policy development between governments and local constituents. Metadata about civic data (such as thematic categories, user-generated tags, geo-references, or attribute schemata) primarily rely on technical vocabularies that reflect scientific or organizational hierarchies. By contrast, public consumers of data often search for information using colloquial terminology that does not align with official metadata vocabularies. For example, citizens searching for data about bicycle collisions in an area are unlikely to use the search terms with which organizations like Departments of Transportation describe relevant data. Users may also search with broad terms, such as “traffic safety”, and will then not discover data tagged with narrower official terms, such as “vehicular crash”. This mismatch raises the question of how to bridge the users’ ways of talking and searching with the language of technical metadata. In similar situations, it has been beneficial to augment official metadata with semantic annotations that expand the discoverability and relevance recommendations of data, supporting more inclusive access. Adopting this strategy, we develop a method for automated semantic annotation, which aggregates similar thematic and geographic information. A novelty of our approach is the development and application of a crosscutting base vocabulary that supports the description of geospatial themes. The resulting annotation method is integrated into a novel open access collaboration platform (Esri’s ArcGIS Hub) that supports public dissemination of civic data and is in use by thousands of government agencies. Our semantic annotation method improves data discovery for users across organizational repositories and has the potential to facilitate the coordination of community and organizational work, improving the transparency and efficacy of government policies. 2012 ACM Subject Classification Information systems → Digital libraries and archives
[1]
Marijn Janssen,et al.
Open data policies, their implementation and impact: A framework for comparison
,
2014,
Gov. Inf. Q..
[2]
Amit P. Sheth,et al.
Semantic Modelling of Smart City Data
,
2014
.
[3]
George A. Miller,et al.
WordNet: A Lexical Database for English
,
1995,
HLT.
[4]
Tony H. Grubesic,et al.
Geographic Information, Maps, and GIS
,
2016
.
[5]
Yaser A. Bishr,et al.
Overcoming the Semantic and Other Barriers to GIS Interoperability
,
1998,
Int. J. Geogr. Inf. Sci..
[6]
R. Kitchin.
The real-time city? Big data and smart urbanism
,
2013
.
[7]
Mark Jensen,et al.
The UNEP Ontologies and the OBO Foundry
,
2016,
ICBO/BioCreative.
[8]
Sean Bechhofer,et al.
Research Objects: Towards Exchange and Reuse of Digital Knowledge
,
2010
.
[9]
Christophe Debruyne,et al.
Serving Ireland's Geospatial Information as Linked Data
,
2016,
International Semantic Web Conference.
[10]
Dennis Nicholson.
The Intellectual Foundation of Information Organization
,
2003
.
[11]
James A. Hendler,et al.
The Semantic Web" in Scientific American
,
2001
.
[12]
Werner Kuhn,et al.
Spatial discovery and the research library
,
2016,
Trans. GIS.
[13]
Matthew S. Mayernik,et al.
Research data and metadata curation as institutional issues
,
2016,
J. Assoc. Inf. Sci. Technol..