Semantic Management of Streaming Data

One of the fundamental challenges facing the unprecedented data deluge produced by the sensor networks is how to manage time-series streaming data so that they can be reasoning-ready and provenance-aware. Semantic web technology shows great promise but lacks adequate support for the notion of time. We present a system for the representation, indexing and querying of time-series data, especially streaming data, using the semantic web approach. This system incorporates a special RDF vocabulary and a semantic interpretation for time relationships. The resulting framework, which we refer to as Time-Annotated RDF, provides basic functionality for the representation and querying of time-related data. The capabilities of Time-Annotated RDF were implemented as a suite of Java APIs on top of Tupelo, a semantic content management middleware, to provide transparent integration among heterogeneous data, as present in streams and other data sources, and their metadata. We show how this system supports commonly used time-related queries using Time-Annotated SPARQL introduced in this paper as well as an analysis of the TA-RDF data model. Such prototype system has already seen successful usage in a virtual sensor project where near-real-time radar data streams need to be fetched, indexed, processed and re-published as new virtual sensor streams.

[1]  Amit P. Sheth,et al.  Analyzing theme, space, and time: an ontology-based approach , 2006, GIS '06.

[2]  Tony Fountain,et al.  The Open Source DataTurbine Initiative: Streaming Data Middleware for Environmental Observing Systems , 2009 .

[3]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[4]  Andre Bolles,et al.  Streaming SPARQL - Extending SPARQL to Process Data Streams , 2008, ESWC.

[5]  V. S. Subrahmanian,et al.  Scaling RDF with Time , 2008, WWW.

[6]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[7]  Jennifer Widom,et al.  Continuous queries over data streams , 2001, SGMD.

[8]  I. Arpinar,et al.  ES 3 N : A Semantic Approach to Data Management in Sensor Networks , 2006 .

[9]  Mark H. Hansen,et al.  Sensor-Internet Share and Search — Enabling Collaboration of Citizen Scientists , 2007 .

[10]  Amit P. Sheth,et al.  An Ontological Representation of Time Series Observations on the Semantic Sensor Web , 2009 .

[11]  S. Buraga,et al.  A RDF-based model for expressing spatio-temporal relations between Web sites , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002..

[12]  Claudio Gutiérrez,et al.  Introducing Time into RDF , 2007, IEEE Transactions on Knowledge and Data Engineering.

[13]  Xiaowen Wu,et al.  A new framework for on-demand virtualization, repurposing and fusion of heterogeneous sensors , 2009, 2009 International Symposium on Collaborative Technologies and Systems.

[14]  M. Yuan,et al.  A Typology of Spatiotemporal Information Queries , 2002 .

[15]  James D. Myers,et al.  Virtual Sensors in a Web 2.0 Virtual Watershed , 2008, 2008 IEEE Fourth International Conference on eScience.