A bottom‐up design for spatial search in large networks and clouds

Networked systems, such as telecom networks and cloud infrastructures, generate and hold vast amounts of conguration and operational data. The goal of this work is to make all this data available through a real-time search process named network search , which will enable new real-time management solutions. The thesis contains several contributions towards engineering a network search system. Key elements of our design are a weakly structured information model that includes spatial properties, a query language that supports location- and schema-oblivious search queries, a peer-to-peer architecture, a set of echo protocols for scalable query processing, and an indexing protocol for ecient routing for spatial queries. The data against which network search is performed is maintained in local real-time databases close to the data sources. The design follows a bottom-up approach in the sense that the topology for query routing is constructed from the underlying network topology. We have built a prototype of the system on a cloud testbed and developed applications that use network search functionality. Testbed measurements suggest that it is feasible to engineer a network search system that processes queries at low latency and low overhead and that can scale to 100'000 nodes. Simulation results for spatial queries show that query processing achieves response times and incurs overhead close to an optimal protocol, and that query result remains accurate under signicant churn.

[1]  Rolf Stadler,et al.  Protocols for Distributed Embedded Management , 2013 .

[2]  Márk Jelasity,et al.  PeerSim: A scalable P2P simulator , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[3]  Rolf Stadler,et al.  A query language for network search , 2013, 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013).

[4]  Amos Israeli,et al.  Self-stabilization of dynamic systems assuming only read/write atomicity , 1990, PODC '90.

[5]  Beng Chin Ooi,et al.  VBI-Tree: A Peer-to-Peer Framework for Supporting Multi-Dimensional Indexing Schemes , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[6]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[7]  Rolf Stadler,et al.  Scalable matching and ranking for network search , 2013, Proceedings of the 9th International Conference on Network and Service Management (CNSM 2013).

[8]  Anand Sivasubramaniam,et al.  DPTree: A Balanced Tree Based Indexing Framework for Peer-to-Peer Systems , 2006, Proceedings of the 2006 IEEE International Conference on Network Protocols.

[9]  Timos K. Sellis,et al.  Topological relations in the world of minimum bounding rectangles: a study with R-trees , 1995, SIGMOD '95.

[10]  Jon Louis Bentley,et al.  K-d trees for semidynamic point sets , 1990, SCG '90.

[11]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[12]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[13]  Verena Kantere,et al.  Storing and Indexing Spatial Data in P2P Systems , 2009, IEEE Transactions on Knowledge and Data Engineering.

[14]  Hector Garcia-Molina,et al.  One torus to rule them all: multi-dimensional queries in P2P systems , 2004, WebDB '04.

[15]  Dimitris Sacharidis,et al.  Index-based query processing on distributed multidimensional data , 2012, GeoInformatica.

[16]  M. Frans Kaashoek,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM 2004.

[17]  Dimitris Papadias,et al.  Spatial Relations, Minimum Bounding Rectangles, and Spatial Data Structures , 1997, Int. J. Geogr. Inf. Sci..

[18]  James Aspnes,et al.  Skip graphs , 2003, SODA '03.

[19]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM 2001.

[20]  Rolf Stadler,et al.  Management by network search , 2012, 2012 IEEE Network Operations and Management Symposium.

[21]  Daniel F. Macedo,et al.  Spatial query processing in wireless sensor networks - A survey , 2014, Inf. Fusion.

[22]  Hans-Peter Kriegel,et al.  Geodetic Distance Queries on R-Trees for Indexing Geographic Data , 2013, SSTD.

[23]  Masahiro Jibiki,et al.  Hierarchy-aware skip graph for sensing resource discoveries on large-scale sensor overlay networks , 2015, Comput. Commun..

[24]  Rolf Stadler Protocols for Distributed Management , 2012 .

[25]  Yang Chen,et al.  Pharos: accurate and decentralised network coordinate system , 2009, IET Commun..

[26]  Benoit Donnet,et al.  A Survey on Network Coordinates Systems, Design, and Security , 2010, IEEE Communications Surveys & Tutorials.

[27]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[28]  Rolf Stadler,et al.  Spatial search in networked systems , 2015, 2015 11th International Conference on Network and Service Management (CNSM).

[29]  Shyam Antony,et al.  PRoBe: Multi-dimensional Range Queries in P2P Networks , 2005, WISE.