Real-Time Data Management for Big Data

Users have come to expect reactivity from mobile and web applications, i.e. they assume that changes made by other users become visible immediately. However, developers are challenged with building reactive applications on top of traditional pulloriented databases, because they are ill-equipped to push new information to the client. Systems for data stream management and processing, on the other hand, are natively push-oriented and thus facilitate reactive behavior, but they do not follow the same collection-based semantics as traditional databases: Instead of database collections, stream-oriented systems are based on a notion of potentially unbounded sequences of data items. In this tutorial, we survey and categorize the system space between pull-oriented databases and push-oriented stream management systems, using their respectively facilitated means of data retrieval as a reference point. A particular emphasis lies on the novel system class of real-time databases which combine the push-based access paradigm of stream-oriented systems with the collection-based query semantics of traditional databases. We explore why real-time databases deserve distinction in a separate system class and dissect their di erent architectures to highlight issues, derive open challenges, and discuss avenues for addressing them.

[1]  Norbert Ritter,et al.  Scalable data management: NoSQL data stores in research and practice , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[2]  Norbert Ritter,et al.  Quaestor: Query Web Caching for Database-as-a-Service Providers , 2017, Proc. VLDB Endow..

[3]  Norbert Ritter,et al.  Scalable Data Management: An In-Depth Tutorial on NoSQL Data Stores , 2017, BTW.

[4]  Lukasz Golab,et al.  Data Stream Management , 2017, Data Stream Management.

[5]  Henrik Loeser,et al.  "One Size Fits All": An Idea Whose Time Has Come and Gone? , 2011, BTW.

[6]  Norbert Ritter,et al.  Real-time stream processing for Big Data , 2016, it Inf. Technol..

[7]  Norbert Ritter,et al.  NoSQL database systems: a survey and decision guidance , 2017, Computer Science - Research and Development.

[8]  Michael Stonebraker,et al.  The 8 requirements of real-time stream processing , 2005, SGMD.