Research Progress of Stream Data Query in Network Space

In recent years, there has been widespread concern about the problems of stream data query both academic and industrial communities. The problems obtained some results. At the same time, big data stream brings great benefits for information society. Information query about stream data form has also brought crucial challenges. However, it is seldom about the research of big data stream query in network space. This paper analyzes the characteristics of stream data query in massive data, discusses the challenges and research issues of data stream for big data query. Finally the works for the data stream query are surveyed.

[1]  Qi Kai,et al.  Real-Time Processing for High Speed Data Stream over Large Scale Data , 2012 .

[2]  Hongjun Lu,et al.  Stabbing the sky: efficient skyline computation over sliding windows , 2005, 21st International Conference on Data Engineering (ICDE'05).

[3]  Mikhail J. Atallah,et al.  Computing all skyline probabilities for uncertain data , 2009, PODS.

[4]  Ben Y. Zhao,et al.  Parallelizing Skyline Queries for Scalable Distribution , 2006, EDBT.

[5]  Srinivasan Parthasarathy,et al.  Facilitating interactive distributed data stream processing and mining , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[6]  S. Muthukrishnan,et al.  Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries , 2001, VLDB.

[7]  Prashant J. Shenoy,et al.  A platform for scalable one-pass analytics using MapReduce , 2011, SIGMOD '11.

[8]  Walid G. Aref,et al.  M3: Stream Processing on Main-Memory MapReduce , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[9]  Hua Lu,et al.  Parallel Distributed Processing of Constrained Skyline Queries by Filtering , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[10]  Yan Yang,et al.  Estimation of the Number of Distinct Values over Data Stream Based on Compound Sliding Window , 2013, J. Softw..

[11]  Shyam Antony,et al.  Thread Cooperation in Multicore Architectures for Frequency Counting over Multiple Data Streams , 2009, Proc. VLDB Endow..

[12]  Yufei Tao,et al.  Distributed Skyline Retrieval with Low Bandwidth Consumption , 2009, IEEE Transactions on Knowledge and Data Engineering.

[13]  Zhenhua Wang,et al.  Continuously Maintaining Sliding Window Skylines in a Sensor Network , 2007, DASFAA.

[14]  Li Jian Processing Compressed Sliding Window Continuous Queries over Data Streams , 2004 .

[15]  Geng Yin Mining Method for Data Quality Detection Rules , 2012 .

[16]  Li Jianzhong and Liu Xianmin,et al.  An Important Aspect of Big Data: Data Usability , 2013 .

[17]  Ryen W. White,et al.  Stream prediction using a generative model based on frequent episodes in event sequences , 2008, KDD.

[18]  Jianzhong Li,et al.  Efficient Top-k Keyword Search on XML Streams , 2008, 2008 The 9th International Conference for Young Computer Scientists.

[19]  Gao Ming,et al.  A Survey on Management of Data Provenance , 2010 .

[20]  Jeffrey F. Naughton,et al.  Maximizing the Output Rate of Multi-Way Join Queries over Streaming Information Sources , 2003, VLDB.

[21]  Jianzhong Li,et al.  O(ε)-Approximation to physical world by sensor networks , 2013, 2013 Proceedings IEEE INFOCOM.

[22]  Jonghyun Park,et al.  Parallel Skyline Computation on Multicore Architectures , 2009, ICDE.

[23]  Sun Sheng Efficient Processing of Continuous Skyline Query over Distributed Data Streams , 2009 .

[24]  Liang Chen,et al.  GATES: a grid-based middleware for processing distributed data streams , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[25]  Aoying Zhou,et al.  Data-Intensive Science and Engineering:Requirements and Challenges , 2012 .

[26]  Wang Yuan Fault-Tolerant Parallel Skyline Computation in Cloud Computing Environment , 2011 .

[27]  V.K. Goyal,et al.  Estimation from lossy sensor data: jump linear modeling and Kalman filtering , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[28]  Rajeev Motwani,et al.  Sampling from a moving window over streaming data , 2002, SODA '02.

[29]  Andrey Brito,et al.  Scalable and Low-Latency Data Processing with Stream MapReduce , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[30]  Zhou Li-xin Algorithm on Computing Skyline over Probabilistic Data Stream , 2009 .

[31]  Jeffrey F. Naughton,et al.  Evaluating window joins over unbounded streams , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[32]  Arbee L. P. Chen,et al.  A Tree-Based Approach for Event Prediction Using Episode Rules over Event Streams , 2008, DEXA.

[33]  Lukasz Golab,et al.  Processing Sliding Window Multi-Joins in Continuous Queries over Data Streams , 2003, VLDB.

[34]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[35]  Zhu Hui Data Stream Prediction Based on Episode Rule Matching , 2012 .

[36]  Anthony K. H. Tung,et al.  Continuous Skyline Queries for Moving Objects , 2006, IEEE Transactions on Knowledge and Data Engineering.

[37]  Yasuhiko Morimoto,et al.  Skyline Query for Selecting Spatial Objects by Utilizing Surrounding Objects , 2013, J. Comput..

[38]  Jian Pei,et al.  Towards Progressive and Load Balancing Distributed Computation: A Case Study on Skyline Analysis , 2010, Journal of Computer Science and Technology.

[39]  Lu Liu,et al.  Muppet: MapReduce-Style Processing of Fast Data , 2012, Proc. VLDB Endow..

[40]  Philip S. Yu,et al.  Loadstar: A Load Shedding Scheme for Classifying Data Streams , 2005, SDM.

[41]  Wang Yuan,et al.  Network Big Data: Present and Future: Network Big Data: Present and Future , 2014 .

[42]  Ding Lin Efficient Skyline Query Processing of Massive Data Based on Map-Reduce , 2011 .

[43]  Jeffrey Xu Yu,et al.  Probabilistic Skyline Operator over Sliding Windows , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[44]  Abhinandan Das,et al.  Approximate join processing over data streams , 2003, SIGMOD '03.