Automatic extraction of topics on big data streams through scalable advanced analysis
暂无分享,去创建一个
[1] Michael W. Berry,et al. Text mining : applications and theory , 2010 .
[2] Wichian Premchaiswadi,et al. Optimizing and Tuning MapReduce Jobs to Improve the Large‐Scale Data Analysis Process , 2013, Int. J. Intell. Syst..
[3] Weiguo Fan,et al. Effective and efficient dimensionality reduction for large-scale and streaming data preprocessing , 2006, IEEE Transactions on Knowledge and Data Engineering.
[4] Rob Pike,et al. Interpreting the data: Parallel analysis with Sawzall , 2005, Sci. Program..
[5] Bin Tang,et al. Data Replication in Data Intensive Scientific Applications with Performance Guarantee , 2011, IEEE Transactions on Parallel and Distributed Systems.
[6] Richard T. Snodgrass,et al. Main Memory-Based Algorithms for Efficient Parallel Aggregation for Temporal Databases , 2004, Distributed and Parallel Databases.
[7] Michael Stonebraker,et al. A comparison of approaches to large-scale data analysis , 2009, SIGMOD Conference.
[8] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[9] Beng Chin Ooi,et al. The performance of MapReduce , 2010, Proc. VLDB Endow..
[10] Anthony K. H. Tung,et al. MAP-JOIN-REDUCE: Toward Scalable and Efficient Data Analysis on Large Clusters , 2011, IEEE Transactions on Knowledge and Data Engineering.
[11] Kai Wang,et al. Accelerating MapReduce with Distributed Memory Cache , 2009, 2009 15th International Conference on Parallel and Distributed Systems.
[12] Victor W. Marek,et al. Scalable hybrid stream and hadoop network analysis system , 2014, ICPE.
[13] Himanshu Shah,et al. Big Data Application Architecture Q & A , 2013, Apress.
[14] Douglas Stott Parker,et al. Map-reduce-merge: simplified relational data processing on large clusters , 2007, SIGMOD '07.
[15] Craig MacDonald,et al. MapReduce indexing strategies: Studying scalability and efficiency , 2012, Inf. Process. Manag..
[16] Himanshu Shah,et al. Big Data Application Architecture Q&A: A Problem - Solution Approach , 2013 .
[17] Charu C. Aggarwal,et al. Data Streams - Models and Algorithms , 2014, Advances in Database Systems.
[18] Zhike Zhang,et al. Real-time analytics processing with MapReduce , 2012, 2012 International Conference on Machine Learning and Cybernetics.
[19] Geoffrey C. Fox,et al. Grid services for earthquake science , 2002, Concurr. Comput. Pract. Exp..
[20] Vignesh Prajapati,et al. Big Data Analytics with R and Hadoop , 2013 .
[21] Walid G. Aref,et al. M3: Stream Processing on Main-Memory MapReduce , 2012, 2012 IEEE 28th International Conference on Data Engineering.
[22] Robert L. Grossman,et al. Ieee Transactions on Parallel and Distributed Systems, Manuscript Id towards Efficient and Simplified Distributed Data Intensive Computing* , 2022 .
[23] Tom White,et al. Hadoop: The Definitive Guide , 2009 .