Big data management: project and open issues

In the future, projects will be involved bombing of the huge amount of data related to a variety of fields. The existing data is exponentially growing up in different domains in the world in such a way that is not controllable. This phenomenon is the sign showing that the era for Big Data is arrived. It is plausible that data management and data analysis are the ultimate way to extract the invaluable information from these massive data sets. However, producing high accurate outcomes from these enormous data needs noticeable efforts, expense, and time. Hence, this issue necessitates paying more attention to Big Data phenomenon. Nowadays, researchers try to investigate the impacts of Big Data in variety of domains such as management, business, communication, and so forth. Also, they try to identify the Big Data opportunities and challenges. In this study, we define Big Data subject, explain its major challenges, and discuss about some algorithms in data mining for data clustering in management science. Moreover, we outline the open issues that help in identifying new research directions in Big Data management. In conclusion, this study tries to provide a solution for the Big Data management by means of the existing data mining algorithms.

[1]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[2]  Liang Dong,et al.  Starfish: A Self-tuning System for Big Data Analytics , 2011, CIDR.

[3]  H. P. Friedman,et al.  On Some Invariant Criteria for Grouping Data , 1967 .

[4]  Michael J. Carey,et al.  Extending Map-Reduce for Efficient Predicate-Based Sampling , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[5]  Alexander J. Smola,et al.  An architecture for parallel topic models , 2010, Proc. VLDB Endow..

[6]  David L. Neuhoff,et al.  Simplistic Universal Coding. , 1998, IEEE Trans. Inf. Theory.

[7]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[8]  Joseph M. Hellerstein,et al.  Distributed GraphLab: A Framework for Machine Learning in the Cloud , 2012, Proc. VLDB Endow..

[9]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[10]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[11]  Carlo Zaniolo,et al.  Early Accurate Results for Advanced Analytics on MapReduce , 2012, Proc. VLDB Endow..

[12]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[13]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[14]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[15]  Mukesh K. Mohania,et al.  Cloud Computing and Big Data Analytics: What Is New from Databases Perspective? , 2012, BDA.