Big Data: The New Challenges in Data Mining

Big Data is a new term used to identify the datasets but due to their large size and complexity, we cannot manage them with our current methodologies or data mining software tools. With the fast development of networking, data storage, and the data collection capacity, Big Data is now rapidly expanding in all science and engineering domains, including physical, biological and biomedical sciences. Big Data mining is the capability of extracting useful information from these large datasets or streams of data, that due to its volume, variability, and velocity, it was not possible before to do it. The Big Data challenge is becoming one of the most exciting opportunities for the next years. This paper represents a broad overview of the topic, Big Data challenges, Data Mining Challenges with Big Data, Big Data processing framework and forecast to the future.

[1]  Dan Feldman,et al.  Turning big data into tiny data: Constant-size coresets for k-means, PCA and projective clustering , 2013, SODA.

[2]  Wei Fan,et al.  Mining big data: current status, and forecast to the future , 2013, SKDD.

[3]  Charles Parker,et al.  Unexpected challenges in large scale machine learning , 2012, BigMine '12.

[4]  Shan Suthaharan,et al.  Big data classification: problems and challenges in network intrusion prediction with machine learning , 2014, PERV.

[5]  Robert B. Ross,et al.  The Top 10 Challenges in Extreme-Scale Visual Analytics , 2012, IEEE Computer Graphics and Applications.

[6]  Vivekanand Gopalkrishnan,et al.  Big data, big business: bridging the gap , 2012, BigMine '12.

[7]  Neal Leavitt,et al.  Storage Challenge: Where Will All That Big Data Go? , 2013, Computer.

[8]  John Gantz,et al.  The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East , 2012 .

[9]  Christian P. Robert,et al.  Large-scale inference , 2010 .

[10]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[11]  Sean Carlin,et al.  Cloud Computing Technologies , 2012, CloudCom 2012.

[12]  Nathan Marz,et al.  Big Data: Principles and best practices of scalable realtime data systems , 2015 .

[13]  Keith W. Miller,et al.  Big Data: New Opportunities and New Challenges [Guest editors' introduction] , 2013, Computer.

[14]  Igor Muttik,et al.  Cloud security technologies , 2009, Inf. Secur. Tech. Rep..