Extracting Large Data using Big Data Mining

Innovations in technology and greater affordability of digital devices have presided over today's Age of Big Data, in the quantity and diversity of high frequency digital data. These data hold the potential to allow decision makers to track development progress, improve social protection, and understand where existing policies and programmes require adjustment. For example Turning Big Data—call logs, mobile- banking transactions, online user-generated content such as blog posts and Tweets, online searches, satellite images, etc.—into actionable information requires using computational techniques to unveil patterns within and between these extremely large socioeconomic datasets. The data-driven decision-making is now being recognized broadly, and there is growing enthusiasm for the notion of ``Big Data.'' But there is currently a wide gap between its potential and its realization of real Big Data. Heterogeneity, scale, timeliness, complexity, and privacy problems with Big Data impede progress at all phases of the pipeline that can create value from data. When the data requires us to make decisions, the problems start right away during data acquisition, , currently in an ad hoc manner, about what data to keep and what to discard, and how to store what we keep reliably with the right metadata. Much data today from tweets and blogs are weakly structured pieces of text and is not natively in structured format, while images and video are structured for storage and display, but not for semantic content and search. With this, transforming such content into a structured format for later analysis it is a major challenge. A major investment in Big Data which should be properly directed, can result not only in major scientific advances, but also lay the foundation for the next generation of advances in science, medicine, and business.

[1]  SangKeun Lee,et al.  Novel approaches to crawling important pages early , 2012, Knowledge and Information Systems.

[2]  Sinan Aral,et al.  Identifying Influential and Susceptible Members of Social Networks , 2012, Science.

[3]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[4]  Ping Yang,et al.  A Sketch of Big Data Technologies , 2013, 2013 Seventh International Conference on Internet Computing for Engineering and Science.

[5]  George Karypis,et al.  Algorithms for mining the evolution of conserved relational states in dynamic networks , 2011, 2011 IEEE 11th International Conference on Data Mining.

[6]  Jimeng Sun,et al.  DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[7]  Ashwin Machanavajjhala,et al.  Big privacy: protecting confidentiality in big data , 2012, XRDS.

[8]  T. Larsen,et al.  Cross-platform aviation analytics using big-data methods , 2013, 2013 Integrated Communications, Navigation and Surveillance Conference (ICNS).

[9]  William H. Dutton,et al.  Clouds, big data, and smart assets: Ten tech-enabled business trends to watch , 2010 .