Research on Big Data - A systematic mapping study

Big Data has emerged as a significant area of study for both practitioners and researchers. Big Data is a term for massive data sets with large structure. In 2012, Big Data passed the top of the Gartner Hype Cycle, attesting the maturity level of this technology and its applications. The aim of this paper is to examine how do researchers grasp the big data concept? We will answer the following questions: How many research papers are produced? What is the annual trend of publications? What are the hot topics in big data research? What are the most investigated big data topics? Why the research is performed? What are the most frequently obtained research artefacts? What does big data research produces? Who are the active authors? Which journals include papers on Big Data? What are the active disciplines? For this purpose, we provide a framework identifying existing and emerging research areas of Big Data. This framework is based on eight dimensions, including the SMACIT (Social Mobile Analytics Cloud Internet of Things) perspective. Current and past research in Big Data are analyzed using a systematic mapping study of publications based on more than a decade of related academic publications. The results have shown that significant contributions have been made by the research community, attested by a continuous increase in the number of scientific publications that address Big Data. We found that researchers are increasingly involved in research combining Big Data and Analytics, Cloud, Internet of things, mobility or social media. As for quality objectives, besides an interest in performance, other topics as scalability is emerging. Moreover, security and quality aspects become important. Researchers on Big Data provide more algorithms, frameworks, and architectures than other artifacts. Finally, application domains such as earth, energy, medicine, ecology, marketing, and health attract more attention from researchers on big data. A complementary content analysis on a subset of papers sheds some light on the evolving field of big data research.

[1]  Johnny S. Wong,et al.  A Brief Review on Leading Big Data Models , 2014, Data Sci. J..

[2]  Victor C. M. Leung,et al.  Big Data: Related Technologies, Challenges and Future Prospects , 2014 .

[3]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[4]  Jie Li,et al.  Rethinking big data: A review on the data quality and usage issues , 2016 .

[5]  Daniel A. Keim,et al.  Visual analytics for the big data era — A comparative review of state-of-the-art commercial systems , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[6]  E. A. Mary Anita,et al.  A Survey of Big Data Analytics in Healthcare and Government , 2015 .

[7]  Ying Wah Teh,et al.  Big Data Clustering: A Review , 2014, ICCSA.

[8]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[9]  Guangjie Han,et al.  A survey of recent technologies and challenges in big data utilizations , 2015, 2015 International Conference on Information and Communication Technology Convergence (ICTC).

[10]  Adam Barker,et al.  Undefined By Data: A Survey of Big Data Definitions , 2013, ArXiv.

[11]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[12]  Ibrar Yaqoob,et al.  A survey of big data management: Taxonomy and state-of-the-art , 2016, J. Netw. Comput. Appl..

[13]  Paulo B. Góes,et al.  Editor's comments: big data and IS research , 2014 .

[14]  Andrea De Mauro,et al.  What is big data? A consensual definition and a review of key research topics , 2015, AIP Conference Proceedings.

[15]  Vipin Kumar,et al.  Trends in big data analytics , 2014, J. Parallel Distributed Comput..

[16]  Michael L. Brodie,et al.  The meaningful use of big data: four perspectives -- four challenges , 2012, SGMD.

[17]  Mohammad Ali Nematbakhsh,et al.  A Survey on Security Issues in Big Data and NoSQL , 2015 .

[18]  Christophe Nicolle,et al.  Understandable Big Data: A survey , 2015, Comput. Sci. Rev..

[19]  Rajkumar Buyya,et al.  Big Data computing and clouds: Trends and future directions , 2013, J. Parallel Distributed Comput..

[20]  Jacky Akoka,et al.  Artifact Evaluation in Information Systems Design-Science Research - a Holistic View , 2014, PACIS.

[21]  J. Christy Jackson,et al.  Survey on Programming Models and Environments for Cluster, Cloud, and Grid Computing that Defends Big Data☆ , 2015 .

[22]  Michael Mattioli,et al.  Big data, bigger dilemmas: A critical review , 2015, J. Assoc. Inf. Sci. Technol..

[23]  Thomas Hansmann,et al.  Big Data - Characterizing an Emerging Research Field Using Topic Models , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[24]  Muhammad Shiraz,et al.  Big Data: Survey, Technologies, Opportunities, and Challenges , 2014, TheScientificWorldJournal.

[25]  Zahir Tari,et al.  A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis , 2014, IEEE Transactions on Emerging Topics in Computing.

[26]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[27]  Honggang Wang,et al.  A survey of big data research , 2015, IEEE Network.

[28]  Seref Sagiroglu,et al.  A survey on security and privacy issues in big data , 2015, 2015 10th International Conference for Internet Technology and Secured Transactions (ICITST).

[29]  Benjamin W. Wah,et al.  Significance and Challenges of Big Data Research , 2015, Big Data Res..

[30]  I. Song,et al.  Analytics over large-scale multidimensional data: the big data revolution! , 2011, DOLAP '11.

[31]  Jacky Akoka,et al.  Research on Big Data - Characterizing the Field and Its Dimensions , 2015, ER Workshops.

[32]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[33]  Shahriar Akter,et al.  How ‘Big Data’ Can Make Big Impact: Findings from a Systematic Review and a Longitudinal Case Study , 2015 .

[34]  Beng Chin Ooi,et al.  In-Memory Big Data Management and Processing: A Survey , 2015, IEEE Transactions on Knowledge and Data Engineering.

[35]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[36]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[37]  Anwar M. Ghuloum,et al.  ViewpointFace the inevitable, embrace parallelism , 2009, CACM.

[38]  Dilpreet Singh,et al.  A survey on platforms for big data analytics , 2014, Journal of Big Data.

[39]  Carsten Felden,et al.  Big Data - A State-of-the-Art , 2012, AMCIS.

[40]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[41]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[42]  Xike Xie,et al.  Survey of real-time processing systems for big data , 2014, IDEAS.

[43]  Kai Petersen,et al.  Systematic Mapping Studies in Software Engineering , 2008, EASE.

[44]  Salvatore T. March,et al.  Design and natural science research on information technology , 1995, Decis. Support Syst..

[45]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).