On the Need of Opening the Big Data Landscape to Everyone: Challenges and New Trends

The great variety and intrinsic complexity of current Big Data technologies hamper the development of analytic processes for large data sets in domains whose business experts are not expected to have specialized computing knowledge, such as data mining, parallel computing, machine learning or software development. New approaches are therefore needed to simplify and promote the adoption of these technologies, and to open them to everyone, in sectors such as health, economy and market analysis, where this kind of data processing is in high demand but still has to be outsourced. In this context, workflows are conceptually close to the business expert: they are a well-known mechanism for representing a sequence of domain-specific activities and for automating data processes independently of infrastructure requirements. In this chapter, we discuss the challenges that must be addressed before workflow-based Big Data processes can be widely adopted. We also analyze existing workflow management tools and the new trends in developing custom solutions for multiple domains.
