Managing Lifecycle of Big Data Applications

The growing digitization and networking process within our society has a large influence on all aspects of everyday life. Large amounts of data are being produced continuously, and when these are analyzed and interlinked they have the potential to create new knowledge and intelligent solutions for economy and society. To process this data, we developed the Big Data Integrator (BDI) Platform with various Big Data components available out-of-the-box. The integration of the components inside the BDI Platform requires components homogenization, which leads to the standardization of the development process. To support these activities we created the BDI Stack Lifecycle (SL), which consists of development, packaging, composition, enhancement, deployment and monitoring steps. In this paper, we show how we support the BDI SL with the enhancement applications developed in the BDE project. As an evaluation, we demonstrate the applicability of the BDI SL on three pilots in the domains of transport, social sciences and security.

[1]  Bruno Volckaert,et al.  Model-driven deployment and management of workflows on analytics frameworks , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[2]  Manolis Koubarakis,et al.  Strabon: A Semantic Geospatial DBMS , 2012, SEMWEB.

[3]  Pasi Kuvaja,et al.  Continuous deployment of software intensive products and services: A systematic mapping study , 2017, J. Syst. Softw..

[4]  Mohak Shah,et al.  An architecture for the deployment of statistical models for the big data era , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[5]  Fuad Rahman,et al.  A novel big-data processing framwork for healthcare applications: Big-data-healthcare-in-a-box , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[6]  Konstantina Bereta,et al.  SexTant: Visualizing Time-Evolving Linked Geospatial Data , 2013, International Semantic Web Conference.

[7]  Tsakalozos Konstantinos,et al.  Open big data infrastructures to everyone , 2016 .

[8]  Stefan Manegold,et al.  GeoTriples: a Tool for Publishing Geospatial Data as RDF Graphs Using R2RML Mappings , 2014, TC/SSN@ISWC.

[9]  Jens Lehmann,et al.  The BigDataEurope Platform - Supporting the Variety Dimension of Big Data , 2017, ICWE.

[10]  Erdogan Dogdu,et al.  An extended IoT framework with semantics, big data, and analytics , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[11]  Seung-Hwan Lim,et al.  On-demand data analytics in HPC environments at leadership computing facilities: Challenges and experiences , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[12]  Aad Versteden,et al.  State-of-the-art Web Applications using Microservices and Linked Data , 2016, SALAD@ESWC.

[13]  Vangelis Karkaletsis,et al.  Semantic Web Technologies and Big Data Infrastructures: SPARQL Federated Querying of Heterogeneous Big Data Stores , 2016, International Semantic Web Conference.

[14]  Nancy W. Grady,et al.  KDD meets Big Data , 2016, 2016 IEEE International Conference on Big Data (Big Data).