CLOUD-BASED FLOWBSTER PORTAL TO DESIGN AND DEPLOY SCIENTIFIC WORKFLOWS

*Correspondence: József Kovács, Institute for Computer Science and Control, Hungarian Academy of Sciences (MTA SZTAKI), Budapest, Hungary, fjozsef.kovacs@sztaki. mta.hu Abstract A workflow system called Flowbster has been designed to create efficient data pipelines in clouds. The entire Flowbster workflow is dynamically built by using virtual machines on a target cloud. The paper describes a recently designed and developed web-based science gateway to support Flowbster. It provides a high-level graphical environment to handle different levels of abstractions, like workflows representing the layout and deployment representing the infrastructure realizing the workflow. Detailed overview of the user interface, the portal architecture and its internal operation are given in the paper. Moreover, an insight is provided on the selection and cooperation of the web modules and on the integration of the portal in the firebase environment developed by Google.

[1]  Péter Kacsuk,et al.  Enabling Workflow-Oriented Science Gateways to Access Multi-Cloud Systems , 2016, Journal of Grid Computing.

[2]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[3]  Rajkumar Buyya,et al.  Cloud Computing Principles and Paradigms , 2011 .

[4]  Paul Watson,et al.  A framework for scientific workflow reproducibility in the cloud , 2016, 2016 IEEE 12th International Conference on e-Science (e-Science).

[5]  Michael McLennan,et al.  HUBzero: A Platform for Dissemination and Collaboration in Computational Science and Engineering , 2010, Computing in Science & Engineering.

[6]  Paul Watson,et al.  Dynamic Deployment of Scientific Workflows in the Cloud Using Container Virtualization , 2016, 2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom).

[7]  Péter Kacsuk,et al.  The Flowbster Cloud-Oriented Workflow System to Process Large Scientific Data Sets , 2018, Journal of Grid Computing.

[8]  Kristina Chodorow,et al.  MongoDB: The Definitive Guide , 2010 .

[9]  Péter Kacsuk,et al.  Occopus: a Multi-Cloud Orchestrator to Deploy and Manage Complex Scientific Infrastructures , 2017, Journal of Grid Computing.

[10]  Marta Mattoso,et al.  A Survey of Data-Intensive Scientific Workflow Management , 2015, Journal of Grid Computing.

[11]  Ewa Deelman,et al.  Bringing Scientific Workflow to the Masses via Pegasus and HUBzero , 2013, IWSG.

[12]  Bartosz Balis,et al.  A Lightweight Approach for Deployment of Scientific Workflows in Cloud Infrastructures , 2015, PPAM.