Augmented Petri Net Cost Model for Optimisation of Large Bioinformatics Workflows Using Cloud

This paper concerns the trade-off that may be madebetween the cost of storing intermediate data and thecomputing costs incurred in regenerating this data when large bioinformatics or other workflows are implemented using cloud resources. The implementation may be required todelete some data to keep storage costs within a budget, anddeciding how best to do this with minimal increase incomputing costs can cause complex problems. To addressthese problems, a modified form of Petri net is introduced for modeling the workflow and allowing an optimization algorithm to be applied for addressing several types of problem that may arise. The proposed 'augmented Petri-net' simulates workflows with cost models included, thus providing a platform for an optimization procedure. Illustrations are presented to show that such optimization can achieve overall cost reductions in a number of different scenarios.

[1]  C. Petri Kommunikation mit Automaten , 1962 .

[2]  Liangxiu Han,et al.  Enhancing Parallelism of Data-Intensive Bioinformatics Applications , 2013, 2013 8th EUROSIM Congress on Modelling and Simulation.

[3]  Ewa Deelman,et al.  The cost of doing science on the cloud: the Montage example , 2008, HiPC 2008.

[4]  Guang Chen,et al.  A personalized recommendation algorithm via heterogeneous heat conduction , 2013, 2013 IEEE Third International Conference on Information Science and Technology (ICIST).

[5]  Gagan Agrawal,et al.  Evaluating caching and storage options on the Amazon Web Services Cloud , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[6]  Parosh Aziz Abdulla,et al.  Petri Nets with Time and Cost , 2012, Infinity.

[7]  James L. Peterson,et al.  Petri Nets , 1977, CSUR.

[8]  Zheng Xie,et al.  Automatic data reuse for accelerating data intensive applications in the Cloud , 2013, 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013).

[9]  Xiao Liu,et al.  Concurrency and Computation: Practice and Experience a Data Dependency Based Strategy for Intermediate Data Storage in Scientific Cloud Workflow Systems ‡ , 2022 .

[10]  Serge Haddad,et al.  Comparison of Different Semantics for Time Petri Nets , 2005, ATVA.

[11]  Miron Livny,et al.  The cost of doing science on the cloud: The Montage example , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.