Delivering ICT infrastructure for biomedical research

This paper describes an implementation of the Infrastructure-as-a-Service (IaaS) concept for scientific computing and seven service pilot implementations with requirements from biomedical use cases at the CSC - IT Center for Science. The key service design requirements were enabling the use of any scientific software environment the use cases needed to succeed, and delivering the distributed infrastructure ICT resources seamlessly with the local ICT resources for the scientist users. The service concept targets the IT administrators at research organisations and delivers virtualised compute cluster and storage capacity via private network solutions. The virtualised resources can become part of the local cluster as virtual nodes and they can share the same file system as the physical nodes assuming the network performance is sufficient. Extension of the local resources can then be made transparent to enable an easy infrastructure uptake to the scientist end-users. Based on 20 months of service piloting most of the biomedical organisations express a sustained and growing need for the distributed compute and storage resources delivered with the IaaS. We conclude that a successful implementation of the IaaS can improve access and reduce the effort to run expensive ICT infrastructure needed for biomedical research.

[1]  Geoffrey C. Fox,et al.  Hybrid cloud and cluster computing paradigms for life science applications , 2010, BMC Bioinformatics.

[2]  Mikko Koski,et al.  Chipster: user-friendly analysis software for microarray and other high-throughput data , 2011, BMC Genomics.

[3]  Brian D. O'Connor,et al.  SeqWare Query Engine: storing and searching sequence data in the cloud , 2010, BMC Bioinformatics.

[4]  Peter J. Tonellato,et al.  Biomedical Cloud Computing With Amazon Web Services , 2011, PLoS Comput. Biol..

[5]  Ronald C. Taylor An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.

[6]  Stanimir Rakić,et al.  On the organization of the lexicon , 1986 .

[7]  G. Nolan,et al.  Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology , 2011, Nature Reviews Genetics.

[8]  L. Stein The case for cloud computing in genome informatics , 2010, Genome Biology.

[9]  Morris A. Swertz,et al.  The MOLGENIS toolkit: rapid prototyping of biosoftware at the push of a button , 2010, BMC Bioinformatics.

[10]  Michael C. Schatz,et al.  Cloud Computing and the DNA Data Race , 2010, Nature Biotechnology.

[11]  Doris M Kupfer Cloud computing in biomedical research. , 2012, Aviation, space, and environmental medicine.

[12]  Pedro Fernandes ELIXIR (Infrastructure for Biological Information in Europe) , 2014 .

[13]  Andrew P. Martin,et al.  myTrustedCloud: Trusted Cloud Infrastructure for Security-critical Computation and Data Managment , 2011, CloudCom.

[14]  Alfonso Valencia,et al.  On the organization of bioinformatics core services in biology-based research institutes , 2011, Bioinform..

[15]  K. Ovaska,et al.  Large-scale data integration framework provides a comprehensive view on glioblastoma multiforme , 2010, Genome Medicine.