Using Cloud Computing for Scalable , Reproducible Experimentation

Whether it be data generated from genomic sequencers, telescopes, or other laboratory instruments, technology apparent in many scientific disciplines is generating data at rates never witnessed before. Scientists in the area of bioinformatics are among the many who perform inductive experiments and analyses on these data with the goal of answering scientific questions. These computationally demanding experiments and analyses have become a common occurrence, resulting in a shift in scientific discovery, and thus leading to the term

[1]  Joel T Dudley,et al.  In silico research in the era of cloud computing , 2010, Nature Biotechnology.

[2]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[3]  Geoffrey C. Fox,et al.  Supporting Experimental Computer Science , 2012 .

[4]  Miron Livny,et al.  Condor: a distributed job scheduler , 2001 .

[5]  Bill Howe,et al.  Virtual Appliances, Cloud Computing, and Reproducible Research , 2012, Computing in Science & Engineering.

[6]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[7]  Anthony J. G. Hey,et al.  Jim Gray on eScience: a transformed scientific method , 2009, The Fourth Paradigm.

[8]  Manish Parashar,et al.  CometCloud: An Autonomic Cloud Engine , 2011, CloudCom 2011.

[9]  Kai Diethelm,et al.  The Limits of Reproducibility in Numerical Simulation , 2012, Computing in Science & Engineering.

[10]  Gianluigi Zanetti,et al.  Using Virtual Clusters to Decouple Computation and Data Management in High Throughput Analysis Applications , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[11]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[12]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[13]  David E. Culler,et al.  Wide area cluster monitoring with Ganglia , 2003, 2003 Proceedings IEEE International Conference on Cluster Computing.

[14]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[15]  Michael C. Schatz,et al.  Cloud Computing and the DNA Data Race , 2010, Nature Biotechnology.

[16]  Gianluigi Zanetti,et al.  Suspending, migrating and resuming HPC virtual clusters , 2010, Future Gener. Comput. Syst..

[17]  Peter J. Tonellato,et al.  Cloud computing for comparative genomics , 2010, BMC Bioinformatics.

[18]  R. Dahlstrom,et al.  Challenges and opportunities , 2021, Foundations of a Sustainable Economy.

[19]  Robert P. Goldberg,et al.  Survey of virtual machine research , 1974, Computer.

[20]  Borja Sotomayor,et al.  Virtual Clusters for Grid Communities , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[21]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[22]  Monya Baker,et al.  Next-generation sequencing: adjusting to data overload , 2010, Nature Methods.

[23]  John Bresnahan,et al.  Managing appliance launches in infrastructure clouds , 2011, TG.

[24]  L. Stein The case for cloud computing in genome informatics , 2010, Genome Biology.

[25]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[26]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .