The CIPRES science gateway: a community resource for phylogenetic analyses

The CIPRES Science Gateway (CSG) provides researchers and educators with browser-based access to community codes for inference of phylogenetic relationships from DNA and protein sequence data. The CSG allows users to deploy jobs on the high-performance computers of the TeraGrid without requiring detailed knowledge of their complexities. Use of the CSG has grown rapidly; through March 2011 it had more than 2,200 users and enabled more than 180 peer-reviewed publications. The rapid growth in resource consumption was accommodated by deploying codes on Trestles, a new TeraGrid computer. Tools and policies were developed to ensure efficient and effective resource use. This paper describes progress in managing the growth of this public cyberinfrastructure resource and reviews the domain science that it has enabled.
