A RESTful API for Access to Phylogenetic Tools via the CIPRES Science Gateway

The CIPRES Science Gateway is a community web application that provides public access to a set of parallel tree inference and multiple sequence alignment codes run on large computational resources. These resources are made available at no charge to users by the NSF Extreme Science and Engineering Discovery Environment (XSEDE) project. Here we describe the CIPRES RESTful application programmer interface (CRA), a web service that provides programmatic access to all resources and services currently offered by the CIPRES Science Gateway. Software developers can use the CRA to extend their web or desktop applications to include the ability to run MrBayes, BEAST, RAxML, MAFFT, and other computationally intensive algorithms on XSEDE. The CRA also makes it possible for individuals with modest scripting skills to access the same tools from the command line using curl, or through any scripting language. This report describes the CRA and its use in three web applications (Influenza Research Database – www.fludb.org, Virus Pathogen Resource – www.viprbrc.org, and MorphoBank – www.morphobank.org). The CRA is freely accessible to registered users at https://cipresrest.sdsc.edu/cipresrest/v1; supporting documentation and registration tools are available at https://www.phylo.org/restusers.

[1]  Thomas G. Dietterich,et al.  Next-generation phenomics for the Tree of Life , 2013, PLoS currents.

[2]  Paramvir S. Dehal,et al.  FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments , 2010, PloS one.

[3]  Andrea L. Cirranello,et al.  The Placental Mammal Ancestor and the Post–K-Pg Radiation of Placentals , 2013, Science.

[4]  John P. Huelsenbeck,et al.  MRBAYES: Bayesian inference of phylogenetic trees , 2001, Bioinform..

[5]  J. Huelsenbeck,et al.  MRBAYES : Bayesian inference of phylogeny , 2001 .

[6]  Jianwu Wang,et al.  Challenges and approaches for distributed workflow-driven analysis of large-scale biological data: vision paper , 2012, EDBT-ICDT '12.

[7]  Richard H Scheuermann,et al.  Influenza Research Database: an integrated bioinformatics resource for influenza research and surveillance , 2012, Influenza and other respiratory viruses.

[8]  Kazutaka Katoh,et al.  Parallelization of the MAFFT multiple sequence alignment program , 2010, Bioinform..

[9]  A. Rambaut,et al.  BEAST: Bayesian evolutionary analysis by sampling trees , 2007, BMC Evolutionary Biology.

[10]  Yun Zhang,et al.  ViPR: an open bioinformatics database and analysis resource for virology research , 2011, Nucleic Acids Res..

[11]  Seth Kaufman,et al.  MorphoBank: phylophenomics in the “cloud” , 2011, Cladistics : the international journal of the Willi Hennig Society.

[12]  Marco Thines,et al.  siMBa—a simple graphical user interface for the Bayesian phylogenetic inference program MrBayes , 2014, Mycological Progress.

[13]  D. Maddison,et al.  Mesquite: a modular system for evolutionary analysis. Version 2.6 , 2009 .

[14]  Derrick J. Zwickl Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion , 2006 .

[15]  Mark A. Miller,et al.  Creating the CIPRES Science Gateway for inference of large phylogenetic trees , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[16]  J. Gatesy,et al.  The supermatrix approach to systematics. , 2007, Trends in ecology & evolution.

[17]  Gerardo Chowell,et al.  Climate change and influenza: the likelihood of early and severe influenza seasons following warmer than average winters , 2013, PLoS currents.

[18]  Matthew W. Vaughn,et al.  National Center for Genome Analysis support leverages XSEDE to support life science research , 2013, XSEDE.

[19]  D. Silvestro,et al.  raxmlGUI: a graphical front-end for RAxML , 2011, Organisms Diversity & Evolution.

[20]  Ramón Doallo,et al.  CircadiOmics: integrating circadian genomics, transcriptomics, proteomics and metabolomics , 2012, Nature Methods.

[21]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[22]  Dong Xie,et al.  BEAST 2: A Software Platform for Bayesian Evolutionary Analysis , 2014, PLoS Comput. Biol..

[23]  Nicolas Lartillot,et al.  PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating , 2009, Bioinform..

[24]  Rami Rifaieh,et al.  SWAMI: Integrating Biological Databases and Analysis Tools Within User Friendly Environment , 2007, DILS.