Datamonkey 2.0: a modern web application for characterizing selective and other evolutionary processes.

Inference of how evolutionary forces have shaped extant genetic diversity is a cornerstone of modern comparative sequence analysis. Advances in sequence generation and increased statistical sophistication of relevant methods now allow researchers to extract ever more evolutionary signal from the data, albeit at an increased computational cost. Here, we announce the release of Datamonkey 2.0, a completely re-engineered version of the Datamonkey web-server for analyzing evolutionary signatures in sequence data. For this endeavor, we leveraged recent developments in open-source libraries that facilitate interactive, robust, and scalable web application development. Datamonkey 2.0 provides a carefully curated collection of methods for interrogating coding-sequence alignments for imprints of natural selection, packaged as a responsive (i.e. can be viewed on tablet and mobile devices), fully interactive, and API-enabled web application. To complement Datamonkey 2.0, we additionally release HyPhy Vision, an accompanying JavaScript application for visualizing analysis results. HyPhy Vision can also be used separately from Datamonkey 2.0 to visualize locally-executed HyPhy analyses. Together, Datamonkey 2.0 and HyPhy Vision showcase how scientific software development can benefit from general-purpose open-source frameworks. Datamonkey 2.0 is freely and publicly available at http://www.datamonkey. org, and the underlying codebase is available from https://github.com/veg/datamonkey-js.

[1]  Sergei L. Kosakovsky Pond,et al.  FUBAR: a fast, unconstrained bayesian approximation for inferring selection. , 2013, Molecular biology and evolution.

[2]  E. Holmes,et al.  A single positively selected West Nile viral mutation confers increased virogenesis in American crows , 2007, Nature Genetics.

[3]  Andrew Rambaut,et al.  Evolution and Spread of Ebola Virus in Liberia, 2014-2015. , 2015, Cell host & microbe.

[4]  Avraham Leff,et al.  Web-application development using the Model/View/Controller design pattern , 2001, Proceedings Fifth IEEE International Enterprise Distributed Object Computing Conference.

[5]  Peter Ebert,et al.  Ten Simple Rules for Developing Usable Software in Computational Biology , 2017, PLoS Comput. Biol..

[6]  S. Banke,et al.  Positive Selection Pressure Introduces Secondary Mutations at Gag Cleavage Sites in Human Immunodeficiency Virus Type 1 Harboring Major Protease Resistance Mutations , 2009, Journal of Virology.

[8]  Agostinho Antunes,et al.  Olfactory Receptor Subgenomes Linked with Broad Ecological Adaptations in Sauropsida. , 2015, Molecular biology and evolution.

[9]  Olufunmilayo I. Olopade,et al.  DNA Glycosylases Involved in Base Excision Repair May Be Associated with Cancer Risk in BRCA1 and BRCA2 Mutation Carriers , 2014, PLoS genetics.

[10]  Sergei L. Kosakovsky Pond,et al.  UC Office of the President Recent Work Title Less Is More : An Adaptive Branch-Site Random Effects Model for Efficient Detection of Episodic Diversifying Selection Permalink , 2015 .

[11]  Georgii A. Bazykin,et al.  Intrasubtype Reassortments Cause Adaptive Amino Acid Replacements in H3N2 Influenza Genes , 2014, PLoS genetics.

[12]  S. Berger,et al.  Chemoreceptor Evolution in Hymenoptera and Its Implications for the Evolution of Eusociality , 2015, Genome biology and evolution.

[13]  Sergei L. Kosakovsky Pond,et al.  Detecting Individual Sites Subject to Episodic Diversifying Selection , 2012, PLoS genetics.

[14]  Konrad Scheffler,et al.  Evolutionary fingerprinting of genes. , 2010, Molecular biology and evolution.

[15]  James Howison,et al.  Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature , 2016, J. Assoc. Inf. Sci. Technol..

[16]  Konrad Scheffler,et al.  Robust inference of positive selection from recombining coding sequences , 2006, Bioinform..

[17]  Sergei L. Kosakovsky Pond,et al.  Not so different after all: a comparison of methods for detecting amino acid sites under selection. , 2005, Molecular biology and evolution.

[18]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[19]  D. Macqueen,et al.  Molecular adaptation of ammonia monooxygenase during independent pH specialization in Thaumarchaeota , 2016, Molecular ecology.

[20]  Ben Murrell,et al.  RELAX: detecting relaxed selection in a phylogenetic framework. , 2014, Molecular biology and evolution.

[21]  Ben Murrell,et al.  Gene-wide identification of episodic selection. , 2015, Molecular biology and evolution.

[22]  Ian M. Mitchell,et al.  Best Practices for Scientific Computing , 2012, PLoS biology.

[23]  David Posada,et al.  Automated phylogenetic detection of recombination using a genetic algorithm. , 2006, Molecular biology and evolution.

[24]  Sergei L. Kosakovsky Pond,et al.  HyPhy: hypothesis testing using phylogenies , 2005, Bioinform..

[25]  Sergei L. Kosakovsky Pond,et al.  Purifying Selection Can Obscure the Ancient Age of Viral Lineages , 2011, Molecular biology and evolution.

[26]  S. Muse,et al.  Site-to-site variation of synonymous substitution rates. , 2005, Molecular biology and evolution.

[27]  Sergei L. Kosakovsky Pond,et al.  Datamonkey: rapid detection of selective pressure on individual sites of codon alignments , 2005, Bioinform..

[28]  Andreas Prlic,et al.  Ten Simple Rules for the Open Development of Scientific Software , 2012, PLoS Comput. Biol..

[29]  Philippe Suter,et al.  A Look at the Dynamics of the JavaScript Package Ecosystem , 2016, 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR).