RDP3: a flexible and fast computer program for analyzing recombination

Summary: RDP3 is a new version of the RDP program for characterizing recombination events in DNA-sequence alignments. Among other novelties, this version includes four new recombination analysis methods (3SEQ, VISRD, PHYLRO and LDHAT), new tests for recombination hot-spots, a range of matrix methods for visualizing over-all patterns of recombination within datasets and recombination-aware ancestral sequence reconstruction. Complementary to a high degree of analysis flow automation, RDP3 also has a highly interactive and detailed graphical user interface that enables more focused hands-on cross-checking of results with a wide variety of newly implemented phylogenetic tree construction and matrix-based recombination signal visualization methods. The new RDP3 can accommodate large datasets and is capable of analyzing alignments ranging in size from 1000×10 kilobase sequences to 20×2 megabase sequences within 48 h on a desktop PC. Availability: RDP3 is available for free from its web site http://darwin.uvigo.es/rdp/rdp.html Contact: darrenpatrickmartin@gmail.com Supplementary information: The RDP3 program manual contains detailed descriptions of the various methods it implements and a step-by-step guide describing how best to use these.

[1]  Christopher A. Voigt,et al.  Protein building blocks preserved by recombination , 2002, Nature Structural Biology.

[2]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[3]  D. Posada,et al.  The Effect of Recombination on the Reconstruction of Ancestral Sequences , 2010, Genetics.

[4]  D. Posada Evaluation of methods for detecting recombination from DNA sequences: empirical data. , 2002, Molecular biology and evolution.

[5]  G. Weiller Phylogenetic profiles: a graphical method for detecting genetic recombinations in homologous sequences. , 1998, Molecular biology and evolution.

[6]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[7]  D. Martin,et al.  Widely Conserved Recombination Patterns among Single-Stranded DNA Viruses , 2008, Journal of Virology.

[8]  John Maynard Smith,et al.  Analyzing the mosaic structure of genes , 1992, Journal of Molecular Evolution.

[9]  Sergei L. Kosakovsky Pond,et al.  An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1 , 2009, PLoS Comput. Biol..

[10]  David Posada,et al.  An Exact Nonparametric Method for Inferring Mosaic Structure in Sequence Triplets , 2007, Genetics.

[11]  K. Crandall,et al.  Evaluation of methods for detecting recombination from DNA sequences: Computer simulations , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Mark J. Gibbs,et al.  Sister-Scanning: a Monte Carlo procedure for assessing signals in recombinant sequences , 2000, Bioinform..

[13]  Simon Easteal,et al.  A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences , 1996, Comput. Appl. Biosci..

[14]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[15]  Ming Zhang,et al.  A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes , 2006, BMC Bioinformatics.

[16]  Darren P. Martin,et al.  Recombination Patterns in Aphthoviruses Mirror Those Found in Other Picornaviruses , 2006, Journal of Virology.

[17]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[18]  E. Holmes,et al.  Phylogenetic evidence for recombination in dengue virus. , 1999, Molecular biology and evolution.

[19]  K. Lole,et al.  Full-Length Human Immunodeficiency Virus Type 1 Genomes from Subtype C-Infected Seroconverters in India, with Evidence of Intersubtype Recombination , 1999, Journal of Virology.

[20]  Vladimir N. Minin,et al.  Dual multiple change-point model leads to more accurate recombination detection , 2005, Bioinform..

[21]  K. Crandall,et al.  A modified bootscan algorithm for automated identification of recombinant sequences and recombination breakpoints. , 2005, AIDS research and human retroviruses.

[22]  Darren P Martin,et al.  Avoidance of Protein Fold Disruption in Natural Virus Recombinants , 2007, PLoS pathogens.

[23]  David Posada,et al.  RDP2: recombination detection and analysis from sequence alignments , 2005, Bioinform..

[24]  S. Sawyer,et al.  Possible emergence of new geminiviruses by frequent recombination. , 1999, Virology.

[25]  P. Fearnhead,et al.  A coalescent-based method for detecting and estimating recombination from gene sequences. , 2002, Genetics.

[26]  Vincent Moulton,et al.  Identifying recombinants in human and primate immunodeficiency virus sequence alignments using quartet scanning , 2009, BMC Bioinformatics.

[27]  Nicholas Hamilton,et al.  Phylogenetic identification of lateral genetic transfer events , 2006, BMC Evolutionary Biology.