论文信息 - PDA v.2: improving the exploration and estimation of nucleotide polymorphism in large datasets of heterogeneous DNA

PDA v.2: improving the exploration and estimation of nucleotide polymorphism in large datasets of heterogeneous DNA

Pipeline Diversity Analysis (PDA) is an open-source, web-based tool that allows the exploration of polymorphism in large datasets of heterogeneous DNA sequences, and can be used to create secondary polymorphism databases for different taxonomic groups, such as the Drosophila Polymorphism Database (DPDB). A new version of the pipeline presented here, PDA v.2, incorporates substantial improvements, including new methods for data mining and grouping sequences, new criteria for data quality assessment and a better user interface. PDA is a powerful tool to obtain and synthesize existing empirical evidence on genetic diversity in any species or species group. PDA v.2 is available on the web at .

Sònia Casillas | Antonio Barbadilla | A. Barbadilla | Sònia Casillas

[1] Xavier Messeguer,et al. DnaSP, DNA polymorphism analyses by the coalescent and other methods , 2003, Bioinform..

[2] Sònia Casillas,et al. DPDB: a database for the storage, representation and analysis of polymorphism in the Drosophila genus , 2005, ECCB/JBI.

[3] D. Higgins,et al. T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[4] J. Thompson,et al. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[5] Robert C. Edgar,et al. MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[6] Rodrigo Lopez,et al. Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[7] Antonio Barbadilla,et al. PDA: a pipeline to explore and estimate polymorphism in large DNA databases , 2004, Nucleic Acids Res..