galaxie-CGI scripts for sequence identification through automated phylogenetic analysis

MOTIVATION The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. RESULTS Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. AVAILABILITY A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se

[1]  Manfred Binder,et al.  Evolution of complex fruiting–body morphologies in homobasidiomycetes , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[2]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[3]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[4]  M. Nei,et al.  Phylogenetic analysis in molecular evolutionary genetics. , 1996, Annual review of genetics.

[5]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[6]  J. Cracraft,et al.  Parsimony and Phylogenetic Inference Using DNA Sequences : Some Methodological Strategies , 2022 .

[7]  D. Tautz,et al.  A plea for DNA taxonomy , 2003 .

[8]  Kevin de Queiroz,et al.  Phylogenetic definitions and taxonomic philosophy , 1992 .

[9]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[10]  Scott A. Redhead,et al.  Phylogeny of agarics: Partial systematics solutions for bryophilous omphalinoid agarics outside of the Agaricales (Euagarics) , 2002 .

[11]  Jeremy R. deWaard,et al.  Biological identifications through DNA barcodes , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[12]  Steven J. M. Jones,et al.  PhyloBLAST: facilitating phylogenetic analysis of BLAST results , 2001, Bioinform..

[13]  Nils Hallenberg,et al.  Phylogeny of the Hypochnicium punctulatum complex as inferred from ITS sequence data , 2003, Mycologia.

[14]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[15]  Mark Ridley,et al.  Evolution and Classification: The Reformation of Cladism , 1986 .

[16]  G. Perrière,et al.  BIBI, a Bioinformatics Bacterial Identification Tool , 2003, Journal of Clinical Microbiology.

[17]  O. Kandler,et al.  Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[19]  Nils Hallenberg,et al.  Phylogeography of Hyphoderma setigerum (Basidiomycota) in the Northern Hemisphere. , 2003, Mycological research.