Phylo-mLogo: an interactive and hierarchical multiple-logo visualization tool for alignment of many sequences

BackgroundWhen aligning several hundreds or thousands of sequences, such as epidemic virus sequences or homologous/orthologous sequences of some big gene families, to reconstruct the epidemiological history or their phylogenies, how to analyze and visualize the alignment results of many sequences has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences.ResultsA multiple-logo alignment visualization tool, called Phylo-mLogo, is presented in this paper. Phylo-mLogo calculates the variabilities and homogeneities of alignment sequences by base frequencies or entropies. Different from the traditional representations of sequence logos, Phylo-mLogo not only displays the global logo patterns of the whole alignment of multiple sequences, but also demonstrates their local homologous logos for each clade hierarchically. In addition, Phylo-mLogo also allows the user to focus only on the analysis of some important, structurally or functionally constrained sites in the alignment selected by the user or by built-in automatic calculation.ConclusionWith Phylo-mLogo, the user can symbolically and hierarchically visualize hundreds of aligned sequences simultaneously and easily check the changes of their amino acid sites when analyzing many homologous/orthologous or influenza virus sequences. More information of Phylo-mLogo can be found at URL http://biocomp.iis.sinica.edu.tw/phylomlogo.

[1]  Andrea Califano,et al.  Motif-based construction of a functional map for mammalian olfactory receptors. , 2003, Genomics.

[2]  Doron Lancet,et al.  The olfactory receptor gene superfamily: data mining, classification, and nomenclature , 2000, Mammalian Genome.

[3]  Webb Miller,et al.  zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. , 2004, Genome research.

[4]  Michael Q. Zhang,et al.  Identifying tissue-selective transcription factor binding sites in vertebrate promoters. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[5]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[6]  Masatoshi Nei,et al.  Selectionism and neutralism in molecular evolution. , 2005, Molecular biology and evolution.

[7]  R. Gibbs,et al.  PipMaker--a web server for aligning two genomic DNA sequences. , 2000, Genome research.

[8]  Masatoshi Nei,et al.  Evolutionary dynamics of olfactory receptor genes in fishes and tetrapods , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[10]  Yiping Fan,et al.  Response to Comment on "Large-Scale Sequence Analysis of Avian Influenza Isolates" , 2006, Science.

[11]  S. Salzberg,et al.  Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution , 2005, Nature.

[12]  Ian A. Wilson,et al.  Structure and Receptor Specificity of the Hemagglutinin from an H5N1 Influenza Virus , 2006, Science.

[13]  Doron Lancet,et al.  Prediction of the odorant binding site of olfactory receptor proteins by human–mouse comparisons , 2004, Protein science : a publication of the Protein Society.

[14]  Masatoshi Nei,et al.  Comparative evolutionary analysis of olfactory receptor gene clusters between humans and mice. , 2005, Gene.

[15]  M. Nei,et al.  Concerted and birth-and-death evolution of multigene families. , 2005, Annual review of genetics.

[16]  R. Nielsen Molecular signatures of natural selection. , 2005, Annual review of genetics.

[17]  Michael Q. Zhang,et al.  Mining ChIP-chip data for transcription factor and cofactor binding sites , 2005, ISMB.

[18]  D. T. Lee,et al.  SinicView: A visualization environment for comparisons of multiple nucleotide sequence alignment tools , 2006, BMC Bioinformatics.

[19]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[20]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[21]  H. Klenk,et al.  The viral polymerase mediates adaptation of an avian influenza virus to a mammalian host. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Gabriele Neumann,et al.  Host Range Restriction and Pathogenicity in the Context of Influenza Pandemic , 2006, Emerging infectious diseases.

[23]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[24]  Mei Li,et al.  MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences , 2003, Nucleic Acids Res..

[25]  W. Fitch,et al.  Predicting the evolution of human influenza A. , 1999, Science.

[26]  Thomas R. Bürglin,et al.  LogoBar: bar graph visualization of protein logos with gaps , 2006, Bioinform..

[27]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[28]  Sudhir Kumar,et al.  MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment , 2004, Briefings Bioinform..

[29]  Ian A. Wilson,et al.  A Single Amino Acid Substitution in 1918 Influenza Virus Hemagglutinin Changes Receptor Binding Specificity , 2005, Journal of Virology.

[30]  A. Sandelin,et al.  Applied bioinformatics for the identification of regulatory elements , 2004, Nature Reviews Genetics.

[31]  M. Nei,et al.  Evolution of olfactory receptor genes in the human genome , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Bernd Hamann,et al.  Phylo-VISTA: interactive visualization of multiple DNA sequence alignments , 2004, Bioinform..