Visual BLAST and visual FASTA: graphic workbenches for interactive analysis of full BLAST and FASTA outputs under Microsoft Windows 95/NT

MOTIVATION When routinely analysing protein sequences, detailed analysis of database search results made with BLAST and FASTA becomes exceedingly time consuming and tedious work, as the resultant file may contain a list of hundreds of potential homologies. The interpretation of these results is usually carried out with a text editor which is not a convenient tool for this analysis. In addition, the format of data within BLAST and FASTA output files makes them difficult to read. RESULTS To facilitate and accelerate this analysis, we present for the first time, two easy-to-use programs designed for interactive analysis of full BLAST and FASTA output files containing protein sequence alignments. The programs, Visual BLAST and Visual FASTA, run under Microsoft Windows 95 or NT systems. They are based on the same intuitive graphical user interface (GUI) with extensive viewing, searching, editing, printing and multithreading capabilities. These programs improve the browsing of BLAST/FASTA results by offering a more convenient presentation of these results. They also implement on a computer several analytical tools which automate a manual methodology used for detailed analysis of BLAST and FASTA outputs. These tools include a pairwise sequence alignment viewer, a Hydrophobic Cluster Analysis plot alignment viewer and a tool displaying a graphical map of all database sequences aligned with the query sequence. In addition. Visual Blast includes tools for multiple sequence alignment analysis (with an amino acid patterns search engine), and Visual FASTA provides a GUI to the FASTA program.

[1]  W. Pearson Comparison of methods for searching protein sequence databases , 1995, Protein science : a publication of the Protein Society.

[2]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[3]  R. F. Smith,et al.  BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. , 1995, Genome research.

[4]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[5]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[6]  A Bairoch PROSITE: a dictionary of sites and patterns in proteins. , 1992, Nucleic acids research.

[7]  B Henrissat,et al.  Hydrophobic cluster analysis: procedures to derive structural and functional information from 2-D-representation of protein sequences. , 1990, Biochimie.

[8]  J. Devereux,et al.  A comprehensive set of sequence analysis programs for the VAX , 1984, Nucleic Acids Res..

[9]  Gilles Labesse,et al.  MulBlast 1.0: a multiple alignment of BLAST output to boost protein sequence similarity analysis , 1996, Comput. Appl. Biosci..

[10]  A. Bairoch PROSITE: a dictionary of sites and patterns in proteins. , 1991, Nucleic acids research.

[11]  B. Henrissat,et al.  Active-site motifs of lysosomal acid hydrolases: invariant features of clan GH-A glycosyl hydrolases deduced from hydrophobic cluster analysis. , 1997, Glycobiology.

[12]  J. P. Mornon,et al.  HCABAND: a computer program for the 2D-helical representation of protein sequences , 1990, Comput. Appl. Biosci..

[13]  Sándor Pongor,et al.  Predicting potential domain homologies from FASTA search results , 1993, Comput. Appl. Biosci..

[14]  B. Sopher,et al.  Molecular cloning and characterization of bovine beta-mannosidase. , 1995, The Journal of biological chemistry.

[15]  Eugene V. Koonin,et al.  A simple tool to search for sequence motifs that are conserved in BLAST outputs , 1994, Comput. Appl. Biosci..