GBuilder--an application for the visualization and integration of EST cluster data.

This paper presents a network-centric DNA sequence visualization and analysis tool called GBuilder. The tool is an easy-to-use Java application that can be used to analyze DNA sequence clusters and assemblies. The emphasis is on the analysis of EST data, where these highly redundant collections of low-quality and often alternatively spliced or chimeric sequence data are difficult to explore. The tool has the capacity to visualize similarities or dissimilarities between sequences at the level of the nucleotide base or annotation in many ways. Sequences may also be edited manually. The novel feature of GBuilder is its ability to access different data sources and analysis applications available on the Internet and to integrate these results and functionality back into itself. External resources such as EST cluster databases and conventional command-line analysis applications are integrated and accessed using CORBA (Common Object Request Broker Architecture), which provides a standard implementation independent protocol for integration. New CORBA services can be integrated immediately if they use a known interface described using the Interface Definition Language.

[1]  M. Tohyama,et al.  Expressed-sequence-tag approach to identify differentially expressed genes following peripheral nerve axotomy , 1998, Neuroscience Research.

[2]  A. Chou,et al.  CRAWview: for viewing splicing variation, gene families, and polymorphism in clusters of ESTs and full-length sequences , 1999, Bioinform..

[3]  X. Huang,et al.  CAP3: A DNA sequence assembly program. , 1999, Genome research.

[4]  David T. Flanagan Java in a nutshell - a desktop quick reference: covers Java 1.1 (2. deluxe edition) , 1997 .

[5]  Raghu V. Hudli,et al.  CORBA fundamentals and programming , 1996 .

[6]  C. Pilarsky,et al.  Exhaustive mining of EST libraries for genes differentially expressed in normal and tumour tissues. , 1999, Nucleic acids research.

[7]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[8]  T. Ideker,et al.  Mining SNPs from EST databases. , 1999, Genome research.

[9]  Graziano Pesole,et al.  CLEANUP: a fast computer program for removing redundancies from nucleotide sequence databases , 1996, Comput. Appl. Biosci..

[10]  Patricia Rodriguez-Tomé,et al.  RHdb: the Radiation Hybrid database , 2001, Nucleic Acids Res..

[11]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[12]  D B Davison,et al.  Alternative gene form discovery and candidate gene selection from gene indexing projects. , 1998, Genome research.

[13]  K. O. Elliston,et al.  Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data. , 1996, Genome research.

[14]  Winston A Hide,et al.  A comprehensive approach to clustering of expressed human gene sequence: the sequence tag alignment and consensus knowledge base. , 1999, Genome research.

[15]  G. Schuler Pieces of the puzzle: expressed sequence tags and the catalog of human genes , 1997, Journal of Molecular Medicine.

[16]  Thangavel Alphonse Thanaraj A clean data set of EST-confirmed splice sites from Homo sapiens and standards for clean-up procedures , 1999, Nucleic Acids Res..

[17]  X. Huang,et al.  An improved sequence assembly program. , 1996, Genomics.

[18]  Patricia Rodriguez-Tomé,et al.  JESAM: CORBA software components to create and publish EST alignments and clusters , 2000, Bioinform..

[19]  S Audic,et al.  Alternate polyadenylation in human mRNAs: a large-scale analysis by EST clustering. , 1998, Genome research.

[20]  S. Taylor,et al.  A new dynamic tool to perform assembly of expressed sequence tags (ESTs) , 1997, Comput. Appl. Biosci..

[21]  Gregory D. Schuler,et al.  ESTablishing a human transcript map , 1995, Nature Genetics.