BioBIKE: A Web-based, programmable, integrated biological knowledge base

BioBIKE (biobike.csbc.vcu.edu) is a web-based environment that enables biologists with little programming expertise to combine tools, data, and knowledge in novel and possibly complex ways, as demanded by the biological problem at hand. BioBIKE is composed of three integrated components: a biological knowledge base, a graphical programming interface, and an extensible set of tools. Each of the five current BioBIKE instances provides all available information (genomic, metabolic, experimental) appropriate to a given research community. The BioBIKE programming language and graphical programming interface employ familiar operations to help users combine functions and information to conduct biologically meaningful analyses. Many commonly used tools, such as BLAST and PHYLIP, are built in, allowing users to access them within the same interface and to pass results from one to another. Users may also invent their own tools by packaging complex expressions under a single name, which immediately becomes accessible through the graphical interface. BioBIKE represents a partial solution to the difficult question of how to enable those with no background in computer programming to work directly and creatively with large volumes of biological information. BioBIKE is distributed under the MIT Open Source license. A description of the underlying language and other technical matters is available at www.Biobike.org.
