Bioinformatics Applications Note: Databases and Ontologies
BioQ: Tracing Experimental Origins in Public Genomic Databases Using a Novel Data Provenance Model

Public genomic databases, which are often used to guide genetic studies of human disease, are now being applied to genomic medicine through in silico integrative genomics. These databases, however, often lack tools for systematically determining the experimental origins of the data.

RESULTS: We introduce a new data provenance model, implemented in a public web application, BioQ, for assessing the reliability of the data by systematically tracing its experimental origins back to the original subjects and biologics. BioQ allows investigators both to visualize data provenance and to explore individual elements of the experimental process flow, using precise tools for detailed data exploration and documentation. BioQ includes several human genetic variation databases, such as those of the HapMap and 1000 Genomes projects.

AVAILABILITY AND IMPLEMENTATION: BioQ is freely available to the public at http://bioq.saclab.net.