On Crowd-verification of Biological Networks

Biological networks with a structured syntax are a powerful way of representing biological information generated from high density data; however, they can become unwieldy to manage as their size and complexity increase. This article presents a crowd-verification approach for the visualization and expansion of biological networks. Web-based graphical interfaces allow visualization of causal and correlative biological relationships represented using Biological Expression Language (BEL). Crowdsourcing principles enable participants to communally annotate these relationships based on literature evidences. Gamification principles are incorporated to further engage domain experts throughout biology to gather robust peer-reviewed information from which relationships can be identified and verified. The resulting network models will represent the current status of biological knowledge within the defined boundaries, here processes related to human lung disease. These models are amenable to computational analysis. For some period following conclusion of the challenge, the published models will remain available for continuous use and expansion by the scientific community.

[1]  James Bennett,et al.  The Netflix Prize , 2007 .

[2]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[3]  Benjamin M. Good,et al.  Games with a scientific purpose , 2011, Genome Biology.

[4]  Manuel C. Peitsch,et al.  Construction of a Computable Network Model for DNA Damage, Autophagy, Cell Death, and Senescence , 2013, Bioinformatics and biology insights.

[5]  Alan D. Lopez THE GLOBAL BURDEN OF DISEASE 1990-2020 , 2001 .

[6]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[7]  Adrien Treuille,et al.  Predicting protein structures with a multiplayer online game , 2010, Nature.

[8]  J. Barendregt,et al.  Global burden of disease , 1997, The Lancet.

[9]  Manuel C. Peitsch,et al.  Construction of a computable cell proliferation network focused on non-diseased lung cells , 2011, BMC Systems Biology.

[10]  Cathy H. Wu,et al.  Community annotation and bioinformatics workforce development in concert—Little Skate Genome Annotation Workshops and Jamborees , 2012, Database J. Biol. Databases Curation.

[11]  Carole A. Goble,et al.  Micropublications: a semantic model for claims, evidence, arguments and annotations in biomedical communications , 2013, Journal of Biomedical Semantics.

[12]  Lena Mamykina,et al.  Design lessons from the fastest q&a site in the west , 2011, CHI.

[13]  Manuel C. Peitsch,et al.  Construction of a Computable Network Model of Tissue Repair and Angiogenesis in the Lung , 2013 .

[14]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[15]  Carla E. Brodley,et al.  KDD-Cup 2000 organizers' report: peeling the onion , 2000, SKDD.

[16]  Kevin C. Dorff,et al.  The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models , 2010, Nature Biotechnology.

[17]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[18]  Gary D Bader,et al.  BioPAX – A community standard for pathway data sharing , 2010, Nature Biotechnology.

[19]  Steven K. Gibb Toxicity testing in the 21st century: a vision and a strategy. , 2008, Reproductive toxicology.

[20]  M. Peitsch,et al.  Verification of systems biology research in the age of collaborative competition , 2011, Nature Biotechnology.

[21]  Henning Lenz,et al.  PREPACT 2.0: Predicting C-to-U and U-to-C RNA Editing in Organelle Genome Sequences with Multiple References and Curated RNA Editing Annotation , 2013, Bioinformatics and biology insights.

[22]  Manuel C. Peitsch,et al.  A Modular Cell-Type Focused Inflammatory Process Network Model for Non-Diseased Pulmonary Tissue , 2013, Bioinformatics and biology insights.

[23]  Alfonso Valencia,et al.  Overview of BioCreAtIvE: critical assessment of information extraction for biology , 2005, BMC Bioinformatics.

[24]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[25]  A. Califano,et al.  Dialogue on Reverse‐Engineering Assessment and Methods , 2007, Annals of the New York Academy of Sciences.

[26]  Cathy H. Wu,et al.  Community annotation in biology , 2010, Biology Direct.

[27]  Manuel C. Peitsch,et al.  Systematic Verification of Upstream Regulators of a Computable Cellular Proliferation Network Model on Non-Diseased Lung Cells Using a Dedicated Dataset , 2013, Bioinformatics and biology insights.

[28]  Martin Kuiper,et al.  Jointly creating digital abstracts: dealing with synonymy and polysemy , 2012, BMC Research Notes.

[29]  Ajay K. Royyuru,et al.  Industrial methodology for process verification in research (IMPROVER): toward systems biology verification , 2012, Bioinform..

[30]  W. Mattes,et al.  An omics strategy for discovering pulmonary biomarkers potentially relevant to the evaluation of tobacco products. , 2012, Biomarkers in medicine.

[31]  Sampo Pyysalo,et al.  Overview of BioNLP Shared Task 2013 , 2013, BioNLP@ACL.

[32]  Jennifer Park,et al.  A computable cellular stress network model for non-diseased pulmonary and cardiovascular tissue , 2011, BMC Systems Biology.

[33]  Mario Lauria,et al.  Strengths and limitations of microarray-based phenotype prediction: lessons learned from the IMPROVER Diagnostic Signature Challenge , 2013, Bioinform..

[34]  Ronan M. T. Fleming,et al.  A community-driven global reconstruction of human metabolism , 2013, Nature Biotechnology.

[35]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[36]  Manuel C. Peitsch,et al.  Assessment of network perturbation amplitudes by applying high-throughput data to causal biological networks , 2012, BMC Systems Biology.

[37]  R. Pauwels,et al.  Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease. NHLBI/WHO Global Initiative for Chronic Obstructive Lung Disease (GOLD) Workshop summary. , 2001, American journal of respiratory and critical care medicine.

[38]  Juliane Fluck,et al.  BEL Networks Derived from Qualitative Translations of BioNLP Shared Task Annotations , 2013, BioNLP@ACL.

[39]  Adam A. Margolin,et al.  Systematic Analysis of Challenge-Driven Improvements in Molecular Prognostic Models for Breast Cancer , 2013, Science Translational Medicine.

[40]  K Fidelis,et al.  A large‐scale experiment to assess protein structure prediction methods , 1995, Proteins.

[41]  Tudor Groza,et al.  State of the art and open challenges in community-driven knowledge curation , 2013, J. Biomed. Informatics.

[42]  Julia Hoeng,et al.  A network-based approach to quantifying the impact of biologically active substances. , 2012, Drug discovery today.