Linking genes to diseases with a SNPedia-Gene Wiki mashup

BackgroundA variety of topic-focused wikis are used in the biomedical sciences to enable the mass-collaborative synthesis and distribution of diverse bodies of knowledge. To address complex problems such as defining the relationships between genes and disease, it is important to bring the knowledge from many different domains together. Here we show how advances in wiki technology and natural language processing can be used to automatically assemble ‘meta-wikis’ that present integrated views over the data collaboratively created in multiple source wikis.ResultsWe produced a semantic meta-wiki called the Gene Wiki+ that automatically mirrors and integrates data from the Gene Wiki and SNPedia. The Gene Wiki+, available at (http://genewikiplus.org/), captures 8,047 distinct gene-disease relationships. SNPedia accounts for 4,149 of the gene-disease pairs, the Gene Wiki provides 4,377 and only 479 appear independently in both sources. All of this content is available to query and browse and is provided as linked open data.ConclusionsWikis contain increasing amounts of diverse, biological information useful for elucidating the connections between genes and disease. The Gene Wiki+ shows how wiki technology can be used in concert with natural language processing to provide integrated views over diverse underlying data sources.

[1]  Ewen Callaway No rest for the bio-wikis , 2010, Nature.

[2]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[3]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[4]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[5]  R. Hoffmann A wiki for the life sciences where authorship matters , 2008, Nature Genetics.

[6]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[7]  Mitch Waldrop,et al.  Big data: Wikiomics , 2008, Nature.

[8]  Luca de Alfaro,et al.  The Gene Wiki in 2011: community intelligence applied to human gene annotation , 2011, Nucleic Acids Res..

[9]  H-J Tsai,et al.  Cysteinyl leukotriene receptor 1 gene variation and risk of asthma , 2009, European Respiratory Journal.

[10]  Jon W. Huss,et al.  A Gene Wiki for Community Annotation of Gene Function , 2008, PLoS biology.

[11]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[12]  M. Ashburner,et al.  Calling on a million minds for community annotation in WikiProteins , 2008, Genome Biology.

[13]  Lewis Y. Geer,et al.  Database resources of the National Center for Biotechnology Information , 2014, Nucleic Acids Res..

[14]  Markus Krötzsch,et al.  Semantic Wikipedia , 2006, WikiSym '06.

[15]  Jörg Stülke,et al.  A community-curated consensual annotation that is continuously updated: the Bacillus subtilis centred wiki SubtiWiki , 2009, Database J. Biol. Databases Curation.

[16]  Andrew I. Su,et al.  The Gene Wiki: community intelligence applied to human gene annotation , 2009, Nucleic Acids Res..

[17]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[18]  Dan M. Bolser,et al.  PDBWiki: added value through community annotation of the Protein Data Bank , 2010, Database J. Biol. Databases Curation.

[19]  M. Millan,et al.  Dopamine D3 receptor agonists for protection and repair in Parkinson's disease. , 2007, Current opinion in pharmacology.

[20]  Hala Skaf-Molli,et al.  DSMW: Distributed Semantic MediaWiki , 2010, SemWiki@ESWC.

[21]  John C. Wooley,et al.  TOPSAN: a collaborative annotation environment for structural genomics , 2010, BMC Bioinformatics.

[22]  Yasunori Sato,et al.  Genetic polymorphisms in folate and alcohol metabolism and breast cancer risk: a case–control study in Thai women , 2010, Breast Cancer Research and Treatment.

[23]  W. Kibbe,et al.  Annotating the human genome with Disease Ontology , 2009, BMC Genomics.