The Zebrafish GenomeWiki: a crowdsourcing approach to connect the long tail for zebrafish gene annotation

A large repertoire of gene-centric data has been generated in the field of zebrafish biology. Although the bulk of these data are in the public domain, most are either not readily accessible or available only in nonstandard formats. One major challenge is to unify and integrate these widely scattered data sources. We tested the hypothesis that active community participation could be a viable option to address this challenge. We present here our approach to creating standards for the assimilation and sharing of information, together with a system of open standards for database intercommunication, in the form of a community-centric solution for zebrafish gene annotation. The Zebrafish GenomeWiki is a ‘wiki’-based resource that aims to provide an altruistic shared environment for collective annotation of zebrafish genes. It has features that enable users to comment on, annotate, edit and rate this gene-centric information, and credit for contributions can be tracked through a transparent microattribution system. In contrast to other wikis, the Zebrafish GenomeWiki is a ‘structured wiki’, or rather a ‘semantic wiki’: it implements a semantically linked data structure, which in the future would be amenable to semantic search. Database URL: http://genome.igib.res.in/twiki

Meenakshi Sharma | Eric W. Klee | Ashish Mittal | Vaibhav Jadhav | Stephen C. Ekker | Koustav Pal | Vivek Bhardwaj | Shruti Kapoor | Aalok Kumar | Meghna Singh | Vikas Pandey | Deeksha Bhartiya | Jayant Maini | Angom Ramcharan Singh | Subburaj Kadarkaraisamy | Rajiv Rana | Ankit Sabharwal | Srishti Nanda | Aravindhakshan Ramachandran | Paras Sehgal | Zainab Asad | Kriti Kaushik | Shamsudheen Karuthedath Vellarikkal | Divya Jagga | Muthulakshmi Muthuswami | Rajendra K. Chauhan | Elvin Leonard | Ruby Priyadarshini | Mahantappa Halimani | Sunny Malhotra | Ashok Patowary | Harinder Vishwakarma | Prateek Rakeshkumar Joshi | Arijit Bhaumik | Bharat Bhatt | Aamod Jha | Prerna Budakoti | Mukesh Kumar Lalwani | Rajeshwari Meli | Saakshi Jalali | Kandarp Joshi | Heena Dhiman | Saurabh V. Laddha | Naresh Singh | Chetana Sachidanandan | Vinod Scaria | Sridhar Sivasubbu
