10 Simple Rules for Sharing Human Genomic Data

These 10 Simple Rules have been developed from our combined experiences of working with human genomic data, data repositories and data users. We do not claim that these rules will eliminate every possible risk of data misuse. Rather, we hope that these will help researchers to increase the reusability of their human genomic data, whilst also ensuring that the privacy of their subjects is maintained according to their consent frameworks. Many of the principles presented are also applicable to other types of clinical research data, where participant privacy is a concern.

[1]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[2]  S. Nelson,et al.  Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays , 2008, PLoS genetics.

[3]  K. Frazer Human Genome Variation , 2007, Nature Genetics.

[4]  Yaniv Erlich,et al.  Routes for breaching and protecting genetic privacy , 2013, Nature Reviews Genetics.

[5]  A. El-Osta,et al.  Gene name errors are widespread in the scientific literature , 2016, Genome Biology.

[6]  Alvis Brazma,et al.  Minimum Information About a Microarray Experiment (MIAME) – Successes, Failures, Challenges , 2009, TheScientificWorldJournal.

[7]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[8]  Damian Smedley,et al.  The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data , 2014, Nucleic Acids Res..

[9]  Iain Hrynaszkiewicz,et al.  Publishing descriptions of non-public clinical datasets: proposed guidance for researchers, repositories, editors and funding organisations , 2016, Research Integrity and Peer Review.

[10]  C. Amid European Nucleotide Archive: Quick tour , 2017 .

[11]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[12]  Tempest A. van Schaik,et al.  The need to redefine genomic data sharing: A focus on data accessibility , 2014, Applied & translational genomics.

[13]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[14]  C. Ball,et al.  Repeatability of published microarray gene expression analyses , 2009, Nature Genetics.

[15]  Allyson L. Lister,et al.  BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences , 2016, Database J. Biol. Databases Curation.