Data Sanitization to Reduce Private Information Leakage from Functional Genomics

[1]  Maximillian S Westphal,et al.  SMaSH: Sample matching using SNPs in humans , 2019, BMC Genomics.

[2]  Wenhui Wang,et al.  A probabilistic multi-omics data matching method for detecting sample errors in integrative analysis , 2019, GigaScience.

[3]  Michael J. Purcaro,et al.  Revealing the brain's molecular architecture. , 2018, Science.

[4]  Yaniv Erlich,et al.  Identity inference of genomic data using long-range familial searches , 2018, Science.

[5]  Michael D. Edge,et al.  Statistical Detection of Relatives Typed with Disjoint Forensic and Biomedical Loci , 2018, Cell.

[6]  Brian L Browning,et al.  A One-Penny Imputed Genome from Next-Generation Reference Panels. , 2018, American journal of human genetics.

[7]  Fabian J Theis,et al.  SCANPY: large-scale single-cell gene expression data analysis , 2018, Genome Biology.

[8]  Mauricio O. Carneiro,et al.  Scaling accurate genetic variant discovery to tens of thousands of samples , 2017, bioRxiv.

[9]  Peter J. Park,et al.  NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types , 2017, Nucleic acids research.

[10]  B. Knoppers,et al.  Are Data Sharing and Privacy Protection Mutually Exclusive? , 2016, Cell.

[11]  Fidel Ramírez,et al.  deepTools2: a next generation web server for deep-sequencing data analysis , 2016, Nucleic Acids Res..

[12]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[13]  M. Gerstein,et al.  Quantification of private information leakage from phenotype-genotype data: linking attacks , 2016, Nature Methods.

[14]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[15]  David A. Knowles,et al.  Transcriptome Sequencing of a Large Human Family Identifies the Impact of Rare Noncoding Variants , 2014, American journal of human genetics.

[16]  Tao Huang,et al.  MODMatcher: Multi-Omics Data Matcher for Integrative Genomic Analysis , 2014, PLoS Comput. Biol..

[17]  Yaniv Erlich,et al.  Routes for breaching and protecting genetic privacy , 2013, Nature Reviews Genetics.

[18]  Mauricio O. Carneiro,et al.  From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline , 2013, Current protocols in bioinformatics.

[19]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[20]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[21]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[22]  Eran Halperin,et al.  Identifying Personal Genomes by Surname Inference , 2013, Science.

[23]  E. Schadt The changing privacy landscape in the era of big data , 2012, Molecular systems biology.

[24]  K. Hao,et al.  Bayesian method to predict individual SNP genotypes from gene expression data , 2012, Nature Genetics.

[25]  N. Cox,et al.  On sharing quantitative trait GWAS results in an era of multiple-omics data and the limits of genomic privacy. , 2012, American journal of human genetics.

[26]  Peter J. Bickel,et al.  Measuring reproducibility of high-throughput experiments , 2011, 1110.4705.

[27]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[28]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[29]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[30]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[31]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[32]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[33]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[34]  S. Nelson,et al.  Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays , 2008, PLoS genetics.

[35]  Vitaly Shmatikov,et al.  Robust De-anonymization of Large Sparse Datasets , 2008, 2008 IEEE Symposium on Security and Privacy (sp 2008).

[36]  T. Hampton,et al.  The Cancer Genome Atlas , 2020, Indian Journal of Medical and Paediatric Oncology.

[37]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[38]  Bo Peng,et al.  Large-Scale Privacy-Preserving Mapping of Human Genomic Sequences on Hybrid Clouds , 2012, NDSS.

[39]  A map of human genome variation from population scale sequencing , 2012 .

[40]  Claude-Alain H. Roten,et al.  Fast and accurate short read alignment with Burrows–Wheeler transform , 2009, Bioinform..

[41]  L. Sweeney Simple Demographics Often Identify People Uniquely , 2000 .