Identifying and mitigating batch effects in whole genome sequencing data
暂无分享,去创建一个
Jennifer A. Tom | Jens Reeder | William F. Forrest | Robert R. Graham | Julie Hunkapiller | Timothy W. Behrens | Tushar R. Bhangale | Jens Reeder | T. Behrens | R. Graham | T. Bhangale | W. Forrest | J. Hunkapiller | J. Tom
[1] T. Hudson,et al. A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.
[2] G. Satten,et al. Testing Rare-Variant Association without Calling Genotypes Allows for Systematic Differences in Sequencing between Cases and Controls , 2015, bioRxiv.
[3] I. Deary,et al. Genome-wide association study of age-related macular degeneration identifies associated variants in the TNXB-FKBPL-NOTCH4 region of chromosome 6p21.3. , 2012, Human molecular genetics.
[4] Tom R. Gaunt,et al. The UK10K project identifies rare variants in health and disease , 2016 .
[5] R. Dennin. Overlooked: Extrachromosomal DNA and Their Possible Impact on Whole Genome Sequencing , 2018, The Malaysian journal of medical sciences : MJMS.
[6] ENCODEConsortium,et al. An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.
[7] Paul Scheet,et al. A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. , 2006, American journal of human genetics.
[8] Rafael A Irizarry,et al. Overcoming bias and systematic errors in next generation sequencing data , 2010, Genome Medicine.
[9] Christophe G. Lambert,et al. Learning from our GWAS mistakes: from experimental design to scientific method , 2012, Biostatistics.
[10] Insuk Lee,et al. Systematic comparison of variant calling pipelines using gold standard personal exome variants , 2015, Scientific Reports.
[11] Gabriëlle H S Buitendijk,et al. Seven New Loci Associated with Age-Related Macular Degeneration , 2013, Nature Genetics.
[12] Hiroko Matsui,et al. Effective filtering strategies to improve data quality from population-based whole exome sequencing studies , 2014, BMC Bioinformatics.
[13] Zhaoxia Yu,et al. Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies. , 2009, American journal of human genetics.
[14] P. Donnelly,et al. Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip , 2009, PLoS genetics.
[15] Tom R. Gaunt,et al. The UK10K project identifies rare variants in health and disease , 2015, Nature.
[16] D. Mann,et al. Will the Precision Medicine Initiative Transform Cardiovascular Translational Research? , 2016, JACC. Basic to translational science.
[17] C. Fischer,et al. Using next-generation DNA sequence data for genetic association tests based on allele counts with and without consideration of zero inflation , 2016, BMC Proceedings.
[18] D. Reich,et al. Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.
[19] B. Browning,et al. Haplotype phasing: existing methods and new developments , 2011, Nature Reviews Genetics.
[20] Jean-Pierre A. Kocher,et al. A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis , 2013, Bioinform..
[21] B. Browning,et al. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. , 2007, American journal of human genetics.
[22] L. Liang,et al. Extremely low-coverage sequencing and imputation increases power for genome-wide association studies , 2012, Nature Genetics.
[23] P. Donnelly,et al. A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.
[24] Jiang Li,et al. Multi-perspective quality control of Illumina exome sequencing data using QC3. , 2014, Genomics.
[25] Tao Han,et al. Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples , 2008, BMC Bioinformatics.
[26] A global reference for human genetic variation , 2015, Nature.
[27] Peter Kraft,et al. Quality control and quality assurance in genotypic data for genome‐wide association studies , 2010, Genetic epidemiology.
[28] Warren W. Kretzschmar,et al. Sparse whole genome sequencing identifies two loci for major depressive disorder , 2015, Nature.
[29] Gabor T. Marth,et al. A global reference for human genetic variation , 2015, Nature.
[30] Bjarni V. Halldórsson,et al. Large-scale whole-genome sequencing of the Icelandic population , 2015, Nature Genetics.
[31] Frederick E. Dewey,et al. CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data , 2015, Bioinform..
[32] A. Clark,et al. Inference of haplotypes from PCR-amplified samples of diploid populations. , 1990, Molecular biology and evolution.
[33] G. Abecasis,et al. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies , 2006, Nature Genetics.
[34] P. Donnelly,et al. A new statistical method for haplotype reconstruction from population data. , 2001, American journal of human genetics.
[35] K. Robasky,et al. The role of replicates for error mitigation in next-generation sequencing , 2013, Nature Reviews Genetics.
[36] Xin Jin,et al. An exome sequencing pipeline for identifying and genotyping common CNVs associated with disease with application to psoriasis , 2012, Bioinform..
[37] Manuel A. R. Ferreira,et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.
[38] S. Wright,et al. Genomic Loss and Silencing on the Y Chromosomes of Rumex , 2017, Genome biology and evolution.
[39] Heng Li. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.
[40] Stephen C. J. Parker,et al. The genetic architecture of type 2 diabetes , 2016, Nature.
[41] Jae-Yoon Jung,et al. Outgroup Machine Learning Approach Identifies Single Nucleotide Variants in Noncoding DNA Associated with Autism Spectrum Disorder , 2019, PSB.
[42] Erika Check Hayden,et al. Technology: The $1,000 genome , 2014, Nature.
[43] M. DePristo,et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.
[44] Data production leads,et al. An integrated encyclopedia of DNA elements in the human genome , 2012 .
[45] Alexa B. R. McIntyre,et al. Extensive sequencing of seven human genomes to characterize benchmark reference materials , 2015, Scientific Data.
[46] David Haussler,et al. Variation resources at UC Santa Cruz , 2006, Nucleic Acids Res..
[47] Yusuke Nakamura,et al. Genome-wide association study identifies two susceptibility loci for exudative age-related macular degeneration in the Japanese population , 2011, Nature Genetics.
[48] Bradley P. Coe,et al. Copy number variation detection and genotyping from exome sequence data , 2012, Genome research.
[49] Robert Gentleman,et al. rtracklayer: an R package for interfacing with genome browsers , 2009, Bioinform..
[50] Li Zhang,et al. Batch effects in the BRLMM genotype calling algorithm influence GWAS results for the Affymetrix 500K array , 2010, The Pharmacogenomics Journal.
[51] C. Furlanello,et al. Variability in GWAS analysis: the impact of genotype calling algorithm inconsistencies , 2010, The Pharmacogenomics Journal.
[52] Rongling Li,et al. Quality Control Procedures for Genome‐Wide Association Studies , 2011, Current protocols in human genetics.
[53] J. Leek. svaseq: removing batch effects and other unwanted noise from sequencing data , 2014, bioRxiv.