A genome-wide study of preferential amplification/hybridization in microarray-based pooled DNA experiments

Microarray-based pooled DNA methods overcome the cost bottleneck of simultaneously genotyping more than 100 000 markers in large numbers of study individuals. The success of such methods relies on properly adjusting for preferential amplification/hybridization to ensure accurate and reliable allele frequency estimation. We performed a hybridization-based genome-wide single nucleotide polymorphism (SNP) genotyping analysis to dissect preferential amplification/hybridization. The majority of SNPs showed less than 2-fold signal amplification or suppression, and lognormal distributions adequately modeled preferential amplification/hybridization across the human genome. Comparative analyses suggested that the distributions of preferential amplification/hybridization differed among genotypes and with GC content. Patterns among different ethnic populations were similar; nevertheless, a small proportion of SNPs showed striking differences, and slight ethnic heterogeneity was observed. To enable appropriate adjustments at no cost to users, databases of preferential amplification/hybridization for African Americans, Caucasians and Asians were constructed based on the Affymetrix GeneChip Human Mapping 100 K Set. The robustness of allele frequency estimation using these databases was validated by a pooled DNA experiment. This study provides a genome-wide investigation of preferential amplification/hybridization and offers guidance for the reliable use of the databases. Our results constitute an objective foundation for the theoretical development of preferential amplification/hybridization and provide important information for future pooled DNA analyses.
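The abstract does not state the adjustment formula, so the following is only a minimal sketch of one correction commonly used in the pooled DNA literature, not necessarily the authors' procedure. It assumes a per-SNP coefficient k, estimated as the mean ratio of allele A to allele B signal in individually genotyped heterozygotes, and a pooled estimate of the form p = h_A / (h_A + k * h_B). The function names and the 2-fold check are hypothetical and chosen for illustration.

```python
def estimate_coefficient(het_signal_a, het_signal_b):
    """Assumed estimator: mean A/B signal ratio across heterozygous individuals.

    With no preferential amplification/hybridization the ratio is 1,
    because each heterozygote carries one copy of each allele.
    """
    ratios = [a / b for a, b in zip(het_signal_a, het_signal_b)]
    return sum(ratios) / len(ratios)


def adjusted_allele_frequency(pool_signal_a, pool_signal_b, k=1.0):
    """Estimate the frequency of allele A in a pool, corrected by coefficient k.

    k = 1.0 reproduces the unadjusted estimate h_A / (h_A + h_B).
    """
    return pool_signal_a / (pool_signal_a + k * pool_signal_b)


def within_two_fold(k):
    """True if k indicates less than 2-fold amplification or suppression."""
    return 0.5 < k < 2.0


# Illustrative numbers only (not data from the study).
k = estimate_coefficient([2.0, 4.0], [1.0, 2.0])
unadjusted = adjusted_allele_frequency(2.0, 1.0)
adjusted = adjusted_allele_frequency(2.0, 1.0, k)
```

In this toy case allele A gives twice the signal of allele B in heterozygotes, so a pool whose raw signals suggest a frequency of about 0.67 is corrected to 0.5. A database of per-SNP coefficients, as described above, would supply k without requiring the user to genotype heterozygotes individually.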
