Direct Inference of SNP Heterozygosity Rates and Resolution of LOH Detection

Single nucleotide polymorphisms (SNPs) have been increasingly utilized to investigate somatic genetic abnormalities in premalignancy and cancer. LOH is a common alteration observed during cancer development, and SNP assays have been used to identify LOH at specific chromosomal regions. The design of such studies requires consideration of the resolution for detecting LOH throughout the genome and identification of the number and location of SNPs required to detect genetic alterations in specific genomic regions. Our study evaluated SNP distribution patterns and used probability models, Monte Carlo simulation, and real human subject genotype data to investigate the relationships between the number of SNPs, SNP HET rates, and the sensitivity (resolution) for detecting LOH. We report that variances of SNP heterozygosity rate in dbSNP are high for a large proportion of SNPs. Two statistical methods proposed for directly inferring SNP heterozygosity rates require much smaller sample sizes (intermediate sizes) and are feasible for practical use in SNP selection or verification. Using HapMap data, we showed that a region of LOH greater than 200 kb can be reliably detected, with losses smaller than 50 kb having a substantially lower detection probability when using all SNPs currently in the HapMap database. Higher densities of SNPs may exist in certain local chromosomal regions that provide some opportunities for reliably detecting LOH of segment sizes smaller than 50 kb. These results suggest that the interpretation of the results from genome-wide scans for LOH using commercial arrays need to consider the relationships among inter-SNP distance, detection probability, and sample size for a specific study. New experimental designs for LOH studies would also benefit from considering the power of detection and sample sizes required to accomplish the proposed aims.

[1]  J. Witte,et al.  Linkage disequilibrium and allele-frequency distributions for 114 single-nucleotide polymorphisms in five populations. , 2000, American journal of human genetics.

[2]  Eric S. Lander,et al.  Loss-of-heterozygosity analysis of small-cell lung carcinomas using single-nucleotide polymorphism arrays , 2000, Nature Biotechnology.

[3]  A. Richardson,et al.  Genome-Wide Analysis for Loss of Heterozygosity in Primary and Recurrent Phyllodes Tumor and Fibroadenoma of Breast using Single Nucleotide Polymorphism Arrays , 2006, Breast Cancer Research and Treatment.

[4]  J. Landers,et al.  Using high-throughput SNP technologies to study cancer , 2006, Oncogene.

[5]  M. Meyerson,et al.  Homozygous deletions and chromosome amplifications in human lung carcinomas revealed by single nucleotide polymorphism array analysis. , 2005, Cancer research.

[6]  Nicholas G Martin,et al.  Estimation of the Rate of SNP Genotyping Errors From DNA Extracted From Different Tissues , 2005, Twin Research and Human Genetics.

[7]  S. Shete,et al.  On estimating the heterozygosity and polymorphism information content value. , 2000, Theoretical population biology.

[8]  Leonid Kruglyak,et al.  The use of a genetic map of biallelic markers in linkage studies , 1997, Nature Genetics.

[9]  Keith W. Jones,et al.  Whole genome DNA copy number changes identified by high density oligonucleotide arrays , 2004, Human Genomics.

[10]  Deborah A. Nickerson,et al.  Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans , 2003, Nature Genetics.

[11]  C. Maley,et al.  Genetic Mechanisms of TP53 Loss of Heterozygosity in Barrett's Esophagus: Implications for Biomarker Validation , 2006, Cancer Epidemiology Biomarkers & Prevention.

[12]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[13]  N. Rajewsky,et al.  Natural selection on human microRNA binding sites inferred from SNP data , 2006, Nature Genetics.

[14]  T. P. Dryja,et al.  Expression of recessive alleles by chromosomal mechanisms in retinoblastoma , 1983, Nature.

[15]  J. Gregg,et al.  Genomic and functional profiling of duplicated chromosome 15 cell lines reveal regulatory alterations in UBE3A-associated ubiquitin-proteasome pathway processes. , 2006, Human molecular genetics.

[16]  Cheng Li,et al.  Integration of global SNP-based mapping and expression arrays reveals key regions, mechanisms, and genes important in the pathogenesis of multiple myeloma. , 2006, Blood.

[17]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[18]  L. Chasin,et al.  Comparison of multiple vertebrate genomes reveals the birth and evolution of human exons , 2006, Proceedings of the National Academy of Sciences.

[19]  C. Molony,et al.  Genetic analysis of genome-wide variation in human gene expression , 2004, Nature.

[20]  Andrea Ferreira-Gonzalez,et al.  Genome-wide detection of LOH in prostate cancer using human SNP microarray technology. , 2003, Genomics.

[21]  J. Todd,et al.  The usefulness of different density SNP maps for disease association studies of common variants. , 2003, Human Molecular Genetics.

[22]  J. Chan,et al.  Ethnic differences in the linkage disequilibrium and distribution of single-nucleotide polymorphisms in 35 candidate genes for cardiovascular diseases. , 2004, Genomics.

[23]  Rebecca A Betensky,et al.  Feature‐Specific Penalized Latent Class Analysis for Genomic Data , 2006, Biometrics.

[24]  K. Gunderson,et al.  High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. , 2006, Genome research.

[25]  A D Roses,et al.  Complex disease-associated pharmacogenetics: drug efficacy, drug safety, and confirmation of a pathogenetic hypothesis (Alzheimer's disease) , 2007, The Pharmacogenomics Journal.

[26]  N. Shen,et al.  Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis , 1999, Nature Genetics.

[27]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[28]  B S Weir,et al.  Maximum-likelihood estimation of gene location by linkage disequilibrium. , 1994, American journal of human genetics.

[29]  Christopher B. Miller,et al.  Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia , 2007, Nature.

[30]  M. Cargill Characterization of single-nucleotide polymorphisms in coding regions of human genes , 1999, Nature Genetics.

[31]  Jens Timmer,et al.  Using High-density Snp Arrays Genome-wide Analysis of Dna Copy Number Changes and Loh in Cll , 2022 .

[32]  D. Clayton,et al.  A genome-wide association study of nonsynonymous SNPs identifies a type 1 diabetes locus in the interferon-induced helicase (IFIH1) region , 2006, Nature Genetics.

[33]  E. Rappaport,et al.  Region-specific detection of neuroblastoma loss of heterozygosity at multiple loci simultaneously using a SNP-based tag-array platform. , 2005, Genome research.

[34]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[35]  Sridhar Ramaswamy,et al.  Loss of Heterozygosity and Its Correlation with Expression Profiles in Subclasses of Invasive Breast Cancers , 2004, Cancer Research.