dChipSNP: significance curve and clustering of SNP-array-based loss-of-heterozygosity data

MOTIVATION Oligonucleotide microarrays allow genotyping of thousands of single-nucleotide polymorphisms (SNPs) in parallel. Recently, this technology has been applied to loss-of-heterozygosity (LOH) analysis of paired normal and tumor samples. However, methods and software for analyzing such data are not fully developed. RESULT Here, we report automated methods for pooling SNP array replicates to make LOH calls, visualizing SNP and LOH data along chromosomes in the context of genes and cytobands, making statistical inference to identify shared LOH regions, clustering samples based on LOH profiles and correlating the clustering results to clinical variables. Application of these methods to prostate and breast cancer datasets generates biologically important results. AVAILABILITY The software module dChipSNP implementing these methods is available at http://biosun1.harvard.edu/complab/dchip/snp/ SUPPLEMENTARY INFORMATION The breast cancer data are provided by Andrea L. Richardson, Zhigang C. Wang and James D. Iglehart.

[1]  Cheng Li,et al.  Genome-wide loss of heterozygosity analysis from laser capture microdissected prostate cancer using single nucleotide polymorphic allele (SNP) arrays and a novel bioinformatics platform dChipSNP. , 2003, Cancer research.

[2]  R. Durbin,et al.  Biological sequence analysis: Background on probability , 1998 .

[3]  D J Lockhart,et al.  Genome-wide detection of allelic imbalance using human SNPs and high-density DNA arrays. , 2000, Genome research.

[4]  James L. Winkler,et al.  Accessing Genetic Information with High-Density DNA Arrays , 1996, Science.

[5]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[6]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[7]  A Chakravarti,et al.  High-throughput variation detection and genotyping using microarrays. , 2001, Genome research.

[8]  Sridhar Ramaswamy,et al.  Loss of Heterozygosity and Its Correlation with Expression Profiles in Subclasses of Invasive Breast Cancers , 2004, Cancer Research.

[9]  E. Lander,et al.  Characterization of single-nucleotide polymorphisms in coding regions of human genes , 1999 .

[10]  N. Shen,et al.  Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis , 1999, Nature Genetics.

[11]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Yogendra P. Chaubey Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[14]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[15]  S. P. Fodor,et al.  Determination of ancestral alleles for human single-nucleotide polymorphisms using high-density oligonucleotide arrays , 1999, Nature Genetics.

[16]  Eric S. Lander,et al.  Loss-of-heterozygosity analysis of small-cell lung carcinomas using single-nucleotide polymorphism arrays , 2000, Nature Biotechnology.

[17]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.