Optimization of primer design for the detection of variable genomic lesions in cancer

Primer approximation multiplex PCR (PAMP) is a new experimental protocol for efficiently assaying structural variation in genomes. PAMP is particularly suited to cancer genomes where the precise breakpoints of alterations such as deletions or translocations vary between patients. The design of PCR primer sets for PAMP is challenging because a large number of primer pairs are required to detect alterations in the hundreds of kilobases range that can occur in cancer. These sets of primers must achieve high coverage of the region of interest, while avoiding primer dimers and satisfying the physico-chemical constraints of good PCR primers. We describe a natural formulation of these constraints as a combinatorial optimization problem. We show that the PAMP primer design problem is NP-hard, and design algorithms based on simulated annealing and integer programming, that provide good solutions to this problem in practice. The algorithms are applied to a test region around the known CDKN2A deletion, which show excellent results even in a 1:49 mixture of mutated:wild-type cells. We use these test results to help set design parameters for larger problems. We can achieve near-optimal designs for regions close to 1 Mb.

[1]  D. Lipson Optimization Problems in Design of Oligonucleotides for Hybridization based Methods , 2003 .

[2]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[3]  D. Pinkel,et al.  Array comparative genomic hybridization and its applications in cancer , 2005, Nature Genetics.

[4]  Benjamin J. Raphael,et al.  Decoding the fine-scale structure of a breast cancer genome and transcriptome. , 2006, Genome research.

[5]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[6]  Jun Yokota,et al.  Molecular processes of chromosome 9p21 deletions in human cancers , 2003, Oncogene.

[7]  P. Pollock,et al.  CDKN2A/p16 is inactivated in most melanoma cell lines. , 1997, Cancer research.

[8]  Simon Kasif,et al.  Computational tradeoffs in multiplex PCR assay design for SNP genotyping , 2005, BMC Genomics.

[9]  Doi,et al.  Greedy Algorithms for Finding a Small Set of Primers Satisfying Cover and Length Resolution Conditions in PCR Experiments. , 1997, Genome informatics. Workshop on Genome Informatics.

[10]  G. Peters,et al.  Regulation of the INK4b–ARF–INK4a tumour suppressor locus: all for one or one for all , 2006, Nature Reviews Molecular Cell Biology.

[11]  Yu-Tseung Liu,et al.  A Novel Approach for Determining Cancer Genomic Breakpoints in the Presence of Normal DNA , 2007, PloS one.

[12]  J. Butler,et al.  AutoDimer: a screening tool for primer-dimer and hairpin structures. , 2004, BioTechniques.

[13]  Reidar Andreson,et al.  GENOMEMASKER package for designing unique genomic PCR primers , 2006, BMC Bioinformatics.

[14]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[15]  Jean-Marc Steyaert,et al.  Selecting Optimal Oligonucleotide Primers for Multiplex PCR , 1997, ISMB.

[16]  D. Sidransky,et al.  p16(MTS-1/CDKN2/INK4a) in cancer progression. , 2001, Experimental cell research.

[17]  Scott E. Kern,et al.  DPC4, A Candidate Tumor Suppressor Gene at Human Chromosome 18q21.1 , 1996, Science.

[18]  Kevin L. Gunderson,et al.  Highly parallel genomic assays , 2006, Nature Reviews Genetics.

[19]  G. Mohapatra,et al.  Detection of p16 Gene Deletions in Gliomas: A Comparison of Fluorescence in Situ Hybridization (FISH) Versus Quantitative PCR , 1997, Journal of neuropathology and experimental neurology.

[20]  Thomas Kämpke,et al.  Efficient primer design algorithms , 2001, Bioinform..

[21]  Doi,et al.  A Greedy Algorithm for Minimizing the Number of Primers in Multiple PCR Experiments. , 1999, Genome informatics. Workshop on Genome Informatics.

[22]  Minjie Luo,et al.  A genotyping system capable of simultaneously analyzing >1000 single nucleotide polymorphisms in a haploid genome. , 2005, Genome research.

[23]  J. Tchinda,et al.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. , 2006, Science.

[24]  T. Efferth,et al.  Homozygous deletions of CDKN2A caused by alternative mechanisms in various human cancer cell lines , 2005, Genes, chromosomes & cancer.

[25]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.