Evaluation of placement techniques for DNA probe array layout

DNA probe arrays have emerged as a core genomic technology that enables cost-effective gene expression monitoring, mutation detection, single nucleotide polymorphism analysis and other genomic analyses. DNA chips are manufactured through a highly scalable process, Very Large-Scale Immobilized Polymer Synthesis (VL-SIPS), that combines photolithographic technologies adapted from the semiconductor industry with combinatorial chemistry. Commercially available DNA chips contain more than a half million probes and are expected to exceed one hundred million probes in the next generation. This paper is one of the first attempts to apply VLSI CAD methods to the problem of probe placement in DNA chips, where the main objective is to minimize total border cost (i.e., the number of nucleotide mismatches between adjacent sites). We make the following contributions. First, we propose several partitioning-based algorithms for DNA probe placement that improve solution quality by over 4% compared to best previously known methods. Second, we give a simple in-place probe reembedding algorithm with solution quality better than previous "chessboard" and batched greedy algorithms. Third, we experimentally evaluate scalability and suboptimality of existing and newly proposed probe placement algorithms. Interestingly, we find that DNA placement algorithms appear to have better suboptimality properties than those recently reported for VLSI placement algorithms.

[1]  S. P. Fodor,et al.  Light-directed, spatially addressable parallel chemical synthesis. , 1991, Science.

[2]  Andrew B. Kahng,et al.  Can recursive bisection alone produce routable, placements? , 2000, Proceedings 37th Design Automation Conference.

[3]  P. Pevzner,et al.  Gray code masks for sequencing by hybridization. , 1994, Genomics.

[4]  Leon Steinberg,et al.  The Backboard Wiring Problem: A Placement Algorithm , 1961 .

[5]  Jane W. Chan,et al.  Microarrays for the Neurosciences: An Essential Guide , 2004 .

[6]  Bryan D. Ackland,et al.  Physical Design Automation of Vlsi Systems , 1988 .

[7]  Hidekazu Terai,et al.  Automatic Placement Algorithms for High Packing density VLSI , 1983, 20th Design Automation Conference Proceedings.

[8]  Konrad Doll,et al.  Iterative placement improvement by network flow methods , 1994, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[9]  Jason Cong,et al.  Optimality, scalability and stability study of partitioning and placement algorithms , 2003, ISPD '03.

[10]  Pavel A Pevzner,et al.  Combinatorial algorithms for design of DNA arrays. , 2002, Advances in biochemical engineering/biotechnology.

[11]  Andrew B. Kahng,et al.  Geometric Embeddings for Faster and Better Multi-Way Netlist Partitioning , 1993, 30th ACM/IEEE Design Automation Conference.

[12]  Andrew B. Kahng,et al.  Engineering a scalable placement heuristic for DNA probe arrays , 2003, RECOMB '03.

[13]  David R. Karger,et al.  Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.

[14]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[15]  Charles M. Fiduccia,et al.  A linear-time heuristic for improving network partitions , 1988, 25 years of DAC.

[16]  Daniel H. Geschwind,et al.  Microarrays for the neurosciences : an essential guide , 2002 .

[17]  Andrew B. Kahng,et al.  Border Length Minimization in DNA Array Design , 2002, WABI.

[18]  Pinaki Mazumder,et al.  VLSI cell placement techniques , 1991, CSUR.

[19]  Andrew B. Kahng,et al.  Recent directions in netlist partitioning: a survey , 1995, Integr..

[20]  Majid Sarrafzadeh,et al.  Dragon2000: standard-cell placement tool for large industry circuits , 2000, IEEE/ACM International Conference on Computer Aided Design. ICCAD - 2000. IEEE/ACM Digest of Technical Papers (Cat. No.00CH37140).

[21]  Andrew B. Kahng,et al.  Quantified Suboptimality of VLSI Layout Heuristics , 1995, 32nd Design Automation Conference.

[22]  Andrew B. Kahng,et al.  Partitioning-based standard-cell global placement with an exact objective , 1997, ISPD '97.

[23]  J. Cong,et al.  Optimality and scalability study of existing placement algorithms , 2003, Proceedings of the ASP-DAC Asia and South Pacific Design Automation Conference, 2003..

[24]  Sheldon B. Akers On the Use of the Linear Assignment Algorithm in Module Placement , 1981, 18th Design Automation Conference.