An Ant Colony Optimization Algorithm for DNA Copy Number Analysis in Array CGH Data

In this paper, the self organization properties of ant colonies are employed to tackle the problem of DNA copy number analysis in array CGH data, which can reveal chromosomal aberrations in the genomic DNA. These amplifications and deletions may be crucial events in the development and progression of cancer and other diseases. Accurately identifying the recurrent aberration at a particular genome location is important to find the possibly damaged genes. Unfortunately, it is difficult to exactly detect the boundaries of aberration from array CGH data with low signal-to-noise ratio. The presented ant colony optimization algorithm represents the problem as a directed graph such that the objective of the original problem becomes to find the shortest path on the graph under the problem-specific constraints. A number of artificial ants are distributed on the graph and communicate with one another through the pheromone trails which are a form of the long-term memory guiding the future exploration of the graph. The important properties of the proposed method are thoroughly investigated. The performance of the proposed method as compared to those of the state-of-the-art methods is very promising.

[1]  Franck Picard,et al.  A statistical approach for array CGH data analysis , 2005, BMC Bioinformatics.

[2]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[3]  Marco Dorigo,et al.  Ant Colony Optimization and Stochastic Gradient Descent , 2002, Artificial Life.

[4]  Melvin Alexander Applied Statistics and Probability for Engineers , 1995 .

[5]  Elena Marchiori,et al.  Breakpoint identification and smoothing of array comparative genomic hybridization data , 2004, Bioinform..

[6]  Ajay N. Jain,et al.  Hidden Markov models approach to the analysis of array CGH data , 2004 .

[7]  Peter J. Park,et al.  Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data , 2005, Bioinform..

[8]  D. Pinkel,et al.  Array comparative genomic hybridization and its applications in cancer , 2005, Nature Genetics.

[9]  Paul H. C. Eilers,et al.  Quantile smoothing of array CGH data , 2005, Bioinform..

[10]  Christian Blum,et al.  Ant colony optimization: Introduction and recent trends , 2005 .

[11]  Emmanuel Barillot,et al.  Analysis of array CGH data: from signal ratio to gain and loss of DNA regions , 2004, Bioinform..

[12]  Kevin P. Murphy,et al.  Integrating copy number polymorphisms into array CGH analysis using a robust HMM , 2006, ISMB.

[13]  M. McCarthy The statistical approach , 1959 .

[14]  Ji Zhu,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm364 Data and text mining Analysis of array CGH data for cancer studies using , 2022 .

[15]  Jane Fridlyand,et al.  Bioinformatics Original Paper a Comparison Study: Applying Segmentation to Array Cgh Data for Downstream Analyses , 2022 .

[16]  Douglas Grove,et al.  Denoising array-based comparative genomic hybridization data using wavelets. , 2005, Biostatistics.

[17]  R. Tibshirani,et al.  A method for calling gains and losses in array CGH data. , 2005, Biostatistics.