Comprehensive study of DNA copy number analysis using Sigma filter

DNA copy number aberrations are characteristic of many genomic diseases including cancer. Microarray-based Comparative Genomic Hybridization (aCGH) is a recently developed high-throughput technique used to detect DNA copy number (DCN) aberrations. Unfortunately, the observed copy number changes are corrupted by noise, making aberration boundaries hard to detect. In the first part of this paper, we propose a novel technique to analyze DCN aberrations based on the Sigma filter algorithm. We establish its superior performance for denoising DCN data and low computational complexity as compared to previous techniques. We present a comparison study between our approach and other smoothing and statistical approaches, the wavelet-based, LookA-head, CGH segmentation and HMM. We provide examples using real data to illustrate the performance of the algorithms. In the second part of this paper, we extend our algorithm by considering the effect of nonuniform physical distance between the probes in the aCGH data. Finally, we provide simulated and real data examples to study this effect.

[1]  Ahmed H. Tewfik,et al.  Multiscale sigma filter and active contour for image segmentation , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[2]  Yuhang Wang,et al.  A novel stationary wavelet denoising algorithm for array-based DNA Copy Number data , 2007, Int. J. Bioinform. Res. Appl..

[3]  Franck Picard,et al.  A statistical approach for array CGH data analysis , 2005, BMC Bioinformatics.

[4]  G. Churchill Fundamentals of experimental design for cDNA microarrays , 2002, Nature Genetics.

[5]  Jong-Sen Lee,et al.  Digital image smoothing and the sigma filter , 1983, Comput. Vis. Graph. Image Process..

[6]  Ajay N. Jain,et al.  Assembly of microarrays for genome-wide measurement of DNA copy number , 2001, Nature Genetics.

[7]  S. Mallat A wavelet tour of signal processing , 1998 .

[8]  Antonio Ortega,et al.  Wavelet Footprints and Sparse Bayesian Learning for DNA Copy Number Change Analysis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[9]  Ajay N. Jain,et al.  Hidden Markov models approach to the analysis of array CGH data , 2004 .

[10]  Jane Fridlyand,et al.  Bioinformatics Original Paper a Comparison Study: Applying Segmentation to Array Cgh Data for Downstream Analyses , 2022 .

[11]  Yonatan Aumann,et al.  Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis , 2005, RECOMB.

[12]  R. Tibshirani,et al.  A method for calling gains and losses in array CGH data. , 2005, Biostatistics.

[13]  Yonatan Aumann,et al.  Efficient calculation of interval scores for DNA copy number data analysis. , 2006 .