Robust DNA microarray image analysis

Abstract.DNA microarrays are an increasingly important tool that allow biologists to gain insight into the function of thousands of genes in a single experiment. Common to all array-based approaches is the necessity to analyze digital images of the scanned DNA array. The ultimate image analysis goal is to automatically quantify every individual array element (spot), providing information about the amount of DNA bound to a spot. Irrespective of the quantification strategy, the preliminary information to extract about a spot includes the mapping between its location in the digital image and its possibly distorted position in the spot array (gridding). We present a gridding approach divided into a spot-amplification step (matched filter), a rotation estimation step (Radon transform), and a grid spanning step. Quantification of the spots is performed by robustly fitting of a parametric model to pixel intensities with the help of M-estimators. The main advantage of parametric spot fitting is its ability to cope with overlapping spots. If the goodness-of-fit is too bad, a semiparametric spot fitting is employed. We show that our approach is superior to simple quantification strategies such as averaging of the pixel intensities. The system was extensively tested on 1740 images resulting from two DNA libraries.

[1]  Hanspeter Herzel,et al.  Extracting information from cDNA arrays. , 2001, Chaos.

[2]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jeremy Buhler,et al.  Dapple: Improved Techniques for Finding Spots on DNA Microarrays , 2000 .

[4]  Soheil Shams,et al.  Image Segmentation and Quality Control Measures in Microarray Image Analysis , 2001 .

[5]  Hans Lehrach,et al.  An automated approach to generating expressed sequence catalogues , 1993, Nature.

[6]  Jan Sijbers,et al.  Automatic segmentation and modelling of two-dimensional electrophoresis gels , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[7]  B. Ripley,et al.  Robust Statistics , 2018, Wiley Series in Probability and Statistics.

[8]  Soheil Shams,et al.  Automation in Microarray Image Analysis with AutoGene , 2000 .

[9]  P. J. Huber Robust Statistical Procedures , 1977 .

[10]  Rolf Adams,et al.  Seeded Region Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[12]  Thomas Kämpke,et al.  Spotting approaches for biochip arrays , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[13]  Arnold W. M. Smeulders,et al.  Detection and Characterization of Isolated and Overlapping Spots , 1998, Comput. Vis. Image Underst..

[14]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[15]  R. Stoughton,et al.  Experimental annotation of the human genome using microarray technology , 2001, Nature.

[16]  R Herwig,et al.  Comparative gene expression profiling by oligonucleotide fingerprinting. , 1998, Nucleic acids research.

[17]  Edward R. Dougherty,et al.  Probability and Statistics for the Engineering, Computing and Physical Sciences , 1990 .

[18]  William H. Press,et al.  Numerical recipes in C , 2002 .

[19]  Y. Chen,et al.  Ratio-based decisions and the quantitative analysis of cDNA microarray images. , 1997, Journal of biomedical optics.

[20]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[21]  S. Nayar,et al.  Early Visual Learning , 1996 .

[22]  Azriel Rosenfeld,et al.  A Pyramid Framework for Early Vision , 1994 .

[23]  K. Schittkowski,et al.  NONLINEAR PROGRAMMING , 2022 .

[24]  Hans Lehrach,et al.  Automated image analysis for array hybridization experiments , 2001, Bioinform..

[25]  Azriel Rosenfeld,et al.  A Pyramid Framework for Early Vision: Multiresolutional Computer Vision , 1993 .

[26]  Zhengyou Zhang,et al.  Parameter estimation techniques: a tutorial with application to conic fitting , 1997, Image Vis. Comput..

[27]  A. Harris Genes VI , 1997 .

[28]  Tracy L. Bergemann,et al.  Statistical issues in signal extraction from microarrays , 2001, SPIE BiOS.

[29]  Terence P. Speed,et al.  Comparison of Methods for Image Analysis on cDNA Microarray Data , 2002 .

[30]  Horst Bischof,et al.  Automatic Grid Fitting for Genetic Spot Array Images Containing Guide Spots , 1999, CAIP.

[31]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[32]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.

[33]  R. Maronna Robust $M$-Estimators of Multivariate Location and Scatter , 1976 .

[34]  M. Morley,et al.  Making and reading microarrays , 1999, Nature Genetics.

[35]  Karsten Hartelius,et al.  Analysis of irregularly distributed points , 1996 .

[36]  Zheng Zheng Zhou,et al.  GLEAMS: a novel approach to high-throughput genetic microarray image capture and analysis , 2001, SPIE BiOS.

[37]  Howard Wainer,et al.  Robust Regression & Outlier Detection , 1988 .