Image Analysis and Length Estimation of Biomolecules Using AFM

There are many examples of problems in pattern analysis for which it is often possible to obtain systematic characterizations, if in addition a small number of useful features or parameters of the image are known a priori or can be estimated reasonably well. Often, the relevant features of a particular pattern analysis problem are easy to enumerate, as when statistical structures of the patterns are well understood from the knowledge of the domain. We study a problem from molecular image analysis, where such a domain-dependent understanding may be lacking to some degree and the features must be inferred via machine-learning techniques. In this paper, we propose a rigorous, fully automated technique for this problem. We are motivated by an application of atomic force microscopy (AFM) image processing needed to solve a central problem in molecular biology, aimed at obtaining the complete transcription profile of a single cell, a snapshot that shows which genes are being expressed and to what degree. Reed et al. (“Single molecule transcription profiling with AFM,” Nanotechnology, vol. 18, no. 4, 2007) showed that the transcription profiling problem reduces to making high-precision measurements of biomolecule backbone lengths, correct to within 20-25 bp (6-7.5 nm). Here, we present an image processing and length estimation pipeline using AFM that comes close to achieving these measurement tolerances. In particular, we develop a biased length estimator on trained coefficients of a simple linear regression model, biweighted by a Beaton-Tukey function, whose feature universe is constrained by James-Stein shrinkage to avoid overfitting. In terms of extensibility and addressing the model selection problem, this formulation subsumes the models we studied.

[1]  John T. Woodward,et al.  Removing drift from scanning probe microscope images of periodic samples , 1998 .

[2]  Luca Benini,et al.  A Robust Algorithm for Automated Analysis of DNA Molecules in AFM Images , 2004 .

[3]  Luca Benini,et al.  Automated DNA fragments recognition and sizing through AFM image processing , 2005, IEEE Transactions on Information Technology in Biomedicine.

[4]  I N Bankman,et al.  Solid-state DNA sizing by atomic force microscopy. , 1998, Analytical chemistry.

[5]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[6]  J. Gimzewski,et al.  Single molecule transcription profiling with AFM , 2007, Nanotechnology.

[7]  Ching Y. Suen,et al.  Thinning Methodologies - A Comprehensive Survey , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  V. Kalmykov STRUCTURAL ANALYSIS OF CONTOURS AS THE SEQUENCES OF THE DIGITAL STRAIGHT SEGMENTS AND OF THE DIGITAL CURVE ARCS , 2007 .

[9]  R. Reeves,et al.  Length determination of DNA fragments in atomic force microscope images , 1997, Proceedings of International Conference on Image Processing.

[10]  Hayit Greenspan,et al.  Finding Pictures of Objects in Large Collections of Images , 1996, Object Representation in Computer Vision.

[11]  Yuechao Wang,et al.  AFM operating-drift detection and analyses based on automated sequential image processing , 2007, 2007 7th IEEE Conference on Nanotechnology (IEEE NANO).

[12]  J. Villarrubia Algorithms for Scanned Probe Microscope Image Simulation, Surface Reconstruction, and Tip Estimation , 1997, Journal of research of the National Institute of Standards and Technology.

[13]  Reinhard Klette,et al.  A Comparative Evaluation of Length Estimators of Digital Curves , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Marcel Worring,et al.  Digitized Circular Arcs: Characterization and Parameter Estimation , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  J. Tukey,et al.  The Fitting of Power Series, Meaning Polynomials, Illustrated on Band-Spectroscopic Data , 1974 .

[16]  Paul Marjoram,et al.  Markov chain Monte Carlo without likelihoods , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Gregory A. Dahlen,et al.  Tip characterization and surface reconstruction of complex structures with critical dimension atomic force microscopy , 2005 .

[18]  A. Carasso Linear and Nonlinear Image Deblurring: A Documented Study , 1999 .

[19]  Hanchuan Peng,et al.  Bioimage informatics: a new area of engineering biology , 2008, Bioinform..

[20]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Muni S. Srivastava,et al.  Regression Analysis: Theory, Methods, and Applications , 1991 .

[22]  Aristides A. G. Requicha,et al.  Towards automatic nanomanipulation: drift compensation in scanning probe microscopes , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[23]  Bud Mishra,et al.  Identifying individual DNA species in a complex mixture by precisely measuring the spacing between nicking restriction enzymes with atomic force microscope , 2012, Journal of The Royal Society Interface.

[24]  M. Newton Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis , 2008 .

[25]  Z. Tomori,et al.  Interactive measurement and characterization of DNA molecules by analysis of AFM images , 2005, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[26]  A. Smeulders,et al.  Discrete straight line segments: parameters, primitives and properties , 1991 .

[27]  Wenhao Huang,et al.  Application of a novel nonperiodic grating in scanning probe microscopy drift measurement. , 2007, The Review of scientific instruments.

[28]  Luca Benini,et al.  Automated DNA sizing in atomic force microscope images , 2002, Proceedings IEEE International Symposium on Biomedical Imaging.

[29]  J. Villarrubia Morphological estimation of tip geometry for scanned probe microscopy , 1994 .

[30]  T. S. Spisz,et al.  Automated sizing of DNA fragments in atomic force microscope images , 1998, Medical and Biological Engineering and Computing.

[31]  Rajjan Shinghal,et al.  Skeletonizing Binary Patterns on the Homogeneous Multiprocessor , 1989, Int. J. Pattern Recognit. Artif. Intell..

[32]  José M. N. Leitão,et al.  Unsupervised contour representation and estimation using B-splines and a minimum description length criterion , 2000, IEEE Trans. Image Process..

[33]  Arnold W. M. Smeulders,et al.  Length estimators for digitized contours , 1987, Comput. Vis. Graph. Image Process..

[34]  S. Codeluppi,et al.  Accurate length determination of DNA molecules visualized by atomic force microscopy: evidence for a partial B- to A-form transition on mica. , 2001, Ultramicroscopy.

[35]  Roberto Marcondes Cesar Junior,et al.  Towards effective planar shape representation with multiscale digital curvature analysis based on signal processing techniques , 1996, Pattern Recognit..

[36]  D. Keller Reconstruction of STM and AFM images distorted by finite-size tips , 1991 .

[37]  Luca Benini,et al.  Automatic intrinsic DNA curvature computation from AFM images , 2005, IEEE Transactions on Biomedical Engineering.

[38]  Alfred S. Carasso Error bounds in nonsmooth image deblurring , 1997 .

[39]  Stefano Piccarolo,et al.  Some experimental issues of AFM tip blind estimation: the effect of noise and resolution , 2006 .

[40]  N. Ben-Yosef,et al.  Line Thinning Algorithm , 1983, Other Conferences.

[41]  C. Stein,et al.  Estimation with Quadratic Loss , 1992 .

[42]  Jacques Barbet,et al.  Accuracy of AFM measurements of the contour length of DNA fragments adsorbed on mica in air and in aqueous buffer. , 2002, Ultramicroscopy.

[43]  P. West,et al.  Tip dilation and AFM capabilities in the characterization of nanoparticles , 2007 .

[44]  Marcel Worring,et al.  Measurement and characterization in vision geometry , 1997, Optics & Photonics.

[45]  G Zuccheri,et al.  Mapping the intrinsic curvature and flexibility along the DNA chain , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Reinhard Klette,et al.  Length estimation of digital curves , 1999, Optics & Photonics.

[47]  Lawrence R. Rabiner,et al.  A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition , 1976 .

[48]  Joseph F. Murray,et al.  Supervised Learning of Image Restoration with Convolutional Networks , 2007, 2007 IEEE 11th International Conference on Computer Vision.