New algorithmic approaches to protein spot detection and pattern matching in two‐dimensional electrophoresis gel databases

Protein spot identification in two‐dimensional electrophoresis gels can be supported by the comparison of gel images accessible in different World Wide Web two‐dimensional electrophoresis (2‐DE) gel protein databases. The comparison may be performed either by visual cross‐matching between gel images or by automatic recognition of similar protein spot patterns. A prerequisite for the automatic point pattern matching approach is the detection of protein spots yielding the x(s),y(s) coordinates and integrated spot intensities i(s). For this purpose an algorithm is developed based on a combination of hierarchical watershed transformation and feature extraction methods. This approach reduces the strong over‐segmentation of spot regions normally produced by watershed transformation. Measures for the ellipticity and curvature are determined as features of spot regions. The resulting spot lists containing x(s),y(s),i(s)‐triplets are calculated for a source as well as for a target gel image accessible in 2‐DE gel protein databases. After spot detection a matching procedure is applied. Both the matching of a local pattern vs. a full 2‐DE gel image and the global matching between full images are discussed. Preset slope and length tolerances of pattern edges serve as matching criteria. The local matching algorithm relies on a data structure derived from the incremental Delaunay triangulation of a point set and a two‐step hashing technique. For the incremental construction of triangles the spot intensities are considered in decreasing order. The algorithm needs neither landmarks nor an a priori image alignment. A graphical user interface for spot detection and gel matching is written in the Java programming language for the Internet. The software package called CAROL (http://gelmatching.inf.fu‐berlin.de) is realized in a client‐server architecture.

[1]  E Fleck,et al.  The construction of the World Wide Web‐accessible myocardial two‐dimensional gel electrophoresis protein database “HEART‐2DPAGE”: A practical approach , 1996, Electrophoresis.

[2]  P F Lemkin,et al.  GELLAB: A computer system for 2D gel electrophoresis analysis. II. Pairing spots. , 1981, Computers and biomedical research, an international journal.

[3]  Fang-Hsuan Cheng,et al.  Fast algorithm for point pattern matching: Invariant to translations, rotations and scale changes , 1997, Pattern Recognit..

[4]  M. Dunn,et al.  Construction of HSC‐2DPAGE: A two‐dimensional gel electrophoresis database of heart proteins , 1997, Electrophoresis.

[5]  Olivier Golaz,et al.  Federated two‐dimensional electrophoresis database: A simple means of publishing two‐dimensional electrophoresis data , 1996, Electrophoresis.

[6]  A D Olson,et al.  Elsie 4: quantitative computer analysis of sets of two-dimensional gel electrophoretograms. , 1988, Analytical biochemistry.

[7]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Edwin R. Hancock,et al.  Matching delaunay graphs , 1997, Pattern Recognit..

[9]  Stefan Sander,et al.  Towards design and comparison of World Wide Web‐accessible myocardial two‐dimensional gel electrophoresis protein databases , 1997, Electrophoresis.

[10]  Gerald Weber,et al.  An Application of Point Pattern Matching in Astronautics , 1994, J. Symb. Comput..

[11]  J I Garrels,et al.  A Saccharomyces cerevisiae Internet protein resource now available , 1995, Electrophoresis.

[12]  Katherine B. Williams,et al.  A two‐dimensional electrophoresis database of human breast epithelial cell proteins , 1997, Electrophoresis.

[13]  Fang-Hsuan Cheng,et al.  Point pattern matching algorithm invariant to geometrical transformation and distortion , 1996, Pattern Recognit. Lett..

[14]  Hideo Ogawa Labeled point pattern matching by Delaunay triangulation and maximal cliques , 1986, Pattern Recognit..

[15]  P F Lemkin,et al.  The 2DWG meta‐database of two‐dimensional electrophoretic gel images on the Internet , 1997, Electrophoresis.

[16]  M M Skolnick,et al.  An approach to completely automatic comparison of two-dimensional electrophoresis gels. , 1982, Clinical chemistry.

[17]  Philippe Tarroux,et al.  HERMeS: A second generation approach to the automatic analysis of two‐dimensional electrophoresis gels. Part III: Spot list matching , 1987 .

[18]  Mark J. Miller,et al.  Computer analysis of two‐dimensional gels: Automatic matching , 1984 .

[19]  N G Anderson,et al.  The TYCHO system for computer analysis of two-dimensional gel electrophoresis patterns. , 1981, Clinical chemistry.

[20]  J. Garrels The QUEST system for quantitative analysis of two-dimensional gels. , 1989, The Journal of biological chemistry.

[21]  Eckart Fleck,et al.  Dilated cardiomyopathy‐associated proteins and their presentation in a WWW‐accessible two‐dimensional gel protein database , 1997, Electrophoresis.

[22]  E Fleck,et al.  Representation of amino acid sequences as two‐dimensional point patterns , 1997, Electrophoresis.

[23]  R D Appel,et al.  Melanie II – a third‐generation software package for analysis of two‐dimensional electrophoresis images: II. Algorithms , 1997, Electrophoresis.

[24]  P. Lemkin Comparing two‐dimensional electrophoretic gel images across the Internet , 1997, Electrophoresis.

[25]  R D Appel,et al.  Make2ddb: A simple package to set up a two‐dimensional electrophoresis database for the World Wide Web , 1997, Electrophoresis.

[26]  D Van Dyck,et al.  Computer analysis of two‐dimensional electrophoresis gels: A new segmentation and modeling algorithm , 1997, Electrophoresis.

[27]  P F Lemkin,et al.  GELLAB: a computer system for two-dimensional gel electrophoresis analysis. III. Multiple two-dimensional gel analysis. , 1981, Computers and biomedical research, an international journal.

[28]  Joachim Klose,et al.  Analysis of two‐dimensional electrophoretic protein patterns using a video camera and a computer. II. Adaptation of automatic spot detection to visual evaluation , 1987 .

[29]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..