Automatic recognition of complete palynomorphs in digital images

Images of dispersed kerogen preparation are analysed in order to detect palynomorphs of elliptical/spherical shape. This process consists of three automatic stages. Firstly, the background of the image is segmented from the foreground. Secondly the foreground particles are segmented into individual regions. Finally a trained classifier is used to label a region as either containing a palynomorph or containing other material. Ten classifiers were trained and then tested using a ten times tenfold cross-validation. Typically the number of regions in the image containing other material exceeds by far the number of regions with palynomorphs. Hence the problem of imbalanced classes was addressed. Training data was sampled ten different times maintaining a balanced class distribution. Thus the accuracy for each classifier was assessed on 1,000 testing sets. The logistic classifier was chosen and a certainty threshold was selected by ROC curve analysis. The final automatic recognition has accuracy of 88%, sensitivity of 87% and specificity of 88%.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Scott J. Hill,et al.  Outline extraction of microfossils in reflected light images , 1988 .

[3]  P. A. Swaby,et al.  VIDES: an expert system for visually identifying microfossils , 1992, IEEE Expert.

[4]  Ian Witten,et al.  Data Mining , 2000 .

[5]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[6]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[7]  W R Evitt,et al.  A DISCUSSION AND PROPOSALS CONCERNING FOSSIL DINOFLAGELLATES, HYSTRICHOSPHERES, AND ACRITARCHS, I. , 1963, Proceedings of the National Academy of Sciences of the United States of America.

[8]  J. Andrew Ware,et al.  Determining the saliency of feature measurements obtained from images of sedimentary organic matter for use in its classification , 2006, Comput. Geosci..

[9]  Ludmila I. Kuncheva,et al.  Background Segmentation in Microscopy Images , 2008, VISAPP.

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[11]  R. Barandelaa,et al.  Strategies for learning in class imbalance problems , 2003, Pattern Recognit..

[12]  Ludmila I. Kuncheva,et al.  Stability of Kerogen Classification with Regard to Image Segmentation , 2009 .

[13]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[14]  W R Evitt,et al.  A DISCUSSION AND PROPOSALS CONCERNING FOSSIL DINOFLAGELLATES, HYSTRICHOSPHERES, AND ACRITARCHS, II. , 1963, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Robert A. McLaughlin,et al.  Randomized Hough Transform: Improved ellipse detection with comparison , 1998, Pattern Recognit. Lett..

[16]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[17]  Richard J. Howarth,et al.  The application of expert systems to the identification and use of microfossils in the petroleum industry , 1994 .

[18]  Trevor Hastie,et al.  Additive Logistic Regression : a Statistical , 1998 .

[19]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[20]  David G. Stork,et al.  Pattern Classification , 1973 .

[21]  Jonathan Corcoran,et al.  The semi-automated classification of sedimentary organic matter in palynological preparations , 2005, Comput. Geosci..

[22]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[23]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[24]  Ludmila I. Kuncheva,et al.  Automated Kerogen Classification in Microscope Images of Dispersed Kerogen Preparation , 2008 .

[25]  Ludmila I. Kuncheva,et al.  Object segmentation within microscope images of palynofacies , 2008, Comput. Geosci..

[26]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[27]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[28]  W. R. Riedel,et al.  IDENTIFY: a Prolog program to help identify fossils , 1989 .

[29]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[30]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[31]  Ludmila I. Kuncheva,et al.  An Evaluation Measure of Image Segmentation Based on Object Centres , 2006, ICIAR.