Intelligent Image Segmentation for Organic-Rich Shales Using Random Forest, Wavelet Transform, and Hessian Matrix

Scanning electron microscope (SEM) image can capture the distribution, topology, and morphology of microstructural constituents of geological materials. Segmentation of SEM image is needed to delineate/locate the various microstructural constituents. To locate locating kerogen/organic matter and pores in shale samples, we test an automated SEM-image segmentation workflow involving feature extraction followed by machine learning, as an alternative to threshold-based and object-based segmentation. For each pixel in the SEM image, 16 features are generated and then fed to a random forest classifier to determine the presence of the four shale components, namely: 1) pore/crack; 2) rock matrix including clay, calcite, and quartz; 3) pyrite; and 4) organic/kerogen components. With the help of feature extraction techniques such as wavelet transform and Hessian affine region detector, the proposed segmentation methodology can segment one 2058 pixel $\times2606$ pixel SEM image in approximately 30 s. The performance of the trained classifier, quantified in terms of overall F1 score, on the validation data set was higher than 0.9. The newly developed method is significantly robust in comparison to the popular histogram thresholding and object-based segmentation methods.