Trade-Off Analysis Between Parallelism and Accuracy of SLIC on Apache Spark

In this paper, we present a parallel algorithm for SLIC on Apache Spark, which we call PSLIC-on-Spark. To this purpose, we have extended the original SLIC algorithm to use the operations in Apache Spark, supporting its parallel processing on multiple executors in the Apache Spark cluster. Then, we analyze the trade-off relationship of PSLIC-on-Spark between its processing speed and accuracy due to partitioning of the original image data sets. Especially, we identify two limitations in PSLIC-on-Spark, which degrade the accuracy of the original SLIC. Through experiments, we verify the trade-off relationship. Specifically, we show that PSLIC-on-Spark using 8 CPU cores reduces the processing time of SLIC by 2. 24∼2.93 times while it reduces the boundary recall (BR) of SLIC by 1. 54∼6.32 % and increases under-segmentation error (UE) by 1. 79∼6.2 %. In contrast, PSLIC-on-Spark using 2 CPU cores reduces the processing time of SLIC by 1.38∼1.45 times while it reduces the BR of SLIC by 0. 28∼1.5 %, and increases UE by 0. 25∼1.77 %. We also verify the effectiveness of PSLIC-on-Spark to deal with a large-scale image by showing that the processing speed of PSLIC-on-Spark becomes much more efficient as the image size becomes large. Specifically, compared to the original SLIC, the proposed SLIC-on-Spark reduces its processing time by 2.23 times for the image of 480×320 pixels and by 5.59 times for the image of 2002×1335 pixels, respectively

[1]  Hwee Pink Tan,et al.  Mobile big data analytics using deep learning and apache spark , 2016, IEEE Network.

[2]  Ian Reid,et al.  gSLIC: a real-time implementation of SLIC superpixel segmentation , 2011 .

[3]  Hui Wang,et al.  Adaptive Nonlocal Random Walks for Image Superpixel Segmentation , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Xiao Ma,et al.  Superpixel segmentation: A benchmark , 2017, Signal Process. Image Commun..

[5]  Julien Michel,et al.  Scaling Up SLIC Superpixels Using a Tile-Based Approach , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Hemerson Pistori,et al.  Image Segmentation and Classification with SLIC Superpixel and Convolutional Neural Network in Forest Context , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.

[7]  He Ma,et al.  Breast Cancer Classification with Ultrasound Images Based on SLIC , 2019, ArXiv.

[8]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Atanas Gotchev,et al.  Fast and Accurate Depth Estimation From Sparse Light Fields , 2018, IEEE Transactions on Image Processing.

[10]  Peer Neubert,et al.  Superpixel Benchmark and Comparison , 2012 .

[11]  Adam Van Etten,et al.  Satellite Imagery Multiscale Rapid Detection with Windowed Networks , 2018, WACV.

[12]  Min Yang,et al.  Extracting Centerlines From Dual-Line Roads Using Superpixel Segmentation , 2019, IEEE Access.

[13]  Jian Yang,et al.  Image Gradient-based Fast Superpixel Segmentation Algorithm for PolSAR Images , 2019, 2019 6th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR).

[14]  Dongmei Chen,et al.  Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[15]  Michael Ying Yang,et al.  SLIC SUPERPIXELS FOR OBJECT DELINEATION FROM UAV DATA , 2017 .

[16]  Zhengqin Li,et al.  Superpixel segmentation using Linear Spectral Clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Xinying Xu,et al.  Automatic Image Segmentation With Superpixels and Image-Level Labels , 2019, IEEE Access.

[18]  Sven J. Dickinson,et al.  TurboPixels: Fast Superpixels Using Geometric Flows , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Xinying Xu,et al.  High precision image segmentation algorithm using SLIC and neighborhood rough set , 2018, Multimedia Tools and Applications.

[20]  Yufei Chen,et al.  A graph based superpixel generation algorithm , 2018, Applied Intelligence.

[21]  Biao Hou,et al.  High-quality depth up-sampling based on multi-scale SLIC , 2018 .

[22]  Tengpeng Li,et al.  Co-Saliency Detection via Mask-Guided Fully Convolutional Networks With Multi-Scale Label Smoothing , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Hao Li,et al.  A texture fused superpixel algorithm for coal mine waste rock image segmentation , 2019, International Journal of Coal Preparation and Utilization.

[24]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[25]  Ke Wu,et al.  Semisupervised Classification Based on SLIC Segmentation for Hyperspectral Image , 2020, IEEE Geoscience and Remote Sensing Letters.

[26]  Harshad B. Prajapati,et al.  Analytical study of parallel and distributed image processing , 2011, 2011 International Conference on Image Information Processing.

[27]  Wilfried Philips,et al.  Fast and Robust Variational Optical Flow for High-Resolution Images Using SLIC Superpixels , 2015, ACIVS.

[28]  Amaury Lendasse,et al.  Parameter-free image segmentation with SLIC , 2018, Neurocomputing.

[29]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[30]  Karl Potisepp,et al.  Large-scale Image Processing Using MapReduce , 2013 .

[31]  Mohsen Guizani,et al.  A Spark-Based Parallel Fuzzy $c$ -Means Segmentation Algorithm for Agricultural Image Big Data , 2019, IEEE Access.

[32]  Ikhlas Abdel-Qader,et al.  Fault detection in photovoltaic system using SLIC and thermal images , 2017, 2017 8th International Conference on Information Technology (ICIT).

[33]  Rama Chellappa,et al.  Entropy rate superpixel segmentation , 2011, CVPR 2011.

[34]  Joshua Zhexue Huang,et al.  Big data analytics on Apache Spark , 2016, International Journal of Data Science and Analytics.

[35]  Grant J. Scott,et al.  Remote Sensing Object Localization with Deep Heterogeneous Superpixel Features , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[36]  Jitendra Malik,et al.  Learning to detect natural image boundaries using local brightness, color, and texture cues , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Shaoliang Peng,et al.  Bioinformatics applications on Apache Spark , 2018, GigaScience.

[38]  Helen D. Karatza,et al.  Performance evaluation of cloud-based log file analysis with Apache Hadoop and Apache Spark , 2017, J. Syst. Softw..