A Spark-Based Parallel Fuzzy $c$ -Means Segmentation Algorithm for Agricultural Image Big Data

With the explosive growth of image big data in the agriculture field, image segmentation algorithms are confronted with unprecedented challenges. As one of the most important images segmentation technologies, the fuzzy $c$ -means (FCMs) algorithm has been widely used in the field of agricultural image segmentation as it provides simple computation and high-quality segmentation. However, due to its large amount of computation, the sequential FCM algorithm is too slow to finish the segmentation task within an acceptable time. This paper proposes a parallel FCM segmentation algorithm based on the distributed memory computing platform Apache Spark for agricultural image big data. The input image is first converted from the RGB color space to the lab color space and generates point cloud data. Then, point cloud data are partitioned and stored in different computing nodes, in which the membership degrees of pixel points to different cluster centers are calculated and the cluster centers are updated iteratively in a data-parallel form until the stopping condition is satisfied. Finally, point cloud data are restored after clustering for reconstructing the segmented image. On the Spark platform, the performance of the parallel FCMs algorithm is evaluated and reaches an average speedup of 12.54 on ten computing nodes. The experimental results show that the Spark-based parallel FCMs algorithm can obtain a significant increase in speedup, and the agricultural image testing set delivers a better performance improvement of 128% than the Hadoop-based approach. This paper indicates that the Spark-based parallel FCM algorithm provides faster speed of segmentation for agricultural image big data and has better scale-up and size-up rates.

[1]  Yun Zhang,et al.  Identification of Apple Leaf Diseases Based on Deep Convolutional Neural Networks , 2017, Symmetry.

[2]  Juan Song,et al.  Improving Semantic Image Segmentation With a Probabilistic Superpixel-Based Dense Conditional Random Field , 2018, IEEE Access.

[3]  Parul Sahare,et al.  Multilingual Character Segmentation and Recognition Schemes for Indian Document Images , 2018, IEEE Access.

[4]  Xinming Zhang,et al.  Spark-SIFT: A Spark-Based Large-Scale Image Feature Extract System , 2017, 2017 13th International Conference on Semantics, Knowledge and Grids (SKG).

[5]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[6]  Nasser Ghadiri,et al.  BigFCM: Fast, precise and scalable FCM on hadoop , 2016, Future Gener. Comput. Syst..

[7]  Lav R. Khot,et al.  A Real-Time Weed Mapping and Precision Herbicide Spraying System for Row Crops , 2018, Sensors.

[8]  P. Balasubramaniam,et al.  Segmentation of nutrient deficiency in incomplete crop images using intuitionistic fuzzy C-means clustering algorithm , 2015, Nonlinear Dynamics.

[9]  G. Ram Mohana Reddy,et al.  A Parallel Fuzzy C Means Algorithm for Brain Tumor Segmentation on Multiple MRI Images , 2013 .

[10]  Shuo Li,et al.  Glioma Segmentation With a Unified Algorithm in Multimodal MRI Images , 2018, IEEE Access.

[11]  Zhen Liu,et al.  The recognition of litchi clusters and the calculation of picking point in a nocturnal natural environment , 2018 .

[12]  J. Santhana Krishnan,et al.  Big Data-Based Image Retrieval Model Using Shape Adaptive Discreet Curvelet Transformation , 2018, Advances in Intelligent Systems and Computing.

[13]  Mahmoud Al-Ayyoub,et al.  Accelerating compute-intensive image segmentation algorithms using GPUs , 2017, The Journal of Supercomputing.

[14]  Raul Morais,et al.  Machine learning classification methods in hyperspectral data processing for agricultural applications , 2018, ICGDA.

[15]  Jie Shen,et al.  Adaptive fuzzy c-means algorithm based on local noise detecting for image segmentation , 2016, IET Image Process..

[16]  Wenyu Zhang,et al.  Hyperspectral Identification and Classification of Oilseed Rape Waterlogging Stress Levels Using Parallel Computing , 2018, IEEE Access.

[17]  Kurumalla Suresh,et al.  Various Image Segmentation Algorithms: A Survey , 2018, Smart Intelligent Computing and Applications.

[18]  P. Sriramakrishnan,et al.  Rapid brain tissue segmentation process by modified FCM algorithm with CUDA enabled GPU machine , 2018, Int. J. Imaging Syst. Technol..

[19]  D. P. Acharjya,et al.  Clustering Algorithm in Possibilistic Exponential Fuzzy C-Mean Segmenting Medical Images , 2017 .

[20]  Chun-Yan Han,et al.  Improved SLIC imagine segmentation algorithm based on K-means , 2017, Cluster Computing.

[21]  Xin Wang,et al.  A fast image segmentation algorithm for detection of pseudo-foreign fibers in lint cotton , 2015, Comput. Electr. Eng..

[22]  Wei Dai,et al.  An Improved Hybrid Canopy-Fuzzy C-Means Clustering Algorithm Based on MapReduce Model , 2016, J. Comput. Sci. Eng..

[23]  Rajesh N. Davé,et al.  Validating fuzzy partitions obtained through c-shells clustering , 1996, Pattern Recognit. Lett..

[24]  Francisco Herrera,et al.  A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark , 2018, Swarm Evol. Comput..

[25]  Yuexiang Li,et al.  cC-GAN: A Robust Transfer-Learning Framework for HEp-2 Specimen Image Segmentation , 2018, IEEE Access.

[26]  Jyoti Arora,et al.  Fuzzy c-Means Clustering Strategies: A Review of Distance Measures , 2018, Advances in Intelligent Systems and Computing.

[27]  Bouchaib Cherradi,et al.  GPU fuzzy c-means algorithm implementations: performance analysis on medical image segmentation , 2017, Multimedia Tools and Applications.

[28]  Ahmed A. Nashat,et al.  New effective techniques for automatic detection and classification of external olive fruits defects based on image processing techniques , 2019, Multidimens. Syst. Signal Process..

[29]  Paulo Estevão Cruvinel,et al.  Big Data Infrastructure for Agricultural Tomographic Images Reconstruction , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[30]  Deise Santana Maia,et al.  Face Detection and Recognition in Color Images under Matlab , 2016 .

[31]  Xu Zhang,et al.  Interactive Image Segmentation on Multiscale Appearances , 2018, IEEE Access.

[32]  Gonzalo Pajares,et al.  Discrete wavelets transform for improving greenness image segmentation in agricultural images , 2015, Comput. Electron. Agric..

[33]  Yambem Jina Chanu,et al.  Image Segmentation Using K -means Clustering Algorithm and Subtractive Clustering Algorithm , 2015 .

[34]  Cheng-Fan Li,et al.  A New Fuzzy Clustering Method With Neighborhood Distance Constraint for Volcanic Ash Cloud , 2016, IEEE Access.

[35]  Jiang Bian,et al.  Design and implementation of an automatic peach-harvesting robot system , 2018, 2018 Tenth International Conference on Advanced Computational Intelligence (ICACI).

[36]  Chang-Dong Wang,et al.  Multi-view Intact Space Clustering , 2017, 2017 4th IAPR Asian Conference on Pattern Recognition (ACPR).

[37]  Seniha Esen Yuksel,et al.  Hyperspectral image segmentation using normalized cuts , 2016, 2016 24th Signal Processing and Communication Application Conference (SIU).

[38]  Ye Zhang,et al.  Large Scale Remote Sensing Image Segmentation Based on Fuzzy Region Competition and Gaussian Mixture Model , 2018, IEEE Access.

[39]  Sheng Liu,et al.  Towards Clinical Diagnosis: Automated Stroke Lesion Segmentation on Multi-Spectral MR Image Using Convolutional Neural Network , 2018, IEEE Access.

[40]  Dipak Kumar Basu,et al.  A modified fuzzy C‐means algorithm using scale control spatial information for MRI image segmentation in the presence of noise , 2015 .

[41]  Huarui Wu,et al.  Agricultural image target segmentation based on fuzzy set , 2015 .

[42]  Hatem S. Ali,et al.  Optimizing Oil Reduction in Fried Eggplant Rings , 2013 .

[43]  Zhang Aiqun,et al.  Nutrient deficiency diagnosis method for rape leaves using color histogram on HSV space , 2016 .

[44]  Mahmoud Al-Ayyoub,et al.  A GPU-based implementations of the fuzzy C-means algorithms for medical image segmentation , 2015, The Journal of Supercomputing.