Hyperspectral band selection for soybean classification based on information measure in FRS theory

Soybeans and soy foods have attracted widespread attention due to their health benefits. Special varieties of soybeans are in demand from soybean processing enterprises. Because of the advantage of rapid measurement with minimal sample preparation, hyperspectral imaging technology is used for classifying soybean varieties. Based on fuzzy rough set (FRS) theory, the research of hyperspectral band selection can provide the foundation for variety classification. The performance of band selection with Gaussian membership functions and triangular membership functions under various parameters were explored. Appropriate ranges of parameters were determined by the number of bands and mutual information of subsets relative to the decision. The effectiveness of the proposed algorithms was validated through experiments on soybean hyperspectral datasets by building two classification methods, namely Extreme Learning Machine and Random Forest. Compared with ranking methods, the proposed algorithm provides a promising improvement in classification accuracy by selecting highly informative bands. To further reduce the size of the subset, a post-pruning design was used. For the Gaussian membership function, a subset containing eight bands achieved an average accuracy of 99.11% after post-pruning. As well as classification accuracy, we explored stability of band selection algorithm under small perturbations. The band selection algorithm of the Gaussian membership function was more stable than that of the triangular membership function. The results showed that the information measure (IM) based band selection algorithm is effective at obtaining satisfactory classification accuracy and providing stable results under perturbations.

[1]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[2]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[3]  Xiao Zhang,et al.  Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy , 2016, Pattern Recognit..

[4]  D. Jayas,et al.  Performance evaluation of a model for the classification of contaminants from wheat using near-infrared hyperspectral imaging , 2016 .

[5]  Melanie Hilario,et al.  Knowledge and Information Systems , 2007 .

[6]  Qiang Shen,et al.  Centre for Intelligent Systems and Their Applications Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Sets and Systems ( ) – Fuzzy–rough Attribute Reduction with Application to Web Categorization , 2022 .

[7]  Lei Liu,et al.  Ensemble gene selection by grouping for microarray data classification , 2010, J. Biomed. Informatics.

[8]  Colm P. O'Donnell,et al.  Hyperspectral imaging – an emerging process analytical tool for food quality and safety control , 2007 .

[9]  Qinghua Hu,et al.  Information-preserving hybrid data reduction based on fuzzy-rough techniques , 2006, Pattern Recognit. Lett..

[10]  Feifei Xu,et al.  Fuzzy-rough attribute reduction via mutual information with an application to cancer classification , 2009, Comput. Math. Appl..

[11]  Xizhao Wang,et al.  On the generalization of fuzzy rough sets , 2005, IEEE Transactions on Fuzzy Systems.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  Zdzislaw Pawlak,et al.  Rough sets and intelligent data analysis , 2002, Inf. Sci..

[14]  Da-Wen Sun,et al.  Hyperspectral imaging as an effective tool for quality analysis and control of fish and other seafoods: Current research and potential applications , 2014 .

[15]  Jiye Liang,et al.  Fuzzy-rough feature selection accelerator , 2015, Fuzzy Sets Syst..

[16]  Jianbin Qiu,et al.  A novel approach to hyperspectral band selection based on spectral shape similarity analysis and fast branch and bound search , 2014, Eng. Appl. Artif. Intell..

[17]  Qinghua Hu,et al.  On Robust Fuzzy Rough Set Models , 2012, IEEE Transactions on Fuzzy Systems.

[18]  Qinghua Hu,et al.  Uncertainty measures for fuzzy relations and their applications , 2007, Appl. Soft Comput..

[19]  Wei-Zhi Wu,et al.  Constructive and axiomatic approaches of fuzzy approximation operators , 2004, Inf. Sci..

[20]  Kezhu Tan,et al.  Neighborhood mutual information and its application on hyperspectral band selection for classification , 2016 .

[21]  M. Messina Soy foods and soybean isoflavones and menopausal health. , 2002, Nutrition in clinical care : an official publication of Tufts University.

[22]  Dimitrios Moshou,et al.  Active learning system for weed species recognition based on hyperspectral sensing , 2016 .

[23]  Chongzhao Han,et al.  Complement information entropy for uncertainty measure in fuzzy rough set and its applications , 2015, Soft Comput..

[24]  P. Cunningham,et al.  Solutions to Instability Problems with Sequential Wrapper-based Approaches to Feature Selection , 2002 .

[25]  Zhen Xu,et al.  Hyperspectral band selection based on consistency-measure of neighborhood rough set theory , 2016 .

[26]  I. A. Lyashenko,et al.  Impact of an elastic sphere with an elastic half space revisited: Numerical analysis based on the method of dimensionality reduction , 2015, Scientific Reports.

[27]  Ashish Ghosh,et al.  Band elimination of hyperspectral imagery using partitioned band image correlation and capacitory discrimination , 2014 .

[28]  Maryam Imani,et al.  Band Clustering-Based Feature Extraction for Classification of Hyperspectral Images Using Limited Training Samples , 2014, IEEE Geoscience and Remote Sensing Letters.

[29]  Y. Ishimi Soybean isoflavones in bone health. , 2009, Forum of nutrition.

[30]  Sundaram Suresh,et al.  Performance enhancement of extreme learning machine for multi-category sparse data classification problems , 2010, Eng. Appl. Artif. Intell..

[31]  Kezhu Tan,et al.  Stability analysis of hyperspectral band selection algorithms based on neighborhood rough set theory for classification , 2017 .

[32]  Da-Wen Sun,et al.  Recent Applications of Spectroscopic and Hyperspectral Imaging Techniques with Chemometric Analysis for Rapid Inspection of Microbial Spoilage in Muscle Foods , 2015 .

[33]  Jiye Liang,et al.  Uncertainty Measures for Multigranulation Approximation Space , 2015, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[34]  Peijun Du,et al.  Novel segmented stacked autoencoder for effective dimensionality reduction and feature extraction in hyperspectral imaging , 2016, Neurocomputing.

[35]  Witold Pedrycz,et al.  Gaussian kernel based fuzzy rough sets: Model, uncertainty measures and applications , 2010, Int. J. Approx. Reason..

[36]  Francisco Argüello,et al.  Exploring ELM-based spatial–spectral classification of hyperspectral images , 2014 .