Shape Oriented Feature Selection for Tomato Plant Identification

Selection of relevant features for classification from a high dimensional data set by keeping their class discriminatory information intact is a classical problem in Machine Learning. The classification power of the features can be measured from the point of view of redundant information and correlations among them. Choosing minimal set of features optimizes time, space complexity related cost and simplifies the classifier design, resulting in better classification accuracy. In this paper, tomato (Solanum Lycopersicum L) leaves and fruiting habits were chosen with a futuristic goal to build a prototype model of leaf & fruit classification. By applying digital image processing techniques, tomato leaf and fruit images were pre-processed and morphological shape based features were computed. Next, supervised filter and wrapper based feature selection techniques were adopted to choose the optimal feature set leading to small within-class variance and large among-class distance which may be of utter importance in building the model for recognition system of the tomato leaf and fruiting habit genre.

[1]  Luo Zhang,et al.  A method for feature selection based on the correlation analysis , 2012, Proceedings of 2012 International Conference on Measurement, Information and Control.

[2]  K. Deb,et al.  Dimensionality reduction of morphological features of tomato leaves and fruiting habits , 2012, 2012 International Conference on Communications, Devices and Intelligent Systems (CODIS).

[3]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[4]  Georgios Tsirigotis,et al.  Morphological feature selection and neural classification for electronic components , 2009 .

[5]  Craig Valli,et al.  A Wrapper-Based Feature Selection for Analysis of Large Data Sets , 2010 .

[6]  M. Ashraful Amin,et al.  Leaf shape identification based plant biometrics , 2010, 2010 13th International Conference on Computer and Information Technology (ICCIT).

[7]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[8]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[9]  P. Langley Selection of Relevant Features in Machine Learning , 1994 .

[10]  S. Maurya,et al.  A semi automatic plant identification based on digital leaf and flower images , 2012, IEEE-International Conference On Advances In Engineering, Science And Management (ICAESM -2012).

[11]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[12]  Slobodan Petrovic,et al.  Improving Effectiveness of Intrusion Detection by Correlation Feature Selection , 2010, 2010 International Conference on Availability, Reliability and Security.

[13]  L. N. Kanal,et al.  Handbook of Statistics, Vol. 2. Classification, Pattern Recognition and Reduction of Dimensionality. , 1985 .

[14]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..