Gender Identification and Classification of Drosophila melanogaster Flies Using Machine Learning Techniques

Drosophila melanogaster is an important genetic model organism used extensively in medical and biological studies. About 61% of known human genes have a recognizable match in the genetic code of Drosophila, and 50% of fly protein sequences have mammalian analogues. Recently, several investigations in Drosophila have studied the functions of specific genes found in the central nervous system, heart, liver, and kidney. Findings from Drosophila research also serve as a unique tool for studying human diseases. This article presents a novel automated system that classifies the gender of Drosophila flies from microscopic images (ventral view). The proposed system takes an image as input, converts it to grayscale, and extracts texture features from it. Machine learning (ML) classifiers, namely support vector machines (SVM), Naive Bayes (NB), and K-nearest neighbour (KNN), then classify each fly as male or female. The proposed model is evaluated on a dataset of real microscopic images, and the results show that KNN reaches an accuracy of 90%, which is higher than the accuracy of the SVM classifier.
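The pipeline described in the abstract (grayscale conversion, texture-feature extraction, classification) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say which texture features are used, so the sketch assumes three Haralick-style features computed from a gray-level co-occurrence matrix, and it implements only a plain KNN classifier (SVM and NB are omitted). The function names, the 8-level quantization, and the toy "striped" and "smooth" images and labels are all hypothetical stand-ins for real male and female fly images.

```python
from collections import Counter
import math

def to_grayscale(rgb_image, levels=8):
    """Convert rows of (r, g, b) pixels (0-255) to quantized gray levels."""
    gray = []
    for row in rgb_image:
        gray_row = []
        for r, g, b in row:
            luma = 0.299 * r + 0.587 * g + 0.114 * b  # ITU-R BT.601 weights
            gray_row.append(min(levels - 1, int(luma * levels / 256)))
        gray.append(gray_row)
    return gray

def glcm(gray, levels=8, dx=1, dy=0):
    """Normalized, symmetric gray-level co-occurrence matrix for offset (dx, dy)."""
    counts = [[0.0] * levels for _ in range(levels)]
    height, width = len(gray), len(gray[0])
    for y in range(height):
        for x in range(width):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width:
                i, j = gray[y][x], gray[ny][nx]
                counts[i][j] += 1
                counts[j][i] += 1
    total = sum(map(sum, counts)) or 1.0  # avoid division by zero on tiny images
    return [[c / total for c in row] for row in counts]

def texture_features(p):
    """Contrast, energy (angular second moment), and homogeneity of a GLCM."""
    contrast = energy = homogeneity = 0.0
    for i, row in enumerate(p):
        for j, v in enumerate(row):
            contrast += (i - j) ** 2 * v
            energy += v * v
            homogeneity += v / (1 + (i - j) ** 2)
    return [contrast, energy, homogeneity]

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training vectors closest to the query."""
    ranked = sorted(zip(train_x, train_y), key=lambda pair: math.dist(pair[0], query))
    return Counter(label for _, label in ranked[:k]).most_common(1)[0][0]

def make_image(kind, size=8, shift=0):
    """Hypothetical toy image: 'striped' alternates columns, 'smooth' is uniform."""
    rows = []
    for _ in range(size):
        row = []
        for x in range(size):
            if kind == "striped":
                v = 230 if (x + shift) % 2 == 0 else 20
            else:
                v = 120 + shift
            row.append((v, v, v))
        rows.append(row)
    return rows

def extract(image):
    """Full feature pipeline: RGB image -> gray levels -> GLCM -> feature vector."""
    return texture_features(glcm(to_grayscale(image)))

# Toy training set; in the real system the labels would be "male" / "female".
train = [("striped", 0), ("striped", 1), ("smooth", 0), ("smooth", 5)]
train_x = [extract(make_image(kind, shift=s)) for kind, s in train]
train_y = [kind for kind, _ in train]

prediction = knn_predict(train_x, train_y, extract(make_image("smooth", shift=10)))
print(prediction)
```

In practice the feature vectors would come from real microscope images, and features on different scales would normally be standardized before the distance computation so that no single feature dominates the KNN vote.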
