A Novel Generative Adversarial Network Model Based on GC-MS Analysis for the Classification of Taif Rose

Rose oil production is believed to be dependent on only a few genotypes of the famous rose Rosa damascena. The aim of this study was to develop a novel GC-MS fingerprint based on the need to expand the genetic resources of oil-bearing rose for industrial cultivation in the Taif region (Saudi Arabia). Gas chromatography-mass spectrometry (GC-MS) is a widely used analytical technique for determining the volatile composition of distilled rose oil from flower data. Because biosample availability, prohibitive costs, and ethical concerns limit observations in agricultural research, we aimed to enhance the quality of analysis by combining real observations with samples generated in silico. This study proposes a novel artificial intelligence model based on generative adversarial neural networks (GANs) to classify Taif rose cultivars using raw GC-MS data. We employed a variant of the GAN known as conditional stacked GANs (cSGANs) to predict Taif rose’s oil content and other latent characteristics without the need to conduct laboratory tests. A hierarchical stack of conditional GANs is used in this algorithm to generate images. A cluster model was developed based on the dataset provided, to quantify the diversity that should be implemented in the proposed model. The networks were trained using the cross-entropy and minimax loss functions. The accuracy of the proposed model was assessed by measuring losses as a function of the number of epochs. The results prove the ability of the proposed model to perfectly generate new real samples of different classes based on the GC-MS fingerprint.

[1]  Ribana Roscher,et al.  Temporal Prediction and Evaluation of Brassica Growth in the Field using Conditional Generative Adversarial Networks , 2021, Comput. Electron. Agric..

[2]  Sean Peisert,et al.  Machine learning for metabolic engineering: A review. , 2020, Metabolic engineering.

[3]  Kwanjeera Wanichthanarak,et al.  Deep metabolome: Applications of deep learning in metabolomics , 2020, Computational and structural biotechnology journal.

[4]  Gorka Epelde,et al.  Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing , 2020, JMIR medical informatics.

[5]  Guan Gui,et al.  6G: Opening New Horizons for Integration of Comfort, Security, and Intelligence , 2020, IEEE Wireless Communications.

[6]  Zuobing Xiao,et al.  Characterization of key aroma compounds from different rose essential oils using gas chromatography-mass spectrometry, gas chromatography–olfactometry and partial least squares regression , 2018, Natural product research.

[7]  Fadhl M Alakwaa,et al.  Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data , 2017, bioRxiv.

[8]  Tom White,et al.  Generative Adversarial Networks: An Overview , 2017, IEEE Signal Processing Magazine.

[9]  Pietro Lió,et al.  Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling , 2017, Briefings Bioinform..

[10]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[11]  Lars K. Nielsen,et al.  Construction of feasible and accurate kinetic models of metabolism: A Bayesian approach , 2016, Scientific Reports.

[12]  Ney Lemke,et al.  Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review , 2016, Front. Physiol..

[13]  David J. Miller,et al.  ATD: Anomalous Topic Discovery in High Dimensional Discrete Data , 2015, IEEE Transactions on Knowledge and Data Engineering.

[14]  I. Atanassov,et al.  Flower phenotype variation, essential oil variation and genetic diversity among Rosa alba L. accessions used for rose oil production in Bulgaria , 2013 .

[15]  S. Lee,et al.  Metabolic network modeling and simulation for drug targeting and discovery , 2012, Biotechnology journal.

[16]  I. Atanassov,et al.  Comparative GC/MS Analysis of Rose Flower and Distilled Oil Volatiles of The Oil Bearing Rose Rosa Damascena , 2011 .

[17]  A. Atanassov,et al.  Rosa Damascena—Genetic Resources and Capacity Building for Molecular Breeding , 2009 .

[18]  B. Vosman,et al.  Microsatellite analysis of Rosa damascena Mill. accessions reveals genetic similarity between genotypes used for rose oil production and old Damask rose varieties , 2005, Theoretical and Applied Genetics.