Application of machine learning in understanding plant virus pathogenesis: trends and perspectives on emergence, diagnosis, host-virus interplay and management

Inclusion of high throughput technologies in the field of biology has generated massive amounts of biological data in the recent years. Now, transforming these huge volumes of data into knowledge is the primary challenge in computational biology. The traditional methods of data analysis have failed to carry out the task. Hence, researchers are turning to machine learning based approaches for the analysis of high-dimensional big data. In machine learning, once a model is trained with a training dataset, it can be applied on a testing dataset which is independent. In current times, deep learning algorithms further promote the application of machine learning in several field of biology including plant virology. Considering a significant progress in the application of machine learning in understanding plant virology, this review highlights an introductory note on machine learning and comprehensively discusses the trends and prospects of machine learning in diagnosis of viral diseases, understanding host-virus interplay and emergence of plant viruses.

[1]  L. Galipienso,et al.  Detection of Plant Viruses and Disease Management: Relevance of Genetic Diversity and Evolution , 2020, Frontiers in Plant Science.

[2]  Sorin Draghici,et al.  Machine Learning and Its Applications to Biology , 2007, PLoS Comput. Biol..

[3]  Hitoshi Iyatomi,et al.  Basic Study of Automated Diagnosis of Viral Plant Diseases Using Convolutional Neural Networks , 2015, ISVC.

[4]  Ashutosh Kumar Singh,et al.  Deep Learning for Plant Stress Phenotyping: Trends and Future Perspectives. , 2018, Trends in plant science.

[5]  Rampi Ramprasad,et al.  Screening of Therapeutic Agents for COVID-19 Using Machine Learning and Ensemble Docking Studies , 2020, The journal of physical chemistry letters.

[6]  Bo Wang,et al.  Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities , 2018, Inf. Fusion.

[7]  Anne-Katrin Mahlein Plant Disease Detection by Imaging Sensors - Parallels and Specific Demands for Precision Agriculture and Plant Phenotyping. , 2016, Plant disease.

[8]  Y. Liu,et al.  Comparative transcriptome analysis in Triticum aestivum infecting wheat dwarf virus reveals the effects of viral infection on phytohormone and photosynthesis metabolism pathways , 2020, Phytopathology Research.

[9]  Yeşim Benal Öztekin,et al.  Performance Analysis of Deep Learning CNN Models for Variety Classification in Hazelnut , 2021, Sustainability.

[10]  Malay Kishore Dutta,et al.  VirLeafNet: Automatic analysis and viral disease diagnosis using deep-learning in Vigna mungo plant , 2020, Ecol. Informatics.

[11]  Xiao-Meng Zhang,et al.  Graph Neural Networks and Their Current Applications in Bioinformatics , 2021, Frontiers in Genetics.

[12]  Gurjit S. Randhawa,et al.  Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study , 2020, bioRxiv.

[13]  H. Garcia-Ruiz,et al.  Changes in Subcellular Localization of Host Proteins Induced by Plant Viruses , 2021, Viruses.

[14]  Sarah Webb Deep learning for biology , 2018, Nature.

[15]  Jesse Poland,et al.  Advances and Challenges in Genomic Selection for Disease Resistance. , 2016, Annual review of phytopathology.

[16]  J. García,et al.  How do plant viruses induce disease? Interactions and interference with host components. , 2011, The Journal of general virology.

[17]  David A. Landgrebe,et al.  Signal Theory Methods in Multispectral Remote Sensing , 2003 .

[18]  Y. Hu,et al.  Machine Learning Methods for Predicting Human-Adaptive Influenza A Viruses Based on Viral Nucleotide Compositions , 2019, Molecular biology and evolution.

[19]  Karin J. Metzner,et al.  V-pipe: a computational pipeline for assessing viral genetic diversity from high-throughput data , 2021, Bioinform..

[20]  G. de los Campos,et al.  Genomic Selection in Plant Breeding: Methods, Models, and Perspectives. , 2017, Trends in plant science.

[21]  Debmalya Barh,et al.  PlantOmics: The Omics of Plant Science , 2015, Springer India.

[22]  Ryuei Nishii,et al.  Statistical and Machine Learning Approaches to Predict Gene Regulatory Networks From Transcriptome Datasets , 2018, Front. Plant Sci..

[23]  E. Fontes,et al.  Plant immunity against viruses: antiviral immune receptors in focus , 2016, Annals of botany.

[24]  Dinesh Gupta,et al.  Supervised Learning Classification Models for Prediction of Plant Virus Encoded RNA Silencing Suppressors , 2014, PloS one.

[25]  Sandeep K. Kushwaha,et al.  NBSPred: a support vector machine-based high-throughput pipeline for plant resistance protein NBSLRR prediction , 2016, Bioinform..

[26]  Byoung-Tak Zhang,et al.  Supervised Learning Methods for MicroRNA Studies , 2008 .

[27]  Jason Weston,et al.  Semi-supervised Protein Classification Using Cluster Kernels , 2003, NIPS.

[28]  F. White Faculty Opinions recommendation of Receptor Kinases in Plant-Pathogen Interactions: More Than Pattern Recognition. , 2019, Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature.

[29]  Aboul Ella Hassanien,et al.  The prediction of virus mutation using neural networks and rough set techniques , 2016, EURASIP J. Bioinform. Syst. Biol..

[30]  Xiangfeng Wang,et al.  Machine learning for Big Data analytics in plants. , 2014, Trends in plant science.

[31]  Dan Li,et al.  Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction , 2020, Frontiers in Genetics.

[32]  Leslie N. Smith,et al.  A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay , 2018, ArXiv.

[33]  Dong Xu,et al.  MU-LOC: A Machine-Learning Method for Predicting Mitochondrially Localized Proteins in Plants , 2018, Front. Plant Sci..

[34]  Martin Krzywinski,et al.  The curse(s) of dimensionality , 2018, Nature Methods.

[35]  Jennifer M. Taylor,et al.  LOCALIZER: subcellular localization prediction of both plant and effector proteins in the plant cell , 2016, Scientific Reports.

[36]  Andrew P French,et al.  Hyperspectral image analysis techniques for the detection and classification of the early onset of plant disease and stress , 2017, Plant Methods.

[37]  Jianguo Wu,et al.  Roles of Small RNAs in Virus-Plant Interactions , 2019, Viruses.

[38]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[39]  Anne E Carpenter,et al.  Opportunities and obstacles for deep learning in biology and medicine , 2017, bioRxiv.

[40]  R. V. D. van der Hoorn,et al.  Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function[OPEN] , 2018, Plant Cell.

[41]  Konstantinos P. Ferentinos,et al.  Deep learning models for plant disease detection and diagnosis , 2018, Comput. Electron. Agric..

[42]  S. Jackson,et al.  Machine learning and complex biological data , 2019, Genome Biology.

[43]  C.A.L. Bailer-Jones,et al.  An introduction to artificial neural networks , 2001 .

[44]  S. Chakraborty,et al.  A geminivirus betasatellite damages the structural and functional integrity of chloroplasts leading to symptom formation and inhibition of photosynthesis , 2015, Journal of experimental botany.

[45]  Otávio J. B. Brustolini,et al.  Bioinformatics Analysis of the Receptor-Like Kinase (RLK) Superfamily. , 2017, Methods in molecular biology.

[46]  Yang Young Lu,et al.  VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data , 2017, Microbiome.

[47]  Ashutosh Kumar Singh,et al.  Machine Learning for High-Throughput Stress Phenotyping in Plants. , 2016, Trends in plant science.

[48]  M. Varjosalo,et al.  Nuclear proteome of virus-infected and healthy potato leaves , 2020, BMC Plant Biology.

[49]  Qiang Zhao,et al.  Transcriptome analysis of two cultivars of tobacco in response to Cucumber mosaic virus infection , 2019, Scientific Reports.

[50]  A. Nath,et al.  Probing an optimal class distribution for enhancing prediction and feature characterization of plant virus-encoded RNA-silencing suppressors , 2016, 3 Biotech.

[51]  S. Elena,et al.  Experimental evolution of plant RNA viruses , 2008, Heredity.

[52]  Yang Tao,et al.  Early Detection of Tomato Spotted Wilt Virus by Hyperspectral Imaging and Outlier Removal Auxiliary Classifier Generative Adversarial Nets (OR-AC-GAN) , 2019, Scientific Reports.

[53]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[54]  Thales F. M. Carvalho,et al.  Fangorn Forest (F2): a machine learning approach to classify genes and genera in the family Geminiviridae , 2017, BMC Bioinformatics.

[55]  Kazushi Murakoshi,et al.  Avoiding overfitting in multilayer perceptrons with feeling-of-knowing using self-organizing maps. , 2005, Bio Systems.

[56]  M. Roossinck,et al.  Plant virus metagenomics: what we know and why we need to know more , 2014, Front. Plant Sci..

[57]  A. Macho,et al.  Molecular dialogues between viruses and receptor‐like kinases in plants , 2019, Molecular plant pathology.

[58]  X. Yao Evolving Artificial Neural Networks , 1999 .

[59]  Jana Sperschneider,et al.  EffectorP: predicting fungal effector proteins from secretomes using machine learning. , 2016, The New phytologist.

[60]  S. Chakraborty,et al.  Impact of viral silencing suppressors on plant viral synergism: a global agro-economic concern , 2021, Applied Microbiology and Biotechnology.

[61]  Bhartendu Nath Mishra,et al.  Machine Learning Techniques in Plant Biology , 2015 .

[62]  Xiaofei Cheng,et al.  The Tug-of-War between Plants and Viruses: Great Progress and Many Remaining Questions , 2019, Viruses.

[63]  Y. Haviv,et al.  Comparative metabolomics and transcriptomics of plant response to Tomato yellow leaf curl virus infection in resistant and susceptible tomato cultivars , 2014, Metabolomics.

[64]  Zheng Rong Yang,et al.  A novel radial basis function neural network for discriminant analysis , 2006, IEEE Transactions on Neural Networks.

[65]  Oriol Vinyals,et al.  Highly accurate protein structure prediction with AlphaFold , 2021, Nature.

[66]  V. S. S. Prasad,et al.  Applications And Potentials Of Artificial Neural Networks In Plant Tissue Culture , 2008 .

[67]  S. Chakraborty,et al.  Complexity of begomovirus and betasatellite populations associated with chilli leaf curl disease in India. , 2015, The Journal of general virology.

[68]  Binhua Tang,et al.  Recent Advances of Deep Learning in Bioinformatics and Computational Biology , 2019, Front. Genet..

[69]  Wen Zhang,et al.  Deep Learning Application in Plant Stress Imaging: A Review , 2020, AgriEngineering.