Image processing techniques represent innovative tools for comparative analysis of proteins

Different bioinformatic and data-mining approaches have been used for the analysis of proteins. Here, we describe a novel, robust, and reliable approach for comparative analysis of a large number of proteins by combining Image Processing Techniques and Convolutional Deep Neural Network (IPT-CNN). As proof of principle, we used IPT-CNN to predict different subtypes of Influenza A virus (IAV). Over 8000 sequences of surface proteins haemagglutinin (HA) and neuraminidase (NA) from different IAV subtypes were used to create polynomial or binary vector datasets. The datasets were then converted into binary images. Analysis of these images enabled the classification of IAV subtypes with 100% accuracy and, compared to non-image-based approaches, within a shorter time frame. The proteome-based IPT-CNN approach described here may be used for analysis and proteome-based classification of other proteins.

[1]  Tomohiro Kuroda,et al.  Computer-aided diagnosis of lung nodule classification between benign nodule, primary lung cancer, and metastatic lung cancer at different image size using deep convolutional neural network with transfer learning , 2018, PloS one.

[2]  Ben Glocker,et al.  Automated cardiovascular magnetic resonance image analysis with fully convolutional networks , 2017, Journal of Cardiovascular Magnetic Resonance.

[3]  B. Frey,et al.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning , 2015, Nature Biotechnology.

[4]  E. Ebrahimie,et al.  Interaction between Bovine leukemia virus (BLV) infection and age on telomerase misregulation , 2015, Veterinary Research Communications.

[5]  István Csabai,et al.  Detecting and classifying lesions in mammograms with Deep Learning , 2017, Scientific Reports.

[6]  Feng Liu,et al.  Deep Learning and Its Applications in Biomedicine , 2018, Genom. Proteom. Bioinform..

[7]  A. M. Clark,et al.  Functional Evolution of Influenza Virus NS1 Protein in Currently Circulating Human 2009 Pandemic H1N1 Viruses , 2017, Journal of Virology.

[8]  Samuel S. Shepard,et al.  LABEL: Fast and Accurate Lineage Assignment with Assessment of H5N1 and H9N2 Influenza A Hemagglutinins , 2014, PloS one.

[9]  S. Van der Auwera,et al.  ClassyFlu: Classification of Influenza A Viruses with Discriminatively Trained Profile-HMMs , 2014, PloS one.

[10]  Amir Hossein KayvanJoo,et al.  Unravelling evolution of Nanog, the key transcription factor involved in self-renewal of undifferentiated embryonic stem cells, by pattern recognition in nucleotide and tandem repeats characteristics. , 2016, Gene.

[11]  David L. Adelson,et al.  Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein , 2014, PloS one.

[12]  Tien Dat Nguyen,et al.  Combining Deep and Handcrafted Image Features for Presentation Attack Detection in Face Recognition Systems Using Visible-Light Camera Sensors , 2018, Sensors.

[13]  SchmidhuberJürgen Deep learning in neural networks , 2015 .

[14]  M. Ebrahimi,et al.  Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology. , 2014, Journal of theoretical biology.

[15]  Sujin Pyo,et al.  Predictability of machine learning techniques to forecast the trends of market index prices: Hypothesis testing for the Korean stock markets , 2017, PloS one.

[16]  Claudia Mello-Thoms,et al.  Modeling visual search behavior of breast radiologists using a deep convolution neural network , 2018, Journal of medical imaging.

[17]  Xiaomei Ma,et al.  Lung Nodule Detection via Deep Reinforcement Learning , 2018, Front. Oncol..

[18]  O. Stegle,et al.  Deep learning for computational biology , 2016, Molecular systems biology.

[19]  Yuemin Bian,et al.  Deep Learning for Drug Design: an Artificial Intelligence Paradigm for Drug Discovery in the Big Data Era , 2018, The AAPS Journal.

[20]  Nilanjan Dey,et al.  A Survey of Data Mining and Deep Learning in Bioinformatics , 2018, Journal of Medical Systems.

[21]  Swami Sankaranarayanan,et al.  Face recognition accuracy of forensic examiners, superrecognizers, and face recognition algorithms , 2018, Proceedings of the National Academy of Sciences.

[22]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[23]  Yanjun Qi,et al.  Deep Motif: Visualizing Genomic Sequence Classifications , 2016, ArXiv.

[24]  David K. Gifford,et al.  Convolutional neural network architectures for predicting DNA–protein binding , 2016, Bioinform..

[25]  Kengo Kinoshita,et al.  De novo profile generation based on sequence context specificity with the long short-term memory network , 2018, BMC Bioinformatics.

[26]  Petras J. Kundrotas,et al.  Natural language processing in text mining for structural modeling of protein complexes , 2018, BMC Bioinformatics.

[27]  B. Schepens,et al.  Vaccine options for influenza: thinking small. , 2018, Current opinion in immunology.

[28]  Tania Stathaki,et al.  Recent Developments in Deep Learning for Engineering Applications , 2018, Comput. Intell. Neurosci..

[29]  Shandong Wu,et al.  Breast Cancer Molecular Subtype Prediction by Mammographic Radiomic Features. , 2019, Academic radiology.

[30]  Shipra Banik,et al.  Hybrid Machine Learning Technique for Forecasting Dhaka Stock Market Timing Decisions , 2014, Comput. Intell. Neurosci..

[31]  J Rodellar,et al.  Image processing and machine learning in the morphological analysis of blood cells , 2018, International journal of laboratory hematology.

[32]  Jasjit S. Suri,et al.  Deep learning strategy for accurate carotid intima-media thickness measurement: An ultrasound study on Japanese diabetic cohort , 2018, Comput. Biol. Medicine.