Classification of bacterial nanowire proteins using Machine Learning and Feature Engineering model

Nanowires (NW) have been extensively studied for Shewanella spp. and Geobacter spp. and are mostly produced by Type IV pili or multiheme c-type cytochrome. Electron transfer via NW is the most studied mechanism in microbially induced corrosion, with recent interest in application in bioelectronics and biosensor. In this study, a machine learning (ML) based tool was developed to classify NW proteins. A manually curated 999 protein collection was developed as an NW protein dataset. Gene ontology analysis of the dataset revealed microbial NW is part of membranal proteins with metal ion binding motifs and plays a central role in electron transfer activity. Random Forest (RF), support vector machine (SVM), and extreme gradient boost (XGBoost) models were implemented in the prediction model and were observed to identify target proteins based on functional, structural, and physicochemical properties with 89.33%, 95.6%, and 99.99% accuracy. Dipetide amino acid composition, transition, and distribution protein features of NW are key important features aiding in the model’s high performance.

[1]  N. Malvankar,et al.  Structure of Geobacter cytochrome OmcZ identifies mechanism of nanowire assembly and conductivity , 2023, Nature Microbiology.

[2]  Cathy H. Wu,et al.  UniProt: the Universal Protein Knowledgebase in 2023 , 2022, Nucleic acids research.

[3]  D. Sanches,et al.  MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors , 2021, Briefings Bioinform..

[4]  Kelly P. Nevin,et al.  Extracellular Electron Exchange Capabilities of Desulfovibrio ferrophilus and Desulfopila corrodens. , 2021, Environmental science & technology.

[5]  I. Sazanovich,et al.  Nanosecond heme-to-heme electron transfer rates in a multiheme cytochrome nanowire reported by a spectrally unique His/Met-ligated heme , 2021, Proceedings of the National Academy of Sciences.

[6]  pan wang,et al.  A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites , 2021, Frontiers in Genetics.

[7]  Dawei Zhang,et al.  Adaptive bidirectional extracellular electron transfer during accelerated microbiologically influenced corrosion of stainless steel , 2021, Communications Materials.

[8]  Kaveh Kavousi,et al.  IAMPE: NMR-Assisted Computational Prediction of Antimicrobial Peptides , 2020, J. Chem. Inf. Model..

[9]  Qingyuan Li,et al.  Prediction of Anticancer Peptides Using a Low-Dimensional Feature Model , 2020, Frontiers in Bioengineering and Biotechnology.

[10]  D. Lovley,et al.  Geobacter Protein Nanowires , 2019, Front. Microbiol..

[11]  G. Reguera Harnessing the power of microbial nanowires , 2018, Microbial biotechnology.

[12]  Derek R Lovley,et al.  The electrically conductive pili of Geobacter species are a recently evolved feature for extracellular electron transfer. , 2016, Microbial genomics.

[13]  N. Burroughs,et al.  Regulation of Gene Expression in Shewanella oneidensis MR-1 during Electron Acceptor Limitation and Bacterial Nanowire Formation , 2016, Applied and Environmental Microbiology.

[14]  Vineet K. Sharma,et al.  Prediction of Biofilm Inhibiting Peptides: An In silico Approach , 2016, Front. Microbiol..

[15]  Rakesh Kumar,et al.  dPABBs: A Novel in silico Approach for Predicting and Designing Anti-biofilm Peptides , 2016, Scientific Reports.

[16]  Hye Suk Byun,et al.  Shewanella oneidensis MR-1 nanowires are outer membrane and periplasmic extensions of the extracellular electron transport components , 2014, Proceedings of the National Academy of Sciences.

[17]  Derek R Lovley,et al.  Microbial nanowires for bioenergy applications. , 2014, Current opinion in biotechnology.

[18]  Marcel J. T. Reinders,et al.  SPiCE: a web-based tool for sequence-based protein classification and exploration , 2014, BMC Bioinformatics.

[19]  Achuthsankar S. Nair,et al.  Composition, Transition and Distribution (CTD) — A dynamic feature for predictions based on hierarchical structure of cellular sorting , 2011, 2011 Annual IEEE India Conference.

[20]  D. Lovley,et al.  Purification and Characterization of OmcZ, an Outer-Surface, Octaheme c-Type Cytochrome Essential for Optimal Current Production by Geobacter sulfurreducens , 2010, Applied and Environmental Microbiology.

[21]  Robert D. Finn,et al.  InterPro: the integrative protein signature database , 2008, Nucleic Acids Res..

[22]  T. Mehta,et al.  Outer Membrane c-Type Cytochromes Required for Fe(III) and Mn(IV) Oxide Reduction in Geobacter sulfurreducens , 2005, Applied and Environmental Microbiology.

[23]  T. Mehta,et al.  Extracellular electron transfer via microbial nanowires , 2005, Nature.

[24]  K. Chou,et al.  Protein subcellular location prediction. , 1999, Protein engineering.