Identification of functional candidates amongst hypothetical proteins of Mycobacterium leprae Br4923, a causative agent of leprosy.

Mycobacterium leprae is an intracellular obligate parasite that causes leprosy in humans, and it leads to the destruction of peripheral nerves and skin deformation. Here, we report an extensive analysis of the hypothetical proteins (HPs) from M. leprae strain Br4923, assigning their functions to better understand the mechanism of pathogenesis and to search for potential therapeutic interventions. The genome of M. leprae encodes 1604 proteins, of which the functions of 632 are not known (HPs). In this paper, we predicted the probable functions of 312 HPs. First, we classified all HPs into families and subfamilies on the basis of sequence similarity, followed by domain assignment, which provides many clues for their possible function. However, the functions of 320 proteins were not predicted because of low sequence similarity with proteins of known function. Annotated HPs were categorized into enzymes, binding proteins, transporters, and proteins involved in cellular processes. We found several novel proteins whose functions were unknown for M. leprae. These proteins have a requisite association with bacterial virulence and pathogenicity. Finally, our sequence-based analysis will be helpful for further validation and the search for potential drug targets while developing effective drugs to cure leprosy.

[1]  Daniel Lim,et al.  Structural Insight into the Transglycosylation Step of Bacterial Cell-Wall Biosynthesis , 2007, Science.

[2]  Peer Bork,et al.  SMART 7: recent updates to the protein domain annotation resource , 2011, Nucleic Acids Res..

[3]  Nathan Linial,et al.  ProtoNet: charting the expanding universe of protein sequences , 2013, Nature Biotechnology.

[4]  David A. Lee,et al.  Predicting protein function from sequence and structure , 2007, Nature Reviews Molecular Cell Biology.

[5]  Marco Punta,et al.  The Rough Guide to In Silico Function Prediction, or How To Use Sequence and Structure Information To Predict Protein Function , 2008, PLoS Comput. Biol..

[6]  S. Brunak,et al.  SignalP 4.0: discriminating signal peptides from transmembrane regions , 2011, Nature Methods.

[7]  B. Barrell,et al.  The decaying genome of Mycobacterium leprae. , 2001, Leprosy review.

[8]  N. Ratcliffe,et al.  Mycobacterium microti may protect itself from intracellular destruction by releasing cyclic AMP into phagosomes , 1975, Nature.

[9]  J. Rougemont,et al.  Comparative genomic and phylogeographic analysis of Mycobacterium leprae , 2009, Nature Genetics.

[10]  Md. Imtaiyaz Hassan,et al.  Discovering a potent small molecule inhibitor for gankyrin using de novo drug design approach , 2011, Int. J. Comput. Biol. Drug Des..

[11]  Zongchao Jia,et al.  Piecing together the structure–function puzzle: Experiences in structure‐based functional annotation of hypothetical proteins , 2007, Proteomics.

[12]  F. Ahmad,et al.  Functional Annotation of Conserved Hypothetical Proteins from Haemophilus influenzae Rd KW20 , 2013, PloS one.

[13]  Shaneen Singh,et al.  Inactivation of tesA Reduces Cell Wall Lipid Production and Increases Drug Susceptibility in Mycobacteria* , 2011, The Journal of Biological Chemistry.

[14]  Marco Punta,et al.  Beyond annotation transfer by homology: novel protein-function prediction methods to assist drug discovery. , 2005, Drug discovery today.

[15]  Narmada Thanki,et al.  CDD: conserved domains and protein three-dimensional structure , 2012, Nucleic Acids Res..

[16]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[17]  Tige R. Rustad,et al.  Genes Required for Intrinsic Multidrug Resistance in Mycobacterium avium , 2004, Antimicrobial Agents and Chemotherapy.

[18]  Dinesh Gupta,et al.  VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens , 2008, BMC Bioinformatics.

[19]  F. Ahmad,et al.  Functional annotation of putative hypothetical proteins from Candida dubliniensis. , 2014, Gene.

[20]  Xiangshu Jin,et al.  The structure and mechanism of myo-inositol-1-phosphate synthase. , 2006, Sub-cellular biochemistry.

[21]  Jenn-Kang Hwang,et al.  Prediction of protein subcellular localization , 2006, Proteins.

[22]  A. Danchin,et al.  Adenylyl cyclases: a heterogeneous class of ATP-utilizing enzymes. , 1994, Progress in nucleic acid research and molecular biology.

[23]  J. Whisstock,et al.  Prediction of protein function from protein sequence and structure , 2003, Quarterly Reviews of Biophysics.

[24]  Gajendra P. S. Raghava,et al.  PSLpred: prediction of subcellular localization of bacterial proteins , 2005, Bioinform..

[25]  C. Baron,et al.  Targeting bacterial secretion systems: benefits of disarmament in the microcosm. , 2007, Infectious disorders drug targets.

[26]  Asimul Islam,et al.  The role of key residues in structure, function, and stability of cytochrome-c , 2013, Cellular and Molecular Life Sciences.

[27]  Guoqing Wang,et al.  Genome-wide screening of pathogenicity islands in Mycobacterium tuberculosis based on the genomic barcode visualization , 2014, Molecular Biology Reports.

[28]  Umashankar Vetrivel,et al.  A novel in silico approach to identify potential therapeutic targets in human bacterial pathogens , 2011, The HUGO Journal.

[29]  P. Brennan,et al.  Deciphering the proteomic profile of Mycobacterium leprae cell envelope , 2008, Proteomics.

[30]  Tsutomu Suzuki,et al.  Discovery and characterization of tRNAIle lysidine synthetase (TilS) , 2010, FEBS letters.

[31]  M. I. Hassan,et al.  Structure‐guided design of peptidic ligand for human prostate specific antigen , 2007, Journal of peptide science : an official publication of the European Peptide Society.

[32]  F. Ahmad,et al.  Progastriscin: structure, function, and its role in tumor progression. , 2010, Journal of molecular cell biology.

[33]  Raquel Tobes,et al.  The TetR Family of Transcriptional Repressors , 2005, Microbiology and Molecular Biology Reviews.

[34]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[35]  L. Rasmussen,et al.  Genome-wide screens for expressed hypothetical proteins. , 2012, Methods in molecular biology.

[36]  R. Rees Airborne infection with Mycobacterium leprae in mice. , 1977, International journal of leprosy and other mycobacterial diseases : official organ of the International Leprosy Association.

[37]  J Schultz,et al.  SMART, a simple modular architecture research tool: identification of signaling domains. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Nathan Linial,et al.  ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree , 2011, Nucleic Acids Res..

[39]  Jacquelyn S. Fetrow,et al.  Structural genomics and its importance for gene function analysis , 2000, Nature Biotechnology.

[40]  M. C. Pessolani,et al.  Comparative genomics of mycobacterial proteases. , 2007, Microbial pathogenesis.

[41]  C. Chothia,et al.  Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. , 2001, Journal of molecular biology.

[42]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[43]  N. Grishin,et al.  BTLCP proteins: a novel family of bacterial transglutaminase-like cysteine proteinases. , 2004, Trends in biochemical sciences.

[44]  A. Munro,et al.  Mycobacterium tuberculosis cytochrome P450 enzymes: a cohort of novel TB drug targets. , 2012, Biochemical Society transactions.

[45]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[46]  N. Chim,et al.  Structural and biochemical characterization of the essential DsbA-like disulfide bond forming protein from Mycobacterium tuberculosis , 2013, BMC Structural Biology.

[47]  N. Verma,et al.  Functional analysis of the O antigen glucosylation gene cluster of Shigella flexneri bacteriophage SfX. , 1999, Microbiology.

[48]  Martin Ester,et al.  PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes , 2010, Bioinform..

[49]  Søren Brunak,et al.  Non-classical protein secretion in bacteria , 2005, BMC Microbiology.

[50]  B. Barrell,et al.  Massive gene decay in the leprosy , 2001 .

[51]  M. Pallen The ESAT-6/WXG100 superfamily -- and a new Gram-positive secretion system? , 2002, Trends in microbiology.

[52]  V. Brahmachari,et al.  Analysis of Expression Profile of Mammalian Cell Entry (mce) Operons of Mycobacterium tuberculosis , 2003, Infection and Immunity.

[53]  W. Sly,et al.  Structure, function and applications of carbonic anhydrase isozymes. , 2013, Bioorganic & medicinal chemistry.

[54]  B. Barrell,et al.  Massive gene decay in the leprosy bacillus , 2001, Nature.

[55]  Jingchu Luo,et al.  Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT , 2003, Mammalian Genome.

[56]  G. Besra,et al.  Identification of a Novel Arabinofuranosyltransferase (AftA) Involved in Cell Wall Arabinan Biosynthesis in Mycobacterium tuberculosis* , 2006, Journal of Biological Chemistry.

[57]  K Falls,et al.  Multiplex sequencing of 1.5 Mb of the Mycobacterium leprae genome. , 1997, Genome research.

[58]  M. Brent Steady progress and recent breakthroughs in the accuracy of automated genome annotation , 2008, Nature Reviews Genetics.

[59]  M. I. Hassan,et al.  Structural Model of Human PSA: A Target for Prostate Cancer Therapy , 2007, Chemical biology & drug design.

[60]  A. Munro,et al.  Cytochrome P450--redox partner fusion enzymes. , 2007, Biochimica et biophysica acta.

[61]  X. Chen,et al.  SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence , 2003, Nucleic Acids Res..

[62]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[63]  Anushya Muruganujan,et al.  PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees , 2012, Nucleic Acids Res..

[64]  Jianping Xie,et al.  The underling mechanism of bacterial TetR/AcrR family transcriptional repressors. , 2013, Cellular signalling.

[65]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[66]  Peer Bork,et al.  Functional clues for hypothetical proteins based on genomic context analysis in prokaryotes. , 2004, Nucleic acids research.

[67]  Asimul Islam,et al.  Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets , 2014, 3 Biotech.

[68]  Tim J. P. Hubbard,et al.  SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..

[69]  Diana J L Williams,et al.  The Continuing Challenges of Leprosy , 2006, Clinical Microbiology Reviews.

[70]  F. Ahmad,et al.  Structural Characterization, Homology Modeling and Docking Studies of ARG674 Mutation in MyH8 Gene Associated with Trismus-Pseudocamptodactyly Syndrome , 2014 .

[71]  J. Fetrow,et al.  Sequence- and structure-based protein function prediction from genomic information. , 2001, Current opinion in drug discovery & development.

[72]  D. T. Jones,et al.  The sequence-structure relationship and protein function prediction. , 2009, Current opinion in structural biology.

[73]  Lewis Y. Geer,et al.  CDART: protein homology by domain architecture. , 2002, Genome research.

[74]  Jonathan Bath,et al.  DNA transport in bacteria , 2001, Nature Reviews Molecular Cell Biology.

[75]  M. Reed,et al.  Contribution of the Mycobacterium tuberculosis MmpL Protein Family to Virulence and Drug Resistance , 2005, Infection and Immunity.

[76]  Ron D. Appel,et al.  ExPASy: the proteomics server for in-depth protein knowledge and analysis , 2003, Nucleic Acids Res..

[77]  W. Donachie,et al.  A new Escherichia coli cell division gene, ftsK , 1995, Journal of bacteriology.

[78]  G. Schultz,et al.  Guanylyl cyclases, a growing family of signal‐transducing enzymes , 1991, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[79]  A. Dove Predicting protein function , 1999, Nature Biotechnology.

[80]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[81]  F. Ahmad,et al.  Structure-based function analysis of putative conserved proteins with isomerase activity from Haemophilus influenzae , 2014, 3 Biotech.

[82]  F. Ahmad,et al.  Structure-based functional annotation of putative conserved proteins having lyase activity from Haemophilus influenzae , 2014, 3 Biotech.

[83]  L. Irgens [The discovery of the leprosy bacillus]. , 2002, Tidsskrift for den Norske laegeforening : tidsskrift for praktisk medicin, ny raekke.

[84]  David A. Lee,et al.  New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures , 2012, Nucleic Acids Res..