Integrative analysis workflow for the structural and functional classification of C-type lectins

BackgroundIt is important to understand the roles of C-type lectins in the immune system due to their ubiquity and diverse range of functions in animal cells. It has been observed that currently confirmed C-type lectins share a highly conserved domain known as the C-type carbohydrate recognition domain (CRD). Using the sequence profile of the CRD, an increasing number of putative C-type lectins have been identified. Hence, it is highly needed to develop a systematic framework that enables us to elucidate their carbohydrate (glycan) recognition function, and discover their physiological and pathological roles.ResultsPresented herein is an integrated workflow for characterizing the sequence and structural features of novel C-type lectins. Our workflow utilizes web-based queries and available software suites to annotate features that can be found on the C-type lectin, given its amino acid sequence. At the same time, it incorporates modeling and analysis of glycans - a major class of ligands that interact with C-type lectins. Thereafter, the results are analyzed together with context-specific knowledge to filter off unlikely predictions. This allows researchers to design their subsequent experiments to confirm the functions of the C-type lectins in a systematic manner.ConclusionsThe efficacy and usefulness of our proposed immunoinformatics workflow was demonstrated by applying our integrated workflow to a novel C-type lectin -CLEC17A - and we report some of its possible functions that warrants further validation through wet-lab experiments.

[1]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[2]  C. Venkatachalam,et al.  LigScore: a novel scoring function for predicting binding affinities. , 2005, Journal of molecular graphics & modelling.

[3]  David A. Lee,et al.  Predicting protein function from sequence and structure , 2007, Nature Reviews Molecular Cell Biology.

[4]  Thierry Langer,et al.  Virtual screening for the discovery of bioactive natural products , 2008, Progress in drug research. Fortschritte der Arzneimittelforschung. Progres des recherches pharmaceutiques.

[5]  C. Venkatachalam,et al.  LigandFit: a novel method for the shape-directed rapid docking of ligands to protein active sites. , 2003, Journal of molecular graphics & modelling.

[6]  Maureen E. Taylor,et al.  Glycan arrays for functional glycomics , 2002, Genome Biology.

[7]  T. Soderling,et al.  A structural basis for substrate specificities of protein Ser/Thr kinases: primary sequence preference of casein kinases I and II, NIMA, phosphorylase kinase, calmodulin-dependent kinase II, CDK5, and Erk1 , 1996, Molecular and cellular biology.

[8]  Maureen E. Taylor,et al.  Structural insights into what glycan arrays tell us about how glycan-binding proteins interact with their ligands , 2009, Glycobiology.

[9]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[10]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[11]  Richard M. Jackson,et al.  An evaluation of automated homology modelling methods at low target-template sequence similarity , 2007, Bioinform..

[12]  Mark S. Johnson,et al.  Generating Conformer Ensembles Using a Multiobjective Genetic Algorithm , 2007, J. Chem. Inf. Model..

[13]  S. Brunak,et al.  Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. , 2005, Glycobiology.

[14]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[15]  T Kita,et al.  Biosynthesis and Post-translational Processing of Lectin-like Oxidized Low Density Lipoprotein Receptor-1 (LOX-1) , 2000, The Journal of Biological Chemistry.

[16]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[17]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[18]  Jaques Reifman,et al.  PSPP: A Protein Structure Prediction Pipeline for Computing Clusters , 2009, PloS one.

[19]  Y. Kooyk,et al.  Protein-glycan interactions in the control of innate and adaptive immune responses , 2008, Nature Immunology.

[20]  Jacob P. Brady,et al.  Prolectin, a Glycan-binding Receptor on Dividing B Cells in Germinal Centers , 2009, The Journal of Biological Chemistry.

[21]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[22]  D J Diller,et al.  High throughput docking for library design and library prioritization , 2001, Proteins.

[23]  Peer Bork,et al.  SMART 6: recent updates and new developments , 2008, Nucleic Acids Res..

[24]  D. Underhill,et al.  The many faces of ITAMs. , 2007, Trends in immunology.

[25]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[26]  W. Weis,et al.  The C‐type lectin superfamily in the immune system , 1998, Immunological reviews.

[27]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[28]  Anne Imberty,et al.  Molecular Basis of the Differences in Binding Properties of the Highly Related C-type Lectins DC-SIGN and L-SIGN to Lewis X Trisaccharide and Schistosoma mansoni Egg Antigens* , 2004, Journal of Biological Chemistry.

[29]  R. Rudolph,et al.  Artificial, non-antibody binding proteins for pharmaceutical and industrial applications. , 2005, Trends in biotechnology.

[30]  Carl G. Figdor,et al.  Dendritic-cell immunotherapy: from ex vivo loading to in vivo targeting , 2007, Nature Reviews Immunology.

[31]  J. Gready,et al.  The C‐type lectin‐like domain superfamily , 2005, The FEBS journal.

[32]  S. Gringhuis,et al.  Signalling through C-type lectin receptors: shaping immune responses , 2009, Nature Reviews Immunology.

[33]  E. Kieff,et al.  The structural basis for the recognition of diverse receptor sequences by TRAF2. , 1999, Molecular cell.

[34]  Christian J. A. Sigrist,et al.  Nucleic Acids Research Advance Access published November 14, 2007 The 20 years of PROSITE , 2007 .

[35]  Jing-Fei Huang,et al.  Discovery Studio 2.0的模块扩展及应用:残基水平非键相互作用能量的自动批量计算 , 2011 .

[36]  Torsten Schwede,et al.  BIOINFORMATICS Bioinformatics Advance Access published November 12, 2005 The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling , 2022 .

[37]  Russ B Altman,et al.  Improving structure-based function prediction using molecular dynamics. , 2009, Structure.

[38]  Wonjun Park,et al.  An alpha-numeric code for representing N-linked glycan structures in secreted glycoproteins , 2009, Bioprocess and biosystems engineering.

[39]  Y. van Kooyk,et al.  Recognition of tumor glycans by antigen-presenting cells. , 2006, Current opinion in immunology.

[40]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[41]  Robert D. Finn,et al.  InterPro: the integrative protein signature database , 2008, Nucleic Acids Res..

[42]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[43]  Ajay N. Jain Scoring noncovalent protein-ligand interactions: A continuous differentiable function tuned to compute binding affinities , 1996, J. Comput. Aided Mol. Des..

[44]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[45]  Tibor Kozár,et al.  Glycan and lectin microarrays for glycomics and medicinal applications , 2010, Medicinal research reviews.