Homology modeling of an RNP domain from a human RNA‐binding protein: Homology‐constrained energy optimization provides a criterion for distinguishing potential sequence alignments

We have recently described an automated approach for homology modeling using restrained molecular dynamics and simulated annealing procedures (Li et al, Protein Sci., 6:956–970,1997). We have employed this approach for constructing a homology model of the putative RNA‐binding domain of the human RNA‐binding protein with multiple splice sites (RBP‐MS). The regions of RBP‐MS which are homologous to the template protein snRNP U1A were constrained by “homology distance constraints,” while the conformation of the non‐homologous regions were defined only by a potential energy function. A full energy function without explicit solvent was employed to ensure that the calculated structures have good conformational energies and are physically reasonable. The effects of misalignment of the unknown and the template sequences were also explored in order to determine the feasibility of this homology modeling method for distinguishing possible sequence alignments based on considerations of the resulting conformational energies of modeled structures. Differences in the alignments of the unknown and the template sequences result in significant differences in the conformational energies of the calculated homology models. These results suggest that conformational energies and residual constraint violations in these homology‐constrained simulated annealing calculations can be used as criteria to distinguish between correct and incorrect sequence alignments and chain folds. Proteins 33:558–566, 1998. © 1998 Wiley‐Liss, Inc.

[1]  B. Kastner,et al.  Structure of spliceosomal snRNPs and their role in pre-mRNA splicing. , 1990, Biochimica et biophysica acta.

[2]  C. Epstein,et al.  A Review of its Symptomatology, Natural History, Pathologic Features, Genetics And Relationship to the Natural Aging Process , 1966 .

[3]  J. Hein Unified approach to alignment and phylogenies. , 1990, Methods in enzymology.

[4]  G. Varani,et al.  Solution structure of the N-terminal RNP domain of U1A protein: the role of C-terminal residues in structure stability and RNA binding. , 1996, Journal of molecular biology.

[5]  M. Wittekind,et al.  Interaction of the RNA‐binding domain of the hnRNP C proteins with RNA. , 1992, The EMBO journal.

[6]  Timothy F. Havel,et al.  A new method for building protein conformations from sequence alignments with homologues of known structure. , 1991, Journal of molecular biology.

[7]  T. Matsumoto,et al.  A unique human gene that spans over 230 kb in the human chromosome 8p11-12 and codes multiple family proteins sharing RNA-binding motifs. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[8]  G. Montelione,et al.  Solution NMR structure of the major cold shock protein (CspA) from Escherichia coli: identification of a binding epitope for DNA. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[9]  D. Drayna,et al.  Genetic linkage of Werner's syndrome to five markers on chromosome 8 , 1992, Nature.

[10]  F. Bosman,et al.  Loss of heterozygosity of chromosome 8 microsatellite loci implicates a candidate tumor suppressor gene between the loci D8S87 and D8S133 in human prostate cancer. , 1994, Cancer research.

[11]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[12]  H. Bellen,et al.  The Drosophila couch potato protein is expressed in nuclei of peripheral neuronal precursors and shows homology to RNA-binding proteins. , 1992, Genes & development.

[13]  G. Varani,et al.  NMR studies of U1 snRNA recognition by the N‐terminal RNP domain of the human U1A protein. , 1994, The EMBO journal.

[14]  Nobutoshi Ito,et al.  Crystal structure at 1.92 Å resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin , 1994, Nature.

[15]  G. Montelione,et al.  Simulated annealing with restrained molecular dynamics using CONGEN: Energy refinement of the NMR solution structures of epidermal and type‐α transforming growth factors , 1996, Protein science : a publication of the Protein Society.

[16]  Timothy F. Havel Predicting the Structure of the Flavodoxin from Eschericia coli by Homology Modeling, Distance Geometry and Molecular Dynamics , 1993 .

[17]  D. Scherly,et al.  Major determinants of the specificity of interaction between small nuclear ribonucleoproteins U1A and U2B" and their cognate RNAs , 1990, Nature.

[18]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[19]  P. Sillekens,et al.  cDNA cloning of the human U1 snRNA‐associated A protein: extensive homology between U1 and U2 snRNP‐specific proteins. , 1987, The EMBO journal.

[20]  S. Sudarsanam,et al.  An automated method for modeling proteins on known templates using distance geometry , 1993, Protein science : a publication of the Protein Society.

[21]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[22]  P. Evans,et al.  Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A , 1990, Nature.

[23]  A Sali,et al.  Modeling mutations and homologous proteins. , 1995, Current opinion in biotechnology.

[24]  G. Schellenberg,et al.  Positional Cloning of the Werner's Syndrome Gene , 1996, Science.

[25]  D. Scherly,et al.  Conserved amino acid residues within and outside of the N-terminal ribonucleoprotein motif of U1A small nuclear ribonucleoprotein involved in U1 RNA binding. , 1991, Journal of molecular biology.

[26]  G. Schellenberg,et al.  Homozygosity mapping and Werner's syndrome , 1992, The Lancet.

[27]  Y. Nakamura,et al.  Evidence for the presence of two tumor suppressor genes on chromosome 8p for colorectal carcinoma. , 1993, Cancer research.

[28]  T. Sasazuki,et al.  Family analysis of Werner's syndrome: A survey of 42 Japanese families with a review of the literature , 1981, Clinical genetics.

[29]  C. Sander,et al.  From genome sequences to protein function , 1994 .

[30]  G T Montelione,et al.  Homology modeling using simulated annealing of restrained molecular dynamics and conformational search calculations with CONGEN: Application in predicting the three‐dimensional structure of murine homeodomain Msx‐1 , 1997, Protein science : a publication of the Protein Society.

[31]  M. Inouye,et al.  Major cold shock protein of Escherichia coli. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Roberto Tejero,et al.  Simulated annealing with restrained molecular dynamics using a flexible restraint potential: Theory and evaluation with simulated NMR constraints , 1996, Protein science : a publication of the Protein Society.

[33]  R. Perham,et al.  Prediction of the three‐dimensional structures of the biotinylated domain from yeast pyruvate carboxylase and of the lipoylated H‐protein from the pea leaf glycine cleavage system: A new automated method for the prediction of protein tertiary structure , 1993, Protein science : a publication of the Protein Society.

[34]  M. Karplus,et al.  Conformational sampling using high‐temperature molecular dynamics , 1990, Biopolymers.