Factors correlating with significant differences between X-ray structures of myoglobin.

Validation of general ideas about the origins of conformational differences in proteins is critical in order to arrive at meaningful functional insights. Here, principal component analysis (PCA) and distance difference matrices are used to validate some such ideas about the conformational differences between 291 myoglobin structures from sperm whale, horse and pig. Almost all of the horse and pig structures form compact PCA clusters with only minor coordinate differences and outliers that are easily explained. The 222 whale structures form a few dense clusters with multiple outliers. A few whale outliers with a prominent distortion of the GH loop are very similar to the cluster of horse structures, which all have a similar GH-loop distortion apparently owing to intermolecular crystal lattice hydrogen bonds to the GH loop from residues near the distal histidine His64. The variations of the GH-loop coordinates in the whale structures are likely to be owing to the observed alternative intermolecular crystal lattice bond, with the change to the GH loop distorting bonds correlated with the binding of specific `unusual' ligands. Such an alternative intermolecular bond is not observed in horse myoglobins, obliterating any correlation with the ligands. Intermolecular bonds do not usually cause significant coordinate differences and cannot be validated as their universal cause. Most of the native-like whale myoglobin structure outliers can be correlated with a few specific factors. However, these factors do not always lead to coordinate differences beyond the previously determined uncertainty thresholds. The binding of unusual ligands by myoglobin, leading to crystal-induced distortions, suggests that some of the conformational differences between the apo and holo structures might not be `functionally important' but rather artifacts caused by the binding of `unusual' substrate analogs. The causes of P6 symmetry in myoglobin crystals and the relationship between crystal and solution structures are also discussed.

[1]  Gerard J. Kleywegt,et al.  On vital aid: the why, what and how of validation , 2009, Acta crystallographica. Section D, Biological crystallography.

[2]  R. Nussinov,et al.  Induced Fit, Conformational Selection and Independent Dynamic Segments: an Extended View of Binding Events Opinion , 2022 .

[3]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[4]  Mark Gerstein,et al.  Tools and databases to analyze protein flexibility; approaches to mapping implied features onto sequences. , 2003, Methods in enzymology.

[5]  Q. Gibson,et al.  Mapping the Pathways for O2 Entry Into and Exit from Myoglobin* , 2001, The Journal of Biological Chemistry.

[6]  Nathaniel Echols,et al.  Accessing protein conformational ensembles using room-temperature X-ray crystallography , 2011, Proceedings of the National Academy of Sciences.

[7]  A. Rashin,et al.  Diversity of function-related conformational changes in proteins: coordinate uncertainty, fragment rigidity, and stability. , 2010, Biochemistry.

[8]  M. DePristo,et al.  Heterogeneity and inaccuracy in protein structures solved by X-ray crystallography. , 2004, Structure.

[9]  T J Oldfield,et al.  Determination of the crystal structure of recombinant pig myoglobin by molecular replacement and its refinement. , 1990, Acta crystallographica. Section B, Structural science.

[10]  Evgeny B. Krissinel,et al.  Crystal contacts as nature's docking solutions , 2010, J. Comput. Chem..

[11]  G. Phillips,et al.  Comparison of the dynamics of myoglobin in different crystal forms. , 1990, Biophysical journal.

[12]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[13]  Rodrigo Lopez,et al.  A new bioinformatics analysis tools framework at EMBL–EBI , 2010, Nucleic Acids Res..

[14]  J. Janin,et al.  A dissection of specific and non-specific protein-protein interfaces. , 2004, Journal of molecular biology.

[15]  Pertseva Mn,et al.  Thermal expansion of a protein. , 1987 .

[16]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[17]  E J Dodson,et al.  Determination and restrained least-squares refinement of the structures of ribonuclease Sa and its complex with 3'-guanylic acid at 1.8 A resolution. , 1991, Acta crystallographica. Section B, Structural science.

[18]  Zbigniew Dauter,et al.  Stereochemical restraints revisited: how accurate are refinement targets and how much should protein structures be allowed to deviate from them? , 2007, Acta crystallographica. Section D, Biological crystallography.

[19]  M. Bolognesi,et al.  X-ray crystal structure of the ferric sperm whale myoglobin: imidazole complex at 2.0 A resolution. , 1991, Journal of molecular biology.

[20]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[21]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[22]  Randy J. Read,et al.  Overview of the CCP4 suite and current developments , 2011, Acta crystallographica. Section D, Biological crystallography.

[23]  F. Yang,et al.  Crystal structures of CO-, deoxy- and met-myoglobins at various pH values. , 1996, Journal of molecular biology.

[24]  Robert L Jernigan,et al.  Protein flexibility: coordinate uncertainties and interpretation of structural differences. , 2009, Acta crystallographica. Section D, Biological crystallography.

[25]  R Abagyan,et al.  Evaluating the energetics of empty cavities and internal mutations in proteins , 1997, Protein science : a publication of the Protein Society.

[26]  H Frauenfelder,et al.  The role of structure, energy landscape, dynamics, and allostery in the enzymatic function of myoglobin , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Oliviero Carugo,et al.  Protein—protein crystal‐packing contacts , 1997, Protein science : a publication of the Protein Society.

[28]  S H Bryant,et al.  Extent and nature of contacts between protein molecules in crystal lattices and between subunits of protein oligomers , 1997, Proteins.

[29]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[30]  Hiroyuki Toh,et al.  Statistical estimation of cluster boundaries in gene expression profile data , 2001, Bioinform..

[31]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[32]  Francis Rodier,et al.  Protein–protein interaction at crystal contacts , 1995, Proteins.

[33]  J. Janin,et al.  Structural domains in proteins and their role in the dynamics of protein function. , 1983, Progress in biophysics and molecular biology.

[34]  Boguslaw Stec,et al.  Sampling of the native conformational ensemble of myoglobin via structures in different crystalline environments , 2007, Proteins.