Application of the complex multivariate normal distribution to crystallographic methods with insights into multiple isomorphous replacement phasing.

Probabilistic methods involving maximum-likelihood parameter estimation have become a powerful tool in computational crystallography. At the centre of these methods are the relevant probability distributions. Here, equations are developed based on the complex multivariate normal distribution that generalize the distributions currently used in maximum-likelihood model and heavy-atom refinement. In this treatment, the effects of various sources of error in the experiment are considered separately and allowance is made for correlations among sources of error. The multivariate distributions presented are closely related to the distributions previously derived in ab initio phasing and can be applied to many different aspects of a crystallographic structure-determination process including model refinement, density modification, heavy-atom phasing and refinement or combinations of them. The underlying probability distributions for multiple isomorphous replacement are re-examined using these techniques. The re-analysis requires the underlying assumptions to be made explicitly and results in a variance term that, unlike those previously used for maximum-likelihood multiple isomorphous replacement phasing, is expressed explicitly in terms of structure-factor covariances. Test cases presented show that the newly derived multiple isomorphous replacement likelihood functions perform satisfactorily compared with currently used programs.

[1]  G. Bricogne,et al.  [27] Maximum-likelihood heavy-atom parameter refinement for multiple isomorphous replacement and multiwavelength anomalous diffraction methods. , 1997, Methods in enzymology.

[2]  G. Bricogne [23] Bayesian statistical viewpoint on structure determination: Basic concepts and examples. , 1997, Methods in enzymology.

[3]  G. Grimmett,et al.  Probability and random processes , 2002 .

[4]  H. Hauptman On integrating the techniques of direct methods with anomalous dispersion. I. The theoretical basis , 1982 .

[5]  F. Crick,et al.  The treatment of errors in the isomorphous replacement method , 1959 .

[6]  T. Skarzynski,et al.  Crystal structure of α-dendrotoxin from the green mamba venom and its comparison with the structure of bovine pancreatic trypsin inhibitor , 1992 .

[7]  E. A. Green A new statistical model for describing errors in isomorphous replacement data: the case of one derivative , 1979 .

[8]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[9]  R. Read Structure-factor probabilities for related structures , 1990 .

[10]  C. Giacovazzo,et al.  The method of joint probability distribution functions applied to MAD techniques. The two-wavelength case for acentric crystals. , 2001, Acta crystallographica. Section A, Foundations of crystallography.

[11]  Carmelo Giacovazzo,et al.  Direct Phasing in Crystallography: Fundamentals and Applications , 1998 .

[12]  G. Bricogne Maximum entropy and the foundations of direct methods , 1984 .

[13]  J. Karle,et al.  The calculation of ε associated with normalized structure factors, E , 1976 .

[14]  A. Klug Joint probability distribution of structure factors and the phase problem , 1958 .

[15]  C. Giacovazzo,et al.  The method of joint probability distribution functions applied to the one-wavelength anomalous-scattering (OAS) case. , 2001, Acta crystallographica. Section A, Foundations of crystallography.

[16]  H. Hauptman Solution of the phase problem , 1953 .

[17]  George H. Weiss,et al.  Introduction to Crystallographic Statistics , 1995 .

[18]  K. Pearson,et al.  Biometrika , 1902, The American Naturalist.

[19]  C. Giacovazzo The estimation of two-phase and three-phase invariants in P1 when anomalous scatterers are present , 1983 .

[20]  R. Wooding The multivariate distribution of complex normal variables , 1956 .

[21]  G. Bricogne A Bayesian statistical theory of the phase problem. I. A multichannel maximum-entropy formalism for constructing generalized joint probability distributions of structure factors , 1988 .

[22]  M. James,et al.  Structure of the calcium regulatory muscle protein troponin-C at 2.8 Å resolution , 1985, Nature.

[23]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.