A multi-stage Gaussian transformation algorithm for clinical laboratory data.

We have developed a multi-stage computer algorithm to transform non-normally distributed data to a normal distribution. This transformation is of value for calculation of laboratory reference intervals and for normalization of clinical laboratory variates before applying statistical procedures in which underlying data normality is assumed. The algorithm is able to normalize most laboratory data distributions with either negative or positive coefficients of skewness or kurtosis. Stepwise, a logarithmic transform removes asymmetry (skewness), then a Z-score transform and power function transform remove residual peakedness or flatness (kurtosis). Powerful statistical tests of data normality in the procedure help the user evaluate both the necessity for and the success of the data transformation. Erroneous assessments of data normality caused by rounded laboratory test values have been minimized by introducing computer-generated random noise into the data values. Reference interval endpoints that were estimated parametrically (mean +/- 2 SD) by using successfully transformed data were found to have a smaller root-mean-squared error than those estimated by the non-parametric percentile technique.

[1]  D. Lacher,et al.  The multivariate reference range: an alternative interpretation of multi-test profiles. , 1982, Clinical chemistry.

[2]  N. Draper,et al.  An Alternative Family of Transformations , 1980 .

[3]  J O Westgard,et al.  Statistical analysis of method comparison data. Testing normality. , 1979, American journal of clinical pathology.

[4]  Glassman Ab,et al.  Statistical manipulation for normalization of data: in vitro thyroid function tests. , 1976 .

[5]  R E Thiers,et al.  Statistical evaluation of method-comparison data. , 1975, Clinical chemistry.

[6]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[7]  A. H. Reed,et al.  Evaluation of a transformation method for estimation of normal range. , 1974, Clinical chemistry.

[8]  K. McPherson,et al.  The frequency distributions of commonly determined blood constituents in healthy blood donors. , 1974, Clinica chimica acta; international journal of clinical chemistry.

[9]  N. Tietz,et al.  Proposed standard method for measuring lipase activity in serum by a continuous sampling technique. , 1973, Clinical Chemistry.

[10]  Ellis S. Benson,et al.  Laboratory Data Analysis System: Section III—Multivariate Normality , 1972 .

[11]  D L DeMets,et al.  Estimation of normal ranges and cumulative proportions by transforming observed distributions to gaussian form. , 1972, Clinical chemistry.

[12]  Elveback Lr How high is high? A proposed alternative to the normal range. , 1972 .

[13]  A. H. Reed,et al.  Influence of statistical method used on the resulting estimate of normal range. , 1971, Clinical chemistry.

[14]  M. Brunden,et al.  A general method of determining normal ranges applied to blood values for dogs. , 1970, American journal of clinical pathology.

[15]  L. Elveback,et al.  Health, normality, and the ghost of Gauss. , 1970, JAMA.

[16]  L. Elveback,et al.  STATISTICAL METHODS OF ESTIMATING PERCENTILES , 1969 .

[17]  D. Mainland NORMAL VALUES IN MEDICINE , 1969, Annals of the New York Academy of Sciences.

[18]  M. Healy Normal Values from a Statistical Viewpoint , 1969, Bulletin de l'Academie royale de medecine de Belgique.

[19]  G. W. Snedecor STATISTICAL METHODS , 1967 .

[20]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[21]  J. Tukey On the Comparative Anatomy of Transformations , 1957 .

[22]  Bartlett Ms The use of transformations. , 1947 .

[23]  R. Reyment THE DISCRIMINANT FUNCTION IN SYSTEMATIC BIOLOGY , 1973 .