Logarithmic Transformations in Regression: Do You Transform Back Correctly?

Abstract The logarithmic transformation is often used in regression analysis for a variety of purposes such as the linearization of a nonlinear relationship between two or more variables. We have noticed that when this transformation is applied to the response variable, the computation of the point estimate of the conditional mean of the original response variable is often incorrect. Although the correct procedure has long been known in the scientific community and is well documented, an incorrect or misleading procedure is used in many business statistics textbooks. This incorrect procedure results in errors that can be quite significant. Our article uses a real-data business example which, in the context of making a decision about an advertising charge for a magazine, illustrates the correct procedure. This example also provides a sense of the magnitude of the error that would result if the incorrect procedure were used. The six percent error in our example could be substantially higher for other applications.

[1]  Dean W. Wichern,et al.  Business Statistics: Decision Making with Data , 1996 .

[2]  Dennis J. Sweeney,et al.  Essentials of statistics for business and economics , 1984 .

[3]  G. Peterson,et al.  Log Transformation Bias in Estimating Travel Cost Models , 1986 .

[4]  S. Christian Albright,et al.  Data Analysis and Decision Making , 2004 .

[5]  S. McLeod Overworked and Underpaid , 2002 .

[6]  Peter C. Bell,et al.  Statistics for business with spreadsheets : text & cases , 1992 .

[7]  E. Ziegel Basic Statistical Ideas for Managers (2nd ed.) , 2005 .

[8]  I. J. Good,et al.  Statistical Analysis for Decision Making, 2nd Edition. , 1978 .

[9]  Pallavi Chitturi Introduction to Business Statistics (6th ed.) , 2008 .

[10]  Kenneth Mullen,et al.  Applied Statistics for Business and Economics , 1975 .

[11]  Dennis J. Sweeney,et al.  Statistics for Business and Economics (11th Edition) , 2011 .

[12]  DAVID G. KENDALL,et al.  Introduction to Mathematical Statistics , 1947, Nature.

[13]  T. E. Dielman Applied Regression Analysis for Business and Economics , 1990 .

[14]  Thomas H. Short Applied Statistics With Microsoft Excel® , 2003, Technometrics.

[15]  Ronald D. Snee,et al.  Statistical Thinking: Improving Business Performance , 2001 .

[16]  R. Lyman Ott.,et al.  An introduction to statistical methods and data analysis , 1977 .

[17]  W. A. Ericson Introduction to Mathematical Statistics, 4th Edition , 1972 .

[18]  Amir D. Aczel Complete Business Statistics , 1992 .

[19]  Donald L. Harnett,et al.  Data, statistics, and decision models with Excel , 1998 .

[20]  Shuangzhe Liu,et al.  Regression diagnostics , 2020, Applied Quantitative Analysis for Real Estate.

[21]  Eric R. Ziegel Data Analysis for Managers with Microsoft® Excel , 2004, Technometrics.

[22]  N. Duan Smearing Estimate: A Nonparametric Retransformation Method , 1983 .

[23]  Mark L. Berenson,et al.  Basic Business Statistics : Concepts and Applications , 2007 .

[24]  David S. Moore,et al.  The Practice of Business Statistics: Using Data for Decisions , 2002 .

[25]  Richard J. Smith Logarithmic transformation bias in allometry , 1993 .

[26]  Henry W. Altland,et al.  Applied Regression Analysis for Business and Economics , 2002, Technometrics.

[27]  D. J. Finney On the Distribution of a Variate Whose Logarithm is Normally Distributed , 1941 .

[28]  Patrick Shannon,et al.  Business Statistics: A Decision-Making Approach , 1981 .

[29]  Harvey J. Brightman,et al.  Data Analysis In Plain English with Microsoft Excel , 1998 .