The development of a statistical computer software resource for medical research

Abstract

The Development of a Statistical Software Resource for Medical Research: MD Thesis of Iain Edward Buchan

Medical research is often weakened by poor statistical practice, and inappropriate use of statistical computer software is part of this problem. The statistical knowledge that medical researchers require has traditionally been gained in both dedicated and ad hoc learning time, often separate from the research processes in which the statistical methods are applied. Computer software, however, can be written to flexibly support statistical practice. The work of this thesis was to explore whether a resource could support medical researchers in statistical knowledge and calculation at the point of need and, if so, to create one.

The work was carried out over eleven years, and was directed towards the medical research community in general. Statistical and software engineering methods were used to produce a unified statistical computational and knowledge support resource. Mathematically and computationally robust approaches to statistical methods were continually sought from current literature. The evaluation undertaken was formative; it included monitoring uptake of the software and feedback from its users, comparisons with other software, reviews in peer-reviewed publications, and testing of results against classical and reference data. Large-scale opportunistic feedback from users of this resource was employed in its continuous improvement.

The software resulting from the work of this thesis is provided herein as supportive evidence. Results of applying the software to classical reference data are shown in the written thesis. The scope and presentation of statistical methods are considered in a comparison of the software with common statistical software resources.
This comparison showed that the software written for this thesis more closely matched statistical methods commonly used in medical research, and contained more statistical knowledge support materials. Up to October 31st 2000, uptake of the software was recorded for 5621 separate instances by individuals or institutions. The development has been self-sustaining.

Medical researchers need to have sufficient statistical understanding, just as statistical researchers need to sufficiently understand the nature of data. Statistical software tools may damage statistical practice if they divert attention from statistical goals and tasks onto the tools themselves. The work of this thesis provides a practical computing framework supporting statistical knowledge and calculation in medical research. This work has shown that sustainable software can be engineered to improve statistical appreciation and practice in ways that are beyond the reach of traditional medical statistical education.
