Tympanic temperature measurements: Are they reliable in the critically ill? A clinical study of measures of agreement*

Objective: Accurate measurement of temperature is vital in the intensive care setting. A prospective trial was performed to compare the accuracy of tympanic, urinary, and axillary temperatures with that of pulmonary artery (PA) core temperature measurements. Design: A total of 110 patients were enrolled in a prospective observational cohort study. Setting: Multidisciplinary intensive care unit of a university teaching hospital. Patients: The cohort was (mean ± sd) 65 ± 16 yrs of age, Acute Physiology and Chronic Health Evaluation (APACHE) II score was 25 ± 9, 58% of the patients were men, and 76% were mechanically ventilated. The accuracy of tympanic (averaged over both ears), axillary (averaged over both sides), and urinary temperatures was referenced (as mean difference, &Dgr; degrees centigrade) to PA temperatures as standard in 6,703 recordings. Lin concordance correlation (pc) and Bland–Altman 95% limits of agreement (degrees centigrade) described the relationship between paired measurements. Regression analysis (linear mixed model) assessed covariate confounding with respect to temperature modes and reliability formulated as an intraclass correlation coefficient. Measurements and Main Results: Concordance of PA temperatures with tympanic, urinary, and axillary was 0.77, 0.92, and 0.83, respectively. Compared with PA temperatures, &Dgr; (limits of agreement) were 0.36°C (−0.56°C, 1.28°C), −0.05°C (−0.69°C, 0.59°C), and 0.30°C (−0.42°C, 1.01°C) for tympanic, urinary, and axillary temperatures, respectively. Temperature measurement mode effect, estimated via regression analysis, was consistent with concordance and &Dgr; (PA vs. urinary, p = .98). Patient age (p = .03), sedation score (p = .0001), and dialysis (p = .0001) had modest negative relations with temperature; quadratic relationships were identified with adrenaline and dobutamine. No interactions with particular temperature modes were identified (p ≥ .12 for all comparisons) and no relationship was identified with either mean arterial pressure or APACHE II score (p ≥ .64). The average temperature mode intraclass correlation coefficient for test–retest reliability was 0.72. Conclusion: Agreement of tympanic with pulmonary temperature was inferior to that of urinary temperature, which, on overall assessment, seemed more likely to reflect PA core temperature.

[1]  Peter Schuck,et al.  Assessing reproducibility for interval data in health-related quality of life questionnaires: Which coefficient should be used? , 2004, Quality of Life Research.

[2]  Sophia Rabe-Hesketh,et al.  Multilevel and Longitudinal Modeling Using Stata , 2005 .

[3]  Nicholas J. Cox,et al.  A Multivariable Scatterplot Smoother , 2005 .

[4]  R Bender,et al.  [Comparing methods of measurement]. , 2007, Deutsche medizinische Wochenschrift.

[5]  Diane K. Michelson,et al.  Components of Variance , 2003, Technometrics.

[6]  G. Molenberghs,et al.  Applying linear mixed models to estimate reliability in clinical trial data with repeated measurements. , 2004, Controlled clinical trials.

[7]  P. Mackowiak,et al.  Effects of anatomic site, oral stimulation, and body position on estimates of body temperature. , 1996, Archives of internal medicine.

[8]  Kernel Density Estimators: An Approach to Understanding How Groups Differ , 2004 .

[9]  G. Brengelmann,et al.  Independence of brain and tympanic temperatures in an unanesthetized human. , 1988, Journal of applied physiology.

[10]  T. Schmitz,et al.  A comparison of five methods of temperature measurement in febrile intensive care patients. , 1995, American journal of critical care : an official publication, American Association of Critical-Care Nurses.

[11]  D. Altman,et al.  A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement. , 1990, Computers in biology and medicine.

[12]  Ronir Raggio Luiz,et al.  More than one statistical strategy to assess agreement of quantitative measurements may usefully be reported. , 2005, Journal of clinical epidemiology.

[13]  M. Joffres,et al.  Oesophageal, rectal, axillary, tympanic and pulmonary artery temperatures during cardiac surgery , 1998, Canadian journal of anaesthesia = Journal canadien d'anesthesie.

[14]  R. Fildes Conditioning Diagnostics: Collinearity and Weak Data in Regression , 1993 .

[15]  Bendix Carstensen,et al.  Comparing and predicting between several methods of measurement. , 2004, Biostatistics.

[16]  K K Giuliano,et al.  Temperature measurement in critically ill orally intubated adults: a comparison of pulmonary artery core, tympanic, and oral methods. , 1999, Critical care medicine.

[17]  J J Bartko,et al.  Measures of agreement: a single procedure. , 1994, Statistics in medicine.

[18]  B. Binkowitz,et al.  Guidelines for measurement validation in clinical trial design. , 1999, Journal of biopharmaceutical statistics.

[19]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[20]  Michael Buist,et al.  Induced hypothermia in critical care medicine: A review , 2003, Critical care medicine.

[21]  Graham Dunn,et al.  Statistical Evaluation of Measurement Errors: Design and Analysis of Reliability Studies , 2004 .

[22]  L. Lin,et al.  A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.

[23]  M. J. Romano,et al.  Infrared tympanic thermometry in the pediatric intensive care unit , 1993, Critical care medicine.

[24]  G Dunn,et al.  Modelling method comparison data , 1999, Statistical methods in medical research.

[25]  S. Sagawa,et al.  Esophageal and tympanic temperature responses to core blood temperature changes during hyperthermia. , 1986, Journal of applied physiology.

[26]  P. Vargha,et al.  A critical discussion of intraclass correlation coefficients. , 1997, Statistics in medicine.

[27]  M. Stokes,et al.  Reliability of assessment tools in rehabilitation: an illustration of appropriate statistical analyses , 1998, Clinical rehabilitation.

[28]  David A. Belsley,et al.  Conditioning Diagnostics: Collinearity and Weak Data in Regression , 1991 .

[29]  John Ludbrook,et al.  Statistical Techniques For Comparing Measurers And Methods Of Measurement: A Critical Review , 2002, Clinical and experimental pharmacology & physiology.

[30]  R. Erickson,et al.  Accuracy of infrared ear thermometry and other temperature methods in adults. , 1994, American journal of critical care : an official publication, American Association of Critical-Care Nurses.

[31]  R. Erickson The continuing question of how best to measure body temperature. , 1999, Critical care medicine.

[32]  C. Nickerson A note on a concordance correlation coefficient to evaluate reproducibility , 1997 .

[33]  N F de Keizer,et al.  Reliability and accuracy of Sequential Organ Failure Assessment (SOFA) scoring , 2005, Critical care medicine.

[34]  G. Dunn,et al.  Design and analysis of reliability studies. , 1992, Statistical methods in medical research.

[35]  P. Fulbrook Core temperature measurement: a comparison of rectal, axillary and pulmonary artery blood temperature. , 1993, Intensive & critical care nursing.

[36]  J. Ludbrook SPECIAL ARTICLE COMPARING METHODS OF MEASUREMENT , 1997 .

[37]  A. Hedayat,et al.  Statistical Methods in Assessing Agreement , 2002 .

[38]  T Togawa,et al.  Body temperature measurement. , 1985, Clinical physics and physiological measurement : an official journal of the Hospital Physicists' Association, Deutsche Gesellschaft fur Medizinische Physik and the European Federation of Organisations for Medical Physics.

[39]  Graham Dunn,et al.  Review papers : Design and analysis of reliability studies , 1992 .

[40]  A. Beckett,et al.  AKUFO AND IBARAPA. , 1965, Lancet.

[41]  Y. Amoateng-Adjepong,et al.  Accuracy of an infrared tympanic thermometer. , 1999, Chest.

[42]  Charles E. Smith,et al.  Comparison of esophageal, tympanic, and forehead skin temperatures in adult patients. , 1996, Journal of clinical anesthesia.

[43]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[44]  J. Levin,et al.  Should providers of treatment be regarded as a random factor? If it ain't broke, don't "fix" it: a comment on Siemer and Joormann (2003). , 2003, Psychological methods.

[45]  M. Petersen,et al.  Can training improve the results with infrared tympanic thermometers? , 1997, Acta anaesthesiologica Scandinavica.

[46]  J. Kuha AIC and BIC , 2004 .

[47]  D. Altman,et al.  Comparing methods of measurement: why plotting difference against standard method is misleading , 1995, The Lancet.

[48]  D. Koh,et al.  Statistical evaluation of agreement between two methods for measuring a quantitative variable. , 1989, Computers in biology and medicine.

[49]  Douglas G Altman,et al.  Dichotomizing continuous predictors in multiple regression: a bad idea , 2006, Statistics in medicine.

[50]  Maurice G. Kendall,et al.  A THEORY OF RANDOMNESS , 1941 .

[51]  K. Dracup,et al.  Urinary Bladder and Rectal Temperature Monitoring During Clinical Hypothermia , 1989, Nursing research.

[52]  Comparison of Tympanic, Esophageal and Blood Temperatures during Mild Hypothermic Cardiopulmonary Bypass: A Study using an Infrared Emission Detection Tympanic Thermometer , 2004, Journal of Clinical Monitoring.

[53]  K. Linnet,et al.  Evaluation of regression procedures for methods comparison studies. , 1993, Clinical chemistry.

[54]  P. Macintyre,et al.  Acute Pain Management - A Practical Guide , 2001 .

[55]  L. Muller,et al.  Temperature measurement in intensive care patients: comparison of urinary bladder, oesophageal, rectal, axillary, and inguinal methods versus pulmonary artery core method , 2003, Intensive Care Medicine.

[56]  Edwin L. Bradley,et al.  An omnibus test for comparing two measuring devices , 1991 .

[57]  J. Youngblut,et al.  A comparison of pulmonary artery, rectal, and tympanic membrane temperature measurement in the ICU. , 1993, Heart & lung : the journal of critical care.

[58]  A. Giuliano,et al.  Temperature measurement in critically ill adults: a comparison of tympanic and oral methods. , 2000, American journal of critical care : an official publication, American Association of Critical-Care Nurses.

[59]  Lluís Jover,et al.  Estimating the Generalized Concordance Correlation Coefficient through Variance Components , 2003, Biometrics.

[60]  D. Nierman,et al.  Core temperature measurement in the intensive care unit , 1991, Critical care medicine.

[61]  R S Erickson,et al.  Comparison of ear‐based, bladder, oral, and axillary methods for core temperature measurement , 1993, Critical care medicine.

[62]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[63]  Patty Solomon,et al.  Components of Variance , 2002 .

[64]  E. Legome Infrared tympanic thermometry in the pediatric intensive care unit: Romano MJ, Fortenberry JD, Autrey E, et al Crit Care Med 21:1181–1185 Aug 1993 , 1994 .