Inter-rater agreement in the assessment of abnormal chest X-ray findings for tuberculosis between two Asian countries

Background: Inter-rater agreement in the interpretation of chest X-ray (CXR) films is crucial for clinical and epidemiological studies of tuberculosis. We compared the readings of CXR films used for a survey of tuberculosis between raters from two Asian countries.

Methods: Of the 11,624 people enrolled in a prevalence survey in Hanoi, Viet Nam, in 2003, we studied 258 individuals whose CXR films did not exclude the possibility of active tuberculosis. Follow-up films obtained from accessible individuals in 2006 were also analyzed. Two Japanese and two Vietnamese raters read the CXR films based on a coding system proposed by Den Boon et al. and another system newly developed in this study. Inter-rater agreement was evaluated by kappa statistics. Marginal homogeneity was evaluated by the generalized estimating equation (GEE).

Results: CXR findings suspected of tuberculosis differed between the four raters. The frequencies of infiltrates and fibrosis/scarring detected on the films differed significantly between the raters from the two countries (P < 0.0001 and P = 0.0082, respectively, by GEE). The definition of findings used in the coding systems, such as primary cavity, also affected the degree of agreement.

Conclusions: CXR findings were inconsistent between raters with different backgrounds. High inter-rater agreement is a necessary component of an optimal CXR coding system, particularly in international studies. An analysis of reading results and a thorough discussion to reach a consensus would be necessary to achieve further consistency and high quality of reading.
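For readers unfamiliar with the kappa statistic mentioned in the Methods, the following is a minimal sketch of Cohen's kappa for two raters, which corrects the observed proportion of agreement for the agreement expected by chance. It is illustrative only: the function name `cohen_kappa` and the two rating lists are hypothetical and are not data from this study, and the abstract does not specify which kappa variant or software the authors used.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who each coded the same films."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must code the same non-empty set of films")
    n = len(ratings_a)

    # Observed agreement: share of films given the same code by both raters.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: product of the raters' marginal proportions,
    # summed over every category either rater used.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    categories = set(freq_a) | set(freq_b)
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both raters used one identical category for every film.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical codes (1 = infiltrate recorded, 0 = not recorded) for ten films.
rater_1 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 1]
rater_2 = [1, 0, 0, 0, 1, 0, 1, 1, 0, 1]

kappa = cohen_kappa(rater_1, rater_2)
print(f"kappa = {kappa:.2f}")  # prints "kappa = 0.60"
```

In this made-up example the raters agree on 8 of 10 films (observed agreement 0.8), while chance alone would yield 0.5, giving a kappa of 0.6. Kappa measures agreement on individual films; it does not test whether the raters' overall frequencies of a finding differ, which is the question of marginal homogeneity that the study addressed with GEE.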
