Inter-observer Variability Analysis of Automatic Lung Delineation in Normal and Disease Patients

Human interaction has become almost mandatory for an automated medical system wishing to be accepted by clinical regulatory agencies such as Food and Drug Administration. Since this interaction causes variability in the gathered data, the inter-observer and intra-observer variability must be analyzed in order to validate the accuracy of the system. This study focuses on the variability from different observers that interact with an automated lung delineation system that relies on human interaction in the form of delineation of the lung borders. The database consists of High Resolution Computed Tomography (HRCT): 15 normal and 81 diseased patients’ images taken retrospectively at five levels per patient. Three observers manually delineated the lungs borders independently and using software called ImgTracer™ (AtheroPoint™, Roseville, CA, USA) to delineate the lung boundaries in all five levels of 3-D lung volume. The three observers consisted of Observer-1: lesser experienced novice tracer who is a resident in radiology under the guidance of radiologist, whereas Observer-2 and Observer-3 are lung image scientists trained by lung radiologist and biomedical imaging scientist and experts. The inter-observer variability can be shown by comparing each observer’s tracings to the automated delineation and also by comparing each manual tracing of the observers with one another. The normality of the tracings was tested using D’Agostino-Pearson test and all observers tracings showed a normal P-value higher than 0.05. The analysis of variance (ANOVA) test between three observers and automated showed a P-value higher than 0.89 and 0.81 for the right lung (RL) and left lung (LL), respectively. The performance of the automated system was evaluated using Dice Similarity Coefficient (DSC), Jaccard Index (JI) and Hausdorff (HD) Distance measures. Although, Observer-1 has lesser experience compared to Obsever-2 and Obsever-3, the Observer Deterioration Factor (ODF) shows that Observer-1 has less than 10 % difference compared to the other two, which is under acceptable range as per our analysis. To compare between observers, this study used regression plots, Bland-Altman plots, two tailed T-test, Mann-Whiney, Chi-Squared tests which showed the following P-values for RL and LL: (i) Observer-1 and Observer-3 were: 0.55, 0.48, 0.29 for RL and 0.55, 0.59, 0.29 for LL; (ii) Observer-1 and Observer-2 were: 0.57, 0.50, 0.29 for RL and 0.54, 0.59, 0.29 for LL; (iii) Observer-2 and Observer-3 were: 0.98, 0.99, 0.29 for RL and 0.99, 0.99, 0.29 for LL. Further, CC and R-squared coefficients were computed between observers which came out to be 0.9 for RL and LL. All three observers however manage to show the feature that diseased lungs are smaller than normal lungs in terms of area.

[1]  Kaustav Nandy,et al.  Interactive segmentation and tracking in optical microscopic images , 2012, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[2]  Juan Ruiz-Alzola,et al.  Comments on: A methodology for evaluation of boundary detection algorithms on medical images , 2004, IEEE Trans. Medical Imaging.

[3]  Margrit Betke,et al.  Small pulmonary nodules: volume measurement at chest CT--phantom study. , 2003, Radiology.

[4]  Hong Yan,et al.  Delineating low‐count defective‐contour SPECT lung scans for PE diagnosis using adaptive dual exponential thresholding and active contours , 2010, Int. J. Imaging Syst. Technol..

[5]  L. Broemeling,et al.  Interobserver and intraobserver variability in measurement of non-small-cell carcinoma lung lesions: implications for assessment of tumor response. , 2003, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[6]  Sherri L. Jackson Research Methods and Statistics: A Critical Thinking Approach , 2005 .

[7]  Bram van Ginneken,et al.  Automatic Segmentation of Pulmonary Segments From Volumetric Chest CT Scans , 2009, IEEE Transactions on Medical Imaging.

[8]  U. Rajendra Acharya,et al.  Inter- and intra-observer variability analysis of completely automated cIMT measurement software (AtheroEdge™) and its benchmarking against commercial ultrasound scanner and expert Readers , 2013, Comput. Biol. Medicine.

[9]  P Sharman,et al.  Interstitial lung disease due to fumes from heat-cutting polymer rope. , 2013, Occupational medicine.

[10]  Bennet M Wang,et al.  Clinical Atlas of Interstitial Lung Disease , 2006 .

[11]  R. Birdwell,et al.  Breast Imaging Reporting and Data System Lexicon for US: Interobserver Agreement for Assessment of Breast Masses , 2010 .

[12]  Jamshid Dehmeshki,et al.  Segmentation of Pulmonary Nodules in Thoracic CT Scans: A Region Growing Approach , 2008, IEEE Transactions on Medical Imaging.

[13]  J. Suri,et al.  Atherosclerotic risk stratification strategy for carotid arteries using texture-based features. , 2012, Ultrasound in medicine & biology.

[14]  Ellen Kao,et al.  Breast imaging reporting and data system lexicon for US: interobserver agreement for assessment of breast masses. , 2009, Radiology.

[15]  Bram van Ginneken,et al.  Automated segmentation of pulmonary structures in thoracic computed tomography scans: a review , 2013, Physics in medicine and biology.

[16]  Nuno Ferreira,et al.  Automated identification of the lung contours in positron emission tomography , 2013 .

[17]  J M Bland,et al.  Statistical methods for assessing agreement between two methods of clinical measurement , 1986 .

[18]  Jasjit S. Suri,et al.  Reliable and accurate psoriasis disease classification in dermatology images using comprehensive feature space in machine learning paradigm , 2015, Expert Syst. Appl..

[19]  Heinz-Otto Peitgen,et al.  Lung lobe segmentation by anatomy-guided 3D watershed transform , 2003, SPIE Medical Imaging.

[20]  K. Hopper,et al.  Analysis of interobserver and intraobserver variability in CT tumor measurements. , 1996, AJR. American journal of roentgenology.

[21]  P Krishna Kumar,et al.  A Review on Carotid Ultrasound Atherosclerotic Tissue Characterization and Stroke Risk Stratification in Machine Learning Framework , 2015, Current Atherosclerosis Reports.

[22]  Aly A. Farag,et al.  Deformable Models: Theory & Biomaterial Applications (Topics in Biomedical Engineering. International Book Series) , 2007 .

[23]  Jasjit S. Suri,et al.  CAUDLES-EF: Carotid Automated Ultrasound Double Line Extraction System Using Edge Flow , 2011, Journal of Digital Imaging.

[24]  G Narasinga rao,et al.  The Role of Pattern Recognition in Computer-Aided Diagnosis and Computer-Aided Detection in Medical Imaging: A Clinical Validation , 2010 .

[25]  Kunio Doi,et al.  Computer-aided diagnosis in medical imaging: Historical review, current status and future potential , 2007, Comput. Medical Imaging Graph..

[26]  Bram van Ginneken,et al.  Automated segmentation of pulmonary structures in thoracic computed tomography scans: a review , 2013 .

[27]  D. Wolfe,et al.  Nonparametric Statistical Methods. , 1974 .

[28]  Eric A. Hoffman,et al.  Automatic lung segmentation for accurate quantitation of volumetric X-ray CT images , 2001, IEEE Transactions on Medical Imaging.

[29]  Lena Costaridou,et al.  Texture classification-based segmentation of lung affected by interstitial pneumonia in high-resolution CT. , 2008, Medical physics.

[30]  Filippo Molinari,et al.  Contribution CHARACTERIZATION OF SINGLE THYROID NODULES BY CONTRAST-ENHANCED 3-D ULTRASOUND , 2010 .

[31]  Gordon Cooke,et al.  Rheumatoid Arthritis (RA) associated interstitial lung disease (ILD). , 2013, European journal of internal medicine.

[32]  Alireza Osareh,et al.  A Segmentation Method of Lung Cavities Using Region Aided Geometric Snakes , 2009, Journal of Medical Systems.

[33]  W. Heindel,et al.  Spiral CT of pulmonary nodules: interobserver variation in assessment of lesion size , 2000, European Radiology.

[34]  Jasjit S. Suri,et al.  Automatic Lung Segmentation Using Control Feedback System: Morphology and Texture Paradigm , 2015, Journal of Medical Systems.

[35]  Jasjit S. Suri,et al.  Plaque Echolucency and Stroke Risk in Asymptomatic Carotid Stenosis: A Systematic Review and Meta-Analysis , 2015, Stroke.

[36]  Ayman El-Baz,et al.  Lung imaging and computer-aided diagnosis , 2011 .

[37]  D. Lynch,et al.  Interobserver variability in the CT assessment of honeycombing in the lungs. , 2013, Radiology.

[38]  Ali Idri,et al.  Empirical Studies on Usability of mHealth Apps: A Systematic Literature Review , 2015, Journal of Medical Systems.

[39]  Michael Kay,et al.  Thurlbeck's Pathology of the Lung , 2005 .

[40]  J. Baker,et al.  Computer-aided classification of breast masses: performance and interobserver variability of expert radiologists versus residents. , 2011, Radiology.

[41]  U. Rajendra Acharya,et al.  Automated classification of patients with coronary artery disease using grayscale features from left ventricle echocardiographic images , 2013, Comput. Methods Programs Biomed..

[42]  Jasjit S. Suri,et al.  3D Imaging Technologies in Atherosclerosis , 2015, Springer US.

[43]  R. Matthay,et al.  INTERSTITIAL LUNG DISEASE IN POLYMYOSITIS AND DERMATOMYOSITIS: ANALYSIS OF SIX CASES AND REVIEW OF THE LITERATURE , 1976, Medicine.

[44]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[45]  Nuno Ferreira,et al.  An Algorithm for the Pulmonary Border Extraction in PET Images , 2012 .

[46]  Jasjit S. Suri,et al.  Computer-aided diagnosis of psoriasis skin images with HOS, texture and color features: A first comparative study of its kind , 2016, Comput. Methods Programs Biomed..

[47]  E L Bolson,et al.  Variability in the measurement of regional left ventricular wall motion from contrast angiograms. , 1983, Circulation.

[48]  Carlos Ferreira,et al.  Quantitative evaluation of a pulmonary contour segmentation algorithm in X-ray computed tomography images. , 2004, Academic radiology.

[49]  Ranganathan Hariharan,et al.  Automated texture‐based characterization of fibrosis and carcinoma using low‐dose lung CT images , 2014, Int. J. Imaging Syst. Technol..

[50]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..