Automatic Knee Osteoarthritis Diagnosis from Plain Radiographs: A Deep Learning-Based Approach

Knee osteoarthritis (OA) is the most common musculoskeletal disorder. OA diagnosis is currently conducted by assessing symptoms and evaluating plain radiographs, but this process suffers from subjectivity. In this study, we present a new transparent computer-aided diagnosis method based on the Deep Siamese Convolutional Neural Network to automatically score knee OA severity according to the Kellgren-Lawrence grading scale. We trained our method using the data solely from the Multicenter Osteoarthritis Study and validated it on randomly selected 3,000 subjects (5,960 knees) from Osteoarthritis Initiative dataset. Our method yielded a quadratic Kappa coefficient of 0.83 and average multiclass accuracy of 66.71% compared to the annotations given by a committee of clinical experts. Here, we also report a radiological OA diagnosis area under the ROC curve of 0.93. Besides this, we present attention maps highlighting the radiological features affecting the network decision. Such information makes the decision process transparent for the practitioner, which builds better trust toward automatic methods. We believe that our model is useful for clinical decision making and for OA research; therefore, we openly release our training codes and the data set created in this study.

[1]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Naomi S. Altman,et al.  Points of Significance: Model selection and overfitting , 2016, Nature Methods.

[4]  Simo Saarakkala,et al.  Association between subchondral bone structure and osteoarthritis histopathological grade , 2016, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[5]  Simo Saarakkala,et al.  A Novel Method for Automatic Localization of Joint Area on Knee Plain Radiographs , 2017, SCIA.

[6]  Lars Engebretsen,et al.  Defining the presence of radiographic knee osteoarthritis: a comparison between the Kellgren and Lawrence system and OARSI atlas criteria , 2015, Knee Surgery, Sports Traumatology, Arthroscopy.

[7]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[8]  Yann LeCun,et al.  Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Alexander Binder,et al.  Explaining nonlinear classification decisions with deep Taylor decomposition , 2015, Pattern Recognit..

[10]  Lior Shamir,et al.  ASSESSMENT OF OSTEOARTHRITIS INITIATIVE–KELLGREN AND LAWRENCE SCORING PROJECTS QUALITY USING COMPUTER ANALYSIS , 2010 .

[11]  Dacre Je,et al.  THE AUTOMATIC ASSESSMENT OF KNEE RADIOGRAPHS IN OSTEOARTHRITIS USING DIGITAL IMAGE ANALYSIS , 1989 .

[12]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[13]  Bo Zhao,et al.  Diversified Visual Attention Networks for Fine-Grained Object Classification , 2016, IEEE Transactions on Multimedia.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Noel E. O'Connor,et al.  Automatic Detection of Knee Joints and Quantification of Knee Osteoarthritis Severity Using Convolutional Neural Networks , 2017, MLDM.

[18]  Marek Kurzynski,et al.  A dissimilarity-based multiple classifier system for trabecular bone texture in detection and prediction of progression of knee osteoarthritis , 2012, Proceedings of the Institution of Mechanical Engineers. Part H, Journal of engineering in medicine.

[19]  J. Kellgren,et al.  Radiological Assessment of Osteo-Arthrosis , 1957, Annals of the rheumatic diseases.

[20]  J. Dacre,et al.  The automatic assessment of knee radiographs in osteoarthritis using digital image analysis. , 1989, British journal of rheumatology.

[21]  D. Shen,et al.  Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans , 2016, Scientific Reports.

[22]  Timothy F. Cootes,et al.  Automated Shape and Texture Analysis for Detection of Osteoarthritis from Radiographs of the Knee , 2015, MICCAI.

[23]  L. McLean,et al.  Validity and sensitivity to change of three scales for the radiographic assessment of knee osteoarthritis using images from the Multicenter Osteoarthritis Study (MOST). , 2015, Osteoarthritis and cartilage.

[24]  Ramprasaath R. Selvaraju,et al.  Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization , 2016 .

[25]  J B Renner,et al.  Comparative evaluation of three semi-quantitative radiographic grading techniques for knee osteoarthritis in terms of validity and reproducibility in 1759 X-rays: report of the OARSI-OMERACT task force. , 2008, Osteoarthritis and cartilage.

[26]  Tom Cox,et al.  The financial burden of psychosocial workplace aggression: A systematic review of cost-of-illness studies , 2018 .

[27]  Jaume Puig-Junoy,et al.  Socio-economic costs of osteoarthritis: a systematic review of cost-of-illness studies. , 2015, Seminars in arthritis and rheumatism.

[28]  Joseph Antony,et al.  Quantifying radiographic knee osteoarthritis severity using deep convolutional neural networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[29]  M. Karsdal,et al.  Disease-modifying treatments for osteoarthritis (DMOADs) of the knee and hip: lessons learned from failures and opportunities for the future. , 2016, Osteoarthritis and cartilage.

[30]  J. Wolfe,et al.  The Invisible Gorilla Strikes Again , 2013, Psychological science.

[31]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[32]  Timothy F. Cootes,et al.  Fully automated shape analysis for detection of Osteoarthritis from lateral knee radiographs , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[33]  T. Vos,et al.  The global burden of rheumatoid arthritis: estimates from the Global Burden of Disease 2010 study , 2014, Annals of the rheumatic diseases.

[34]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[35]  L. Ferrucci,et al.  Early detection of radiographic knee osteoarthritis using computer-aided analysis. , 2009, Osteoarthritis and cartilage.

[36]  T. Vos,et al.  The global burden of hip and knee osteoarthritis: estimates from the Global Burden of Disease 2010 study , 2014, Annals of the rheumatic diseases.