Real-time artificial intelligence for detection of upper gastrointestinal cancer by endoscopy: a multicentre, case-control, diagnostic study.

BACKGROUND Upper gastrointestinal cancers (including oesophageal cancer and gastric cancer) are the most common cancers worldwide. Artificial intelligence platforms using deep learning algorithms have made remarkable progress in medical imaging but their application in upper gastrointestinal cancers has been limited. We aimed to develop and validate the Gastrointestinal Artificial Intelligence Diagnostic System (GRAIDS) for the diagnosis of upper gastrointestinal cancers through analysis of imaging data from clinical endoscopies. METHODS This multicentre, case-control, diagnostic study was done in six hospitals of different tiers (ie, municipal, provincial, and national) in China. The images of consecutive participants, aged 18 years or older, who had not had a previous endoscopy were retrieved from all participating hospitals. All patients with upper gastrointestinal cancer lesions (including oesophageal cancer and gastric cancer) that were histologically proven malignancies were eligible for this study. Only images with standard white light were deemed eligible. The images from Sun Yat-sen University Cancer Center were randomly assigned (8:1:1) to the training and intrinsic verification datasets for developing GRAIDS, and the internal validation dataset for evaluating the performance of GRAIDS. Its diagnostic performance was evaluated using an internal and prospective validation set from Sun Yat-sen University Cancer Center (a national hospital) and additional external validation sets from five primary care hospitals. The performance of GRAIDS was also compared with endoscopists with three degrees of expertise: expert, competent, and trainee. The diagnostic accuracy, sensitivity, specificity, positive predictive value, and negative predictive value of GRAIDS and endoscopists for the identification of cancerous lesions were evaluated by calculating the 95% CIs using the Clopper-Pearson method. FINDINGS 1 036 496 endoscopy images from 84 424 individuals were used to develop and test GRAIDS. The diagnostic accuracy in identifying upper gastrointestinal cancers was 0·955 (95% CI 0·952-0·957) in the internal validation set, 0·927 (0·925-0·929) in the prospective set, and ranged from 0·915 (0·913-0·917) to 0·977 (0·977-0·978) in the five external validation sets. GRAIDS achieved diagnostic sensitivity similar to that of the expert endoscopist (0·942 [95% CI 0·924-0·957] vs 0·945 [0·927-0·959]; p=0·692) and superior sensitivity compared with competent (0·858 [0·832-0·880], p<0·0001) and trainee (0·722 [0·691-0·752], p<0·0001) endoscopists. The positive predictive value was 0·814 (95% CI 0·788-0·838) for GRAIDS, 0·932 (0·913-0·948) for the expert endoscopist, 0·974 (0·960-0·984) for the competent endoscopist, and 0·824 (0·795-0·850) for the trainee endoscopist. The negative predictive value was 0·978 (95% CI 0·971-0·984) for GRAIDS, 0·980 (0·974-0·985) for the expert endoscopist, 0·951 (0·942-0·959) for the competent endoscopist, and 0·904 (0·893-0·916) for the trainee endoscopist. INTERPRETATION GRAIDS achieved high diagnostic accuracy in detecting upper gastrointestinal cancers, with sensitivity similar to that of expert endoscopists and was superior to that of non-expert endoscopists. This system could assist community-based hospitals in improving their effectiveness in upper gastrointestinal cancer diagnoses. FUNDING The National Key R&D Program of China, the Natural Science Foundation of Guangdong Province, the Science and Technology Program of Guangdong, the Science and Technology Program of Guangzhou, and the Fundamental Research Funds for the Central Universities.

[1]  Kenshi Yao,et al.  Optimizing early upper gastrointestinal cancer detection at endoscopy , 2015, Nature Reviews Gastroenterology &Hepatology.

[2]  S. Menon,et al.  How commonly is upper gastrointestinal cancer missed at endoscopy? A meta-analysis , 2014, Endoscopy International Open.

[3]  I. Choi,et al.  Effectiveness of the Korean National Cancer Screening Program in Reducing Gastric Cancer Mortality. , 2017, Gastroenterology.

[4]  M. Gurcan,et al.  Diagnosis of thyroid cancer using deep convolutional neural network models applied to sonographic images: a retrospective, multicohort, diagnostic study. , 2019, The Lancet. Oncology.

[5]  C. Hamashima Update version of the Japanese Guidelines for Gastric Cancer Screening. , 2018, Japanese journal of clinical oncology.

[6]  M. Fujishiro,et al.  Diagnostic outcomes of esophageal cancer by artificial intelligence using convolutional neural networks. , 2019, Gastrointestinal endoscopy.

[7]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[8]  Hisao Tajiri,et al.  An Asian consensus on standards of diagnostic upper endoscopy for neoplasia , 2018, Gut.

[9]  Xin Sun,et al.  Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence , 2019, Nature Medicine.

[10]  Nicolas Chapados,et al.  Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model , 2017, Gut.

[11]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[12]  D. Coit,et al.  Proposal of a new stage grouping of gastric cancer for TNM classification: International Gastric Cancer Association staging project , 2017, Gastric Cancer.

[13]  S. Toda,et al.  Two years' intensive training in endoscopic diagnosis facilitates detection of early gastric cancer. , 2012, Internal medicine.

[14]  J. Koea,et al.  Patient information needs in upper gastrointestinal cancer: what patients and their families want to know , 2018, ANZ journal of surgery.

[15]  H. Ishwaran,et al.  Recommendations for pathologic staging (pTNM) of cancer of the esophagus and esophagogastric junction for the 8th edition AJCC/UICC staging manuals. , 2016, Diseases of the esophagus : official journal of the International Society for Diseases of the Esophagus.

[16]  Rebecca L. Siegel Mph,et al.  Cancer statistics, 2018 , 2018 .

[17]  Y. Naito,et al.  Blue laser imaging-bright improves the real-time detection rate of early gastric cancer: a randomized controlled study. , 2019, Gastrointestinal endoscopy.

[18]  M. Fujishiro,et al.  Application of artificial intelligence using a convolutional neural network for detecting gastric cancer in endoscopic images , 2018, Gastric Cancer.

[19]  B. Wilson,et al.  Endoscopic detection of early upper GI cancers. , 2005, Best practice & research. Clinical gastroenterology.

[20]  Hisao Tajiri,et al.  Early detection of superficial squamous cell carcinoma in the head and neck region and esophagus by narrow band imaging: a multicenter randomized controlled trial. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[21]  Bastian Leibe,et al.  Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  C. Compton,et al.  The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population‐based to a more “personalized” approach to cancer staging , 2017, CA: a cancer journal for clinicians.

[23]  Diana S. M. Buist,et al.  Will Machine Learning Tip the Balance in Breast Cancer Screening? , 2017, JAMA oncology.

[24]  Helmut Neumann,et al.  Confocal laser endomicroscopy: technical advances and clinical applications. , 2010, Gastroenterology.

[25]  Yuichi Mori,et al.  Artificial intelligence in gastrointestinal endoscopy: The future is almost here , 2018, World journal of gastrointestinal endoscopy.