Methods for improving colorectal cancer annotation efficiency for artificial intelligence-observer training

BACKGROUND Missing occult cancer lesions accounts for the most diagnostic errors in retrospective radiology reviews as early cancer can be small or subtle, making the lesions difficult to detect. Second-observer is the most effective technique for reducing these events and can be economically implemented with the advent of artificial intelligence (AI). AIM To achieve appropriate AI model training, a large annotated dataset is necessary to train the AI models. Our goal in this research is to compare two methods for decreasing the annotation time to establish ground truth: Skip-slice annotation and AI-initiated annotation. METHODS We developed a 2D U-Net as an AI second observer for detecting colorectal cancer (CRC) and an ensemble of 5 differently initiated 2D U-Net for ensemble technique. Each model was trained with 51 cases of annotated CRC computed tomography of the abdomen and pelvis, tested with 7 cases, and validated with 20 cases from The Cancer Imaging Archive cases. The sensitivity, false positives per case, and estimated Dice coefficient were obtained for each method of training. We compared the two methods of annotations and the time reduction associated with the technique. The time differences were tested using Friedman’s two-way analysis of variance. RESULTS Sparse annotation significantly reduces the time for annotation particularly skipping 2 slices at a time (P < 0.001). Reduction of up to 2/3 of the annotation does not reduce AI model sensitivity or false positives per case. Although initializing human annotation with AI reduces the annotation time, the reduction is minimal, even when using an ensemble AI to decrease false positives. CONCLUSION Our data support the sparse annotation technique as an efficient technique for reducing the time needed to establish the ground truth.

[1]  J. Fletcher,et al.  Automated Artificial Intelligence Model Trained on a Large Dataset Can Detect Pancreas Cancer on Diagnostic CTs as well as Visually Occult Pre-invasive Cancer on Pre-diagnostic CTs. , 2023, Gastroenterology.

[2]  C. Johnson,et al.  Strategies for improving colorectal cancer detection with routine computed tomography , 2023, Abdominal Radiology.

[3]  D. Dong,et al.  Artificial intelligence in gastric cancer: applications and challenges , 2022, Gastroenterology report.

[4]  A. McPherson,et al.  Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer , 2022, Nature Cancer.

[5]  C. Murphy,et al.  Colorectal Cancer in Younger Adults. , 2022, Hematology/oncology clinics of North America.

[6]  SamirK. Gupta Screening for Colorectal Cancer. , 2022, Hematology/oncology clinics of North America.

[7]  E. Westman,et al.  Comparison of Two-Dimensional- and Three-Dimensional-Based U-Net Architectures for Brain Tissue Classification in One-Dimensional Brain CT , 2022, Frontiers in Computational Neuroscience.

[8]  H. Moon,et al.  Active Learning Performance in Labeling Radiology Images Is 90% Effective , 2021, Frontiers in Radiology.

[9]  Andre Mastmeyer,et al.  Comparison of 2D vs 3D U-Net Organ Segmentation in abdominal 3D CT images , 2021, CSRN.

[10]  Xiaosheng He,et al.  Gastrointestinal cancers in China, the USA, and Europe , 2021, Gastroenterology report.

[11]  D. Brooks,et al.  Lowering the colorectal cancer screening age improves predicted outcomes in a microsimulation model , 2021, Current medical research and opinion.

[12]  N. Ayache,et al.  Applications of artificial intelligence in cardiovascular imaging , 2021, Nature Reviews Cardiology.

[13]  Ruixin Zhu,et al.  Identification of microbial markers across populations in early detection of colorectal cancer , 2020, Nature Communications.

[14]  M. Lungren,et al.  Preparing Medical Imaging Data for Machine Learning. , 2020, Radiology.

[15]  P. Lavin,et al.  Cross-sectional adherence with the multi-target stool DNA test for colorectal cancer screening: Real-world data from a large cohort of older adults , 2020, Journal of medical screening.

[16]  Jose Dolz,et al.  Bounding boxes for weakly supervised segmentation: Global constraints get close to full supervision , 2020, MIDL.

[17]  Junfeng Zhu,et al.  Ensemble Learning Models Based on Noninvasive Features for Type 2 Diabetes Screening: Model Development and Validation , 2019, JMIR medical informatics.

[18]  S. Pita-Fernández,et al.  Emergency presentation of colorectal patients in Spain , 2018, PloS one.

[19]  Heung-Il Suk,et al.  Multi-Scale Gradual Integration CNN for False Positive Reduction in Pulmonary Nodule Detection , 2018, Neural Networks.

[20]  Lanfen Lin,et al.  A deep 3D residual CNN for false-positive reduction in pulmonary nodule detection. , 2018, Medical physics.

[21]  Brandon K. Fornwalt,et al.  Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration , 2018, npj Digital Medicine.

[22]  S. Bicknell,et al.  The Accuracy of Colorectal Cancer Detection by Computed Tomography in the Unprepared Large Bowel in a Community-Based Hospital , 2018, Canadian Association of Radiologists journal = Journal l'Association canadienne des radiologistes.

[23]  J. Wang-Rodriguez,et al.  Replacing the Guaiac Fecal Occult Blood Test With the Fecal Immunochemical Test Increases Proportion of Individuals Screened in a Large Healthcare Setting , 2017, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[24]  Hao Chen,et al.  Multilevel Contextual 3-D CNNs for False Positive Reduction in Pulmonary Nodule Detection , 2017, IEEE Transactions on Biomedical Engineering.

[25]  Thomas Brox,et al.  3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation , 2016, MICCAI.

[26]  Abhinav Gupta,et al.  Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[28]  Giovanni Seni,et al.  Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions , 2010, Ensemble Methods in Data Mining.

[29]  Guido Gerig,et al.  User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability , 2006, NeuroImage.

[30]  A. Megibow,et al.  Carcinoma of the colon: detection and preoperative staging by CT. , 1988, AJR. American journal of roentgenology.

[31]  P. Koo,et al.  Detecting Early Colorectal Cancer on Routine CT Scan of the Abdomen and Pelvis Can Improve Patient’s 5-year Survival , 2019, Archives of Biomedical and Clinical Research.