A prediction model for advanced colorectal neoplasia in an asymptomatic screening population

Background An electronic medical record (EMR) database of a large unselected population who received screening colonoscopies may minimize sampling error and represent real-world estimates of risk for screening target lesions of advanced colorectal neoplasia (CRN). Our aim was to develop and validate a prediction model for assessing the probability of advanced CRN using a clinical data warehouse. Methods A total of 49,450 screenees underwent their first colonoscopy as part of a health check-up from 2002 to 2012 at Samsung Medical Center, and the dataset was constructed by means of natural language processing from the computerized EMR system. The screenees were randomized into training and validation sets. The prediction model was developed using logistic regression. The model performance was validated and compared with existing models using area under receiver operating curve (AUC) analysis. Results In the training set, age, gender, smoking duration, drinking frequency, and aspirin use were identified as independent predictors for advanced CRN (adjusted P < .01). The developed model had good discrimination (AUC = 0.726) and was internally validated (AUC = 0.713). The high-risk group had a 3.7-fold increased risk of advanced CRN compared to the low-risk group (1.1% vs. 4.0%, P < .001). The discrimination performance of the present model for high-risk patients with advanced CRN was better than that of the Asia-Pacific Colorectal Screening score (AUC = 0.678, P < .001) and Schroy’s CAN index (AUC = 0.672, P < .001). Conclusion The present 5-item risk model can be calculated readily using a simple questionnaire and can identify the low- and high-risk groups of advanced CRN at the first screening colonoscopy. This model may increase colorectal cancer risk awareness and assist healthcare providers in encouraging the high-risk group to undergo a colonoscopy.

[1]  Byung-Ho Nam,et al.  Risk Prediction Model for Colorectal Cancer: National Health Insurance Corporation Study, Korea , 2014, PloS one.

[2]  D. Chang,et al.  Association between markers of glucose metabolism and risk of colorectal adenoma. , 2014, Gastroenterology.

[3]  M. Kaminski,et al.  A score to estimate the likelihood of detecting advanced colorectal neoplasia at colonoscopy , 2014, Gut.

[4]  Joe Kesterson,et al.  Comparing methods for identifying pancreatic cancer patients using electronic data sources. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[5]  I. Mesters,et al.  Participation in colorectal cancer screening trials after first-time invitation: a systematic review , 2011, Endoscopy.

[6]  J. Griffith,et al.  A Risk Prediction Index for Advanced Colorectal Neoplasia at Screening Colonoscopy , 2015, The American Journal of Gastroenterology.

[7]  C. Mathers,et al.  Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012 , 2015, International journal of cancer.

[8]  J. Thompson,et al.  Early detection of colon cancer-the kaiser permanente northwest 30-year history: how do we measure success? Is it the test, the number of tests, the stage, or the percentage of screen-detected patients? , 2011, The Permanente journal.

[9]  H. Park,et al.  Prevalence and risk of colorectal neoplasms in asymptomatic, average-risk screenees 40 to 49 years of age. , 2010, Gastrointestinal endoscopy.

[10]  J. Sohn,et al.  Diagnostic Coding for Intramucosal Carcinoma and Neuroendocrine Tumor in the Colorectum: Proposal for Avoiding Confusing Coding in Korea , 2015, Clinical Endoscopy.

[11]  S. Shin,et al.  [Korean guidelines for colorectal cancer screening and polyp detection]. , 2012, The Korean journal of gastroenterology = Taehan Sohwagi Hakhoe chi.

[12]  Charles J. Lightdale,et al.  The Paris endoscopic classification of superficial neoplastic lesions: esophagus, stomach, and colon: November 30 to December 1, 2002. , 2003, Gastrointestinal endoscopy.

[13]  Justin A. Strauss,et al.  Identifying primary and recurrent cancers using a SAS-based natural language processing algorithm , 2012, J. Am. Medical Informatics Assoc..

[14]  Douglas K Rex,et al.  Guidelines for colonoscopy surveillance after screening and polypectomy: a consensus update by the US Multi-Society Task Force on Colorectal Cancer. , 2012, Gastroenterology.

[15]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[16]  Dean F. Sittig,et al.  Natural language processing in the electronic medical record: assessing clinician adherence to tobacco treatment guidelines. , 2005, American journal of preventive medicine.

[17]  Yikyung Park,et al.  Validation of a colorectal cancer risk prediction model among white patients age 50 years and older. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[18]  Joe Kesterson,et al.  Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms. , 2010, HPB : the official journal of the International Hepato Pancreato Biliary Association.

[19]  Huimin Xu,et al.  Derivation and validation of a prediction rule for estimating advanced colorectal neoplasm risk in average-risk Chinese. , 2012, American journal of epidemiology.

[20]  Douglas K Rex,et al.  Corrigendum: American College of Gastroenterology Guidelines for Colorectal Cancer Screening 2008 , 2009, The American Journal of Gastroenterology.

[21]  David F Ransohoff,et al.  Using Risk for Advanced Proximal Colonic Neoplasia To Tailor Endoscopic Screening for Colorectal Cancer , 2003, Annals of Internal Medicine.

[22]  Christopher G. Chute,et al.  Technical Brief: Mayo Clinic NLP System for Patient Smoking Status Identification , 2008, J. Am. Medical Informatics Assoc..

[23]  Graham A Colditz,et al.  Cumulative risk of colon cancer up to age 70 years by risk factor status using data from the Nurses' Health Study. , 2009, American journal of epidemiology.

[24]  Maher M El-Masri,et al.  Handling missing data in self-report measures. , 2005, Research in nursing & health.

[25]  W. Willett,et al.  Harvard Report on Cancer Prevention Volume 4: Harvard Cancer Risk Index , 2004, Cancer Causes & Control.

[26]  J. Sung,et al.  The Asia-Pacific Colorectal Screening score: a validated tool that stratifies risk for colorectal advanced neoplasia in asymptomatic Asian subjects , 2011, Gut.

[27]  J. Buring,et al.  Development of a risk score for colorectal cancer in men. , 2007, The American journal of medicine.

[28]  M. Gail,et al.  Colorectal cancer risk prediction tool for white men and women without known susceptibility. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.