Development and validation of a predictive model for detection of colorectal cancer in primary care by analysis of complete blood counts: a binational retrospective study

Abstract Objective The use of risk prediction models grows as electronic medical records become widely available. Here, we develop and validate a model to identify individuals at increased risk for colorectal cancer (CRC) by analyzing blood counts, age, and sex, then determine the model’s value when used to supplement conventional screening. Materials and Methods Primary care data were collected from a cohort of 606 403 Israelis (of whom 3135 were diagnosed with CRC) and a case control UK dataset of 5061 CRC cases and 25 613 controls. The model was developed on 80% of the Israeli dataset and validated using the remaining Israeli and UK datasets. Performance was evaluated according to the area under the curve, specificity, and odds ratio at several working points. Results Using blood counts obtained 3–6 months before diagnosis, the area under the curve for detecting CRC was 0.82 ± 0.01 for the Israeli validation set. The specificity was 88 ± 2% in the Israeli validation set and 94 ± 1% in the UK dataset. Detecting 50% of CRC cases, the odds ratio was 26 ± 5 and 40 ± 6, respectively, for a false-positive rate of 0.5%. Specificity for 50% detection was 87 ± 2% a year before diagnosis and 85 ± 2% for localized cancers. When used in addition to the fecal occult blood test, our model enabled more than a 2-fold increase in CRC detection. Discussion Comparable results in 2 unrelated populations suggest that the model should generally apply to the detection of CRC in other groups. The model’s performance is superior to current iron deficiency anemia management guidelines, and may help physicians to identify individuals requiring additional clinical evaluation. Conclusions Our model may help to detect CRC earlier in clinical practice.

[1]  O Jolobe,et al.  Guidelines for the management of iron deficiency anaemia , 2001, Gut.

[2]  L M Schuman,et al.  The effect of fecal occult-blood screening on the incidence of colorectal cancer. , 2000, The New England journal of medicine.

[3]  D. Lieberman,et al.  One-time screening for colorectal cancer with combined fecal occult-blood testing and examination of the distal colon. , 2001, The New England journal of medicine.

[4]  J. Hippisley-Cox,et al.  Identifying patients with suspected colorectal cancer in primary care: derivation and validation of an algorithm. , 2012, The British journal of general practice : the journal of the Royal College of General Practitioners.

[5]  Cancer,et al.  Once-only flexible sigmoidoscopy screening in prevention of colorectal cancer: a multicentre randomised controlled trial , 2010, The Lancet.

[6]  Bianca Zadrozny,et al.  Transforming classifier scores into accurate multiclass probability estimates , 2002, KDD.

[7]  E. Hing,et al.  Use and characteristics of electronic health record systems among office-based physician practices: United States, 2001-2013. , 2014, NCHS data brief.

[8]  O. Jolobe Guidelines for the management of iron deficiency anaemia , 2001, Gut.

[9]  C. Mathers,et al.  Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012 , 2015, International journal of cancer.

[10]  J. Cuzick,et al.  European guidelines for quality assurance in colorectal cancer screening and diagnosis: Overview and introduction to the full Supplement publication , 2012, Endoscopy.

[11]  J W Arends,et al.  Regression analysis of prognostic factors in colorectal cancer after curative resections , 1988, Diseases of the Colon & Rectum.

[12]  Hardeep Singh,et al.  Missed Opportunities to Initiate Endoscopic Evaluation for Colorectal Cancer Diagnosis , 2009, The American Journal of Gastroenterology.

[13]  D. Rockey,et al.  Iron deficiency and gastrointestinal malignancy: a population-based cohort study. , 2002, The American journal of medicine.

[14]  S. Lemeshow,et al.  A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. , 1993, JAMA.

[15]  Martin Roland,et al.  Linking physicians' pay to the quality of care--a major experiment in the United kingdom. , 2004, The New England journal of medicine.

[16]  R. Hobbs,et al.  Iron deficiency anaemia and delayed diagnosis of colorectal cancer: a retrospective cohort study , 2011, Colorectal disease : the official journal of the Association of Coloproctology of Great Britain and Ireland.

[17]  W. Hamilton,et al.  The CAPER studies: five case-control studies aimed at identifying and quantifying the risk of cancer in symptomatic primary care patients , 2009, British Journal of Cancer.

[18]  D. Cox Two further applications of a model for binary regression , 1958 .

[19]  F. Chan,et al.  An updated Asia Pacific Consensus Recommendations on colorectal cancer screening , 2014, Gut.

[20]  S. Schneeweiss Learning from big health care data. , 2014, The New England journal of medicine.

[21]  E. Steyerberg,et al.  Prognosis Research Strategy (PROGRESS) 3: Prognostic Model Research , 2013, PLoS medicine.

[22]  N. Obuchowski,et al.  Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures , 2010, Epidemiology.

[23]  M. Short,et al.  Iron deficiency anemia: evaluation and management. , 2013, American family physician.

[24]  Y. Tabak,et al.  Using Automated Clinical Data for Risk Adjustment: Development and Validation of Six Disease-Specific Mortality Predictive Models for Pay-for-Performance , 2007, Medical care.

[25]  Carlos Sofia,et al.  Acute pancreatitis associated with a nontraumatic, intramural duodenal hematoma , 2013, Endoscopy.

[26]  A. Bourke,et al.  Generalisability of The Health Improvement Network (THIN) database: demographics, chronic disease prevalence and mortality rates. , 2011, Informatics in primary care.

[27]  Shivan J. Mehta,et al.  Colorectal cancer screening, version 1.2015: Featured updates to the NCCN guidelines , 2015 .

[28]  R. N. Patterson,et al.  Iron deficiency anaemia: are the British Society of Gastroenterology guidelines being adhered to? , 2003, Postgraduate medical journal.

[29]  S. Ishihara,et al.  Proximal shift of colorectal cancer along with aging. , 2014, Clinical colorectal cancer.

[30]  P. Shekelle,et al.  Screening for Colorectal Cancer: A Guidance Statement From the American College of Physicians , 2012, Annals of Internal Medicine.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  Bianca Zadrozny,et al.  Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers , 2001, ICML.

[33]  A. H. Murphy,et al.  “Good” Probability Assessors , 1968 .

[34]  Farzad Mostashari,et al.  Adoption of electronic health records grows rapidly, but fewer than half of US hospitals had at least a basic system in 2012. , 2013, Health affairs.

[35]  Dennie V. Jones,et al.  The value of a complete blood count in predicting cancer of the colon. , 2004, Cancer detection and prevention.

[36]  Xiaowu Sun,et al.  Using electronic health record data to develop inpatient mortality predictive model: Acute Laboratory Risk of Mortality Score (ALaRMS) , 2013, J. Am. Medical Informatics Assoc..

[37]  J. Zimmerman,et al.  Acute Physiology and Chronic Health Evaluation (APACHE) IV: Hospital mortality assessment for today’s critically ill patients* , 2006, Critical care medicine.

[38]  Varda Shalev,et al.  Variations in hemoglobin before colorectal cancer diagnosis , 2010, European journal of cancer prevention : the official journal of the European Cancer Prevention Organisation.

[39]  Gabriel J. Escobar,et al.  Risk-Adjusting Hospital Inpatient Mortality Using Automated Inpatient, Outpatient, and Laboratory Databases , 2008, Medical care.

[40]  J. Hippisley-Cox,et al.  Identifying patients with suspected lung cancer in primary care: derivation and validation of an algorithm. , 2011, The British journal of general practice : the journal of the Royal College of General Practitioners.

[41]  Xiaowu Sun,et al.  Development and validation of a disease-specific risk adjustment system using automated clinical data. , 2010, Health services research.

[42]  Amy B. Knudsen,et al.  Sensitivity of immunochemical faecal occult blood testing for detecting left- vs right-sided colorectal neoplasia , 2011, British Journal of Cancer.