Validity of ICD-9-CM codes for breast, lung and colorectal cancers in three Italian administrative healthcare databases: a diagnostic accuracy study protocol

Introduction Administrative healthcare databases are useful tools to study healthcare outcomes and to monitor the health status of a population. Patients with cancer can be identified through disease-specific codes, prescriptions and physician claims, but prior validation is required to achieve an accurate case definition. The objective of this protocol is to assess the accuracy of International Classification of Diseases Ninth Revision—Clinical Modification (ICD-9-CM) codes for breast, lung and colorectal cancers in identifying patients diagnosed with the relative disease in three Italian administrative databases. Methods and analysis Data from the administrative databases of Umbria Region (910 000 residents), Local Health Unit 3 of Napoli (1 170 000 residents) and Friuli-Venezia Giulia Region (1 227 000 residents) will be considered. In each administrative database, patients with the first occurrence of diagnosis of breast, lung or colorectal cancer between 2012 and 2014 will be identified using the following groups of ICD-9-CM codes in primary position: (1) 233.0 and (2) 174.x for breast cancer; (3) 162.x for lung cancer; (4) 153.x for colon cancer and (5) 154.0–154.1 and 154.8 for rectal cancer. Only incident cases will be considered, that is, excluding cases that have the same diagnosis in the 5 years (2007–2011) before the period of interest. A random sample of cases and non-cases will be selected from each administrative database and the corresponding medical charts will be assessed for validation by pairs of trained, independent reviewers. Case ascertainment within the medical charts will be based on (1) the presence of a primary nodular lesion in the breast, lung or colon–rectum, documented with imaging or endoscopy and (2) a cytological or histological documentation of cancer from a primary or metastatic site. Sensitivity and specificity with 95% CIs will be calculated. Dissemination Study results will be disseminated widely through peer-reviewed publications and presentations at national and international conferences.

[1]  Maria Laura Luchetta,et al.  The Current State of Validation of Administrative Healthcare Databases in Italy: A Systematic Review , 2014 .

[2]  F. Monaco,et al.  Accuracy of the ICD-9 codes for identifying TIA and stroke in an Italian automated database , 2004, Neurological Sciences.

[3]  A. Konski Clinical and Economic Outcomes Analyses of Women Developing Breast Cancer in a Managed Care Organization , 2005, American journal of clinical oncology.

[4]  Hude Quan,et al.  BMC Health Services Research BioMed Central Correspondence , 2006 .

[5]  Ning Liu,et al.  Utilization and costs of home care for patients with colorectal cancer: a population-based study. , 2014, CMAJ open.

[6]  R. Da Cas,et al.  Cohort study of hepatotoxicity associated with nimesulide and other non-steroidal anti-inflammatory drugs , 2003, BMJ : British Medical Journal.

[7]  P. Ziprin,et al.  Systematic review of discharge coding accuracy. , 2012, Journal of public health.

[8]  M. Cattaruzza,et al.  [The role of the quality of hospital discharge records on the comparative evaluation of outcomes: the example of chronic obstructive pulmonary disease (COPD)]. , 2012, Epidemiologia e prevenzione.

[9]  S. Rosso,et al.  Cancer prevalence in Italy: an analysis of geographic variability , 2012, Cancer Causes & Control.

[10]  S. Jick Fresh evidence confirms links between newer contraceptive pills and higher risk of venous thromboembolism , 2015, BMJ : British Medical Journal.

[11]  A. Deshpande,et al.  Development of a claims-based algorithm to identify colorectal cancer recurrence. , 2015, Annals of epidemiology.

[12]  T. Walley,et al.  The UK General Practice Research Database , 1997, The Lancet.

[13]  Teresa To,et al.  Development and use of reporting guidelines for assessing the quality of validation studies of health administrative data. , 2011, Journal of clinical epidemiology.

[14]  The UK General Practice Research Database , 2007 .

[15]  A. Piatti,et al.  Healthcare-acquired infections in rehabilitation units of the Lombardy Region, Italy , 2011, Infection.

[16]  M. Stolar,et al.  Identification of metastatic cancer in claims data , 2012, Pharmacoepidemiology and drug safety.

[17]  F. Fabris,et al.  Epidemiology of primary and secondary thrombocytopenia: first analysis of an administrative database in a major Italian institution , 2012, Blood coagulation & fibrinolysis : an international journal in haemostasis and thrombosis.

[18]  Suzanne L. West,et al.  Validity of Pharmacoepidemiologic Drug and Diagnosis Data , 2012 .

[19]  P. Papini,et al.  Indicators of breast cancer severity and appropriateness of surgery based on hospital administrative data in the Lazio Region, Italy , 2006, BMC public health.

[20]  C. Quantin,et al.  Haemoptysis in adults: a 5-year study using the French nationwide hospital administrative database , 2015, European Respiratory Journal.

[21]  Annunziata Faustini,et al.  The Reliability of Hospital and Pharmaceutical Data to Assess Prevalent Cases of Chronic Obstructive Pulmonary Disease , 2012, COPD.

[22]  C. Alves,et al.  Data sources on drug safety evaluation: a review of recent published meta‐analyses , 2012, Pharmacoepidemiology and drug safety.

[23]  A. Marinaccio,et al.  [A comparative analysis between regional mesothelioma registries and cancer registries: results of the ReNaM-AIRTUM project]. , 2014, Epidemiologia e prevenzione.

[24]  Martijn J Schuemie,et al.  EU-ADR healthcare database network vs. spontaneous reporting system database: preliminary comparison of signal detection. , 2011, Studies in health technology and informatics.

[25]  Martijn J Schuemie,et al.  Chronic disease prevalence from Italian administrative databases in the VALORE project: a validation through comparison of population estimates with general practice databases and national survey , 2013, BMC Public Health.

[26]  M. Davoli,et al.  The impact of a pay-for-performance system on timing to hip fracture surgery: experience from the Lazio Region (Italy) , 2013, BMC Health Services Research.

[27]  E. Beghi,et al.  Validation of healthcare administrative data for the diagnosis of epilepsy , 2013, Journal of Epidemiology & Community Health.

[28]  A. Nobili,et al.  Cholinesterase inhibitor use in Alzheimer's disease: the EPIFARM‐Elderly Project , 2011, Pharmacoepidemiology and drug safety.

[29]  C. Mathers,et al.  Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012 , 2015, International journal of cancer.

[30]  M. Abrahamowicz,et al.  Estimation of National Colorectal-Cancer Incidence Using Claims Databases , 2012, Journal of cancer epidemiology.

[31]  Katherine E Henson,et al.  Risk of Suicide After Cancer Diagnosis in England , 2018, JAMA psychiatry.

[32]  L. G. García Rodríguez,et al.  Positive predictive value of ICD-9th codes for upper gastrointestinal bleeding and perforation in the Sistema Informativo Sanitario Regionale database. , 1999, Journal of clinical epidemiology.

[33]  E. Beghi,et al.  Validity of hospital discharge diagnoses for public health surveillance of the Guillain-Barrè syndrome , 2002, Neurological Sciences.

[34]  M. Winget,et al.  Using administrative data to estimate time to breast cancer diagnosis and percent of screen-detected breast cancers – a validation study in Alberta, Canada. , 2015, European journal of cancer care.

[35]  M. Brownell,et al.  Administrative record linkage as a tool for public health research. , 2011, Annual review of public health.

[36]  A. Dehal,et al.  Comorbidity and outcomes after surgery among women with breast cancer: analysis of nationwide in-patient sample database , 2013, Breast Cancer Research and Treatment.

[37]  C. Sacerdote,et al.  A high positive predictive value algorithm using hospital administrative data identified incident cancer cases. , 2008, Journal of clinical epidemiology.

[38]  R. Rinaldi,et al.  Accuracy of ICD-9 codes in identifying ischemic stroke in the General Hospital of Lugo di Romagna (Italy) , 2003, Neurological Sciences.

[39]  Hershel Jick,et al.  The General Practice Research Database , 2004 .

[40]  B. Strom,et al.  12. Validity of Pharmacoepidemiologic Drug and Diagnosis Data , 2013 .

[41]  D. Rennie,et al.  Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative , 2003, BMJ : British Medical Journal.

[42]  P. Vineis,et al.  Appropriateness of early breast cancer management in relation to patient and hospital characteristics: a population based study in Northern Italy , 2009, Breast Cancer Research and Treatment.

[43]  M. Zorzi,et al.  Screening for colorectal cancer in Italy: 2011-2012 survey. , 2015, Epidemiologia e prevenzione.