Identification of a potential fibromyalgia diagnosis using random forest modeling applied to electronic medical records.

BACKGROUND: Diagnosis of fibromyalgia (FM), a chronic musculoskeletal condition characterized by widespread pain and a constellation of symptoms, remains challenging and is often delayed. METHODS: Random forest modeling of electronic medical records was used to identify variables that may facilitate earlier FM identification and diagnosis. Subjects aged ≥18 years with two or more listings of the International Classification of Diseases, Ninth Revision, (ICD-9) code for FM (ICD-9 729.1) ≥30 days apart during the 2012 calendar year were defined as cases among subjects associated with an integrated delivery network and who had one or more health care provider encounter in the Humedica database in calendar years 2011 and 2012. Controls were without the FM ICD-9 codes. Seventy-two demographic, clinical, and health care resource utilization variables were entered into a random forest model with downsampling to account for cohort imbalances (<1% subjects had FM). Importance of the top ten variables was ranked based on normalization to 100% for the variable with the largest loss in predicting performance by its omission from the model. Since random forest is a complex prediction method, a set of simple rules was derived to help understand what factors drive individual predictions. RESULTS: The ten variables identified by the model were: number of visits where laboratory/non-imaging diagnostic tests were ordered; number of outpatient visits excluding office visits; age; number of office visits; number of opioid prescriptions; number of medications prescribed; number of pain medications excluding opioids; number of medications administered/ordered; number of emergency room visits; and number of musculoskeletal conditions. A receiver operating characteristic curve confirmed the model's predictive accuracy using an independent test set (area under the curve, 0.810). To enhance interpretability, nine rules were developed that could be used with good predictive probability of an FM diagnosis and to identify no-FM subjects. CONCLUSION: Random forest modeling may help to quantify the predictive probability of an FM diagnosis. Rules can be developed to simplify interpretability. Further validation of these models may facilitate earlier diagnosis and enhance management.

[1]  S. Silverman,et al.  Electronic medical record data to identify variables associated with a fibromyalgia diagnosis: importance of health care resource utilization , 2015, Journal of pain research.

[2]  Gary S Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration , 2015, Annals of Internal Medicine.

[3]  D. Buskila,et al.  Predicting fibromyalgia, a narrative review: Are we better than fools and children? , 2014, European journal of pain.

[4]  Max Kuhn,et al.  Applied Predictive Modeling , 2013 .

[5]  G. Zlateva,et al.  Health-resource use and costs associated with fibromyalgia in France, Germany, and the United States , 2013, ClinicoEconomics and outcomes research : CEOR.

[6]  L. Arnold,et al.  Development and testing of the fibromyalgia diagnostic screen for primary care. , 2011, Journal of women's health.

[7]  G. Zlateva,et al.  Clinical comorbidities, treatment patterns, and healthcare costs among patients with fibromyalgia newly prescribed pregabalin or duloxetine in usual care , 2012, Journal of medical economics.

[8]  J. Cappelleri,et al.  The Burden of Fibromyalgia: Assessment of Health Status Using the EuroQol (EQ-5D) in Patients with Fibromyalgia Relative to Other Chronic Conditions , 2011 .

[9]  D. Goldenberg,et al.  The comparative burden of mild, moderate and severe Fibromyalgia: results from a cross-sectional survey in the United States , 2011, Health and quality of life outcomes.

[10]  Daniel J Clauw,et al.  Improving the recognition and diagnosis of fibromyalgia. , 2011, Mayo Clinic proceedings.

[11]  F. Wolfe,et al.  The development of fibromyalgia – I: Examination of rates and predictors in patients with rheumatoid arthritis (RA) , 2011, PAIN®.

[12]  J. Fermanian,et al.  Development and validation of the Fibromyalgia Rapid Screening Tool (FiRST) , 2010, PAIN.

[13]  C. Beauchemin,et al.  Clinical and Economic Characteristics of Patients With Fibromyalgia Syndrome , 2010, The Clinical journal of pain.

[14]  F. Wolfe,et al.  The American College of Rheumatology Preliminary Diagnostic Criteria for Fibromyalgia and Measurement of Symptom Severity , 2010, Arthritis care & research.

[15]  Serge Perrot,et al.  A patient survey of the impact of fibromyalgia and the journey to diagnosis , 2010, BMC health services research.

[16]  F. Wolfe,et al.  EQ-5D and SF-36 Quality of Life Measures in Systemic Lupus Erythematosus: Comparisons with Rheumatoid Arthritis, Noninflammatory Rheumatic Disorders, and Fibromyalgia , 2010, The Journal of Rheumatology.

[17]  Mariza de Andrade,et al.  Identification of genes and haplotypes that predict rheumatoid arthritis using random forests , 2009, BMC proceedings.

[18]  Daniel L Riddle,et al.  Two-year incidence and predictors of future knee arthroplasty in persons with symptomatic knee osteoarthritis: preliminary analysis of longitudinal data from the osteoarthritis initiative. , 2009, The Knee.

[19]  F. Salaffi,et al.  Health-related quality of life in fibromyalgia patients: a comparison with rheumatoid arthritis patients and the general population using the SF-36 health survey. , 2009, Clinical and experimental rheumatology.

[20]  C. Power,et al.  The influence of socioeconomic status on the reporting of regional and widespread musculoskeletal pain: results from the 1958 British Birth Cohort Study , 2008, Annals of the rheumatic diseases.

[21]  Kaija Saranto,et al.  Definition, structure, content, use and impacts of electronic health records: A review of the research literature , 2008, Int. J. Medical Informatics.

[22]  H. Birnbaum,et al.  Employees With Fibromyalgia: Medical Comorbidity, Healthcare Costs, and Work Loss , 2008, Journal of occupational and environmental medicine.

[23]  E. Dukes,et al.  The health status burden of people with fibromyalgia: a review of studies that assessed health status with the SF-36 or the SF-12 , 2007, International journal of clinical practice.

[24]  Jessie Jones,et al.  An internet survey of 2,596 people with fibromyalgia , 2007, BMC musculoskeletal disorders.

[25]  A. Silman,et al.  The role of psychosocial factors in predicting the onset of chronic widespread pain: results from a prospective population-based study. , 2006, Rheumatology.

[26]  S. Wessely,et al.  The impact of a diagnosis of fibromyalgia on health care resource use by primary care patients in the UK: an observational study based on clinical practice. , 2006, Arthritis and rheumatism.

[27]  A. Silman,et al.  Features of somatization predict the onset of chronic widespread pain: results of a large population-based study. , 2001, Arthritis and rheumatism.

[28]  A. Silman,et al.  Risk factors for persistent chronic widespread pain: a community-based study. , 2001, Rheumatology.

[29]  M. Speechley,et al.  Testing an instrument to screen for fibromyalgia syndrome in general population studies: the London Fibromyalgia Epidemiology Study Screening Questionnaire. , 1999, The Journal of rheumatology.

[30]  P. Tugwell,et al.  The American College of Rheumatology 1990 Criteria for the Classification of Fibromyalgia. Report of the Multicenter Criteria Committee. , 1990, Arthritis and rheumatism.