Investigating the impact of disease and health record duration on the eMERGE algorithm for rheumatoid arthritis

OBJECTIVE The study sought to determine the dependence of the Electronic Medical Records and Genomics (eMERGE) rheumatoid arthritis (RA) algorithm on both RA and electronic health record (EHR) duration. MATERIALS AND METHODS Using a population-based cohort from the Mayo Clinic Biobank, we identified 497 patients with at least 1 RA diagnosis code. RA case status was manually determined using validated criteria for RA. RA duration was defined as time from first RA code to the index date of biobank enrollment. To simulate EHR duration, various years of EHR lookback were applied, starting at the index date and going backward. Model performance was determined by sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve (AUC). RESULTS The eMERGE algorithm performed well in this cohort, with overall sensitivity 53%, specificity 99%, positive predictive value 97%, negative predictive value 74%, and AUC 76%. Among patients with RA duration <2 years, sensitivity and AUC were only 9% and 54%, respectively, but increased to 71% and 85% among patients with RA duration >10 years. Longer EHR lookback also improved model performance up to a threshold of 10 years, in which sensitivity reached 52% and AUC 75%. However, optimal EHR lookback varied by RA duration; an EHR lookback of 3 years was best able to identify recently diagnosed RA cases. CONCLUSIONS eMERGE algorithm performance improves with longer RA duration as well as EHR duration up to 10 years, though shorter EHR lookback can improve identification of recently diagnosed RA cases.

[1]  I. Kohane,et al.  Electronic medical records for discovery research in rheumatoid arthritis , 2010, Arthritis care & research.

[2]  Scott M. Brue,et al.  Data resource profile: the Rochester Epidemiology Project (REP) medical records-linkage system. , 2012, International journal of epidemiology.

[3]  I. Kohane,et al.  Development of phenotype algorithms using electronic medical records and incorporating natural language processing , 2015, BMJ : British Medical Journal.

[4]  T. Therneau,et al.  Is the incidence of rheumatoid arthritis rising?: results from Olmsted County, Minnesota, 1955-2007. , 2010, Arthritis and rheumatism.

[5]  Douglas G Altman,et al.  The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: guidelines for reporting observational studies. , 2014, International journal of surgery.

[6]  Kiley J. Johnson,et al.  The Mayo Clinic Biobank: a building block for individualized medicine. , 2013, Mayo Clinic proceedings.

[7]  Association Between Anti–Citrullinated Fibrinogen Antibodies and Coronary Artery Disease in Rheumatoid Arthritis , 2018, Arthritis care & research.

[8]  Hua Xu,et al.  Portability of an algorithm to identify rheumatoid arthritis in electronic health records , 2012, J. Am. Medical Informatics Assoc..

[9]  S. Murphy,et al.  Association between inflammation and systolic blood pressure in RA compared to patients without RA , 2018, Arthritis Research & Therapy.

[10]  Donia Scott,et al.  Extracting information from the text of electronic medical records to improve case detection: a systematic review , 2016, J. Am. Medical Informatics Assoc..

[11]  A. Silman,et al.  UvA-DARE (Digital Academic Repository) 2010 Rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative Aletaha, , 2010 .

[12]  C. Chute,et al.  Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium , 2011, Science Translational Medicine.

[13]  Peter Szolovits,et al.  Associations of autoantibodies, autoimmune risk alleles, and clinical diagnoses from the electronic medical records in rheumatoid arthritis cases and non-rheumatoid arthritis controls. , 2013, Arthritis and rheumatism.

[14]  Douglas G Altman,et al.  [The Strengthening the Reporting of Observational Studies in Epidemiology [STROBE] statement: guidelines for reporting observational studies]. , 2007, Gaceta sanitaria.

[15]  M. Liang,et al.  The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. , 1988, Arthritis and rheumatism.

[16]  D. MacArthur,et al.  An eMERGE Clinical Center at Partners Personalized Medicine , 2016, Journal of personalized medicine.

[17]  Chen Lin,et al.  Automatic Prediction of Rheumatoid Arthritis Disease Activity from the Electronic Medical Records , 2013, AMIA.