Mining Disease Risk Patterns from Nationwide Clinical Databases for the Assessment of Early Rheumatoid Arthritis Risk

Rheumatoid arthritis (RA) is a chronic autoimmune rheumatic disease that can cause painful swelling in the joint lining, morning stiffness, and joint deformation/destruction. These symptoms decrease both quality of life and life expectancy. However, if RA can be diagnosed in the early stages, it can be controlled with pharmacotherapy. Although many studies have examined the possibility of early assessment and diagnosis, few have considered the relationship between significant risk factors and the early assessment of RA. In this paper, we present a novel framework for early RA assessment that utilizes data preprocessing, risk pattern mining, validation, and analysis. Under our proposed framework, two risk patterns can be discovered. Type I refers to well-known risk patterns that have been identified by existing studies, whereas Type II denotes unknown relationship risk patterns that have rarely or never been reported in the literature. These Type II patterns are very valuable in supporting novel hypotheses in clinical trials of RA, and constitute the main contribution of this work. To ensure the robustness of our experimental evaluation, we use a nationwide clinical database containing information on 1,314 RA-diagnosed patients over a 12-year follow-up period (1997–2008) and 965,279 non-RA patients. Our proposed framework is employed on this large-scale population-based dataset, and is shown to effectively discover rich RA risk patterns. These patterns may assist physicians in patient assessment, and enhance opportunities for early detection of RA. The proposed framework is broadly applicable to the mining of risk patterns for major disease assessments. This enables the identification of early risk patterns that are significantly associated with a target disease.

[1]  H. El-Gabalawy,et al.  Periodontitis and rheumatoid arthritis: epidemiologic, clinical, and immunologic associations. , 2009, Compendium of continuing education in dentistry.

[2]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[3]  Chang-Fu Kuo,et al.  Rheumatoid arthritis prevalence, incidence, and mortality rates: a nationwide population study in Taiwan , 2013, Rheumatology International.

[4]  M. Akhtar,et al.  Interstitial keratitis and sensorineural hearing loss as a manifestation of rheumatoid arthritis: clinical lessons from a rare complication , 2012, BMJ Case Reports.

[5]  A. Filer,et al.  Performance of the 2010 ACR/EULAR criteria for rheumatoid arthritis: comparison with 1987 ACR criteria in a very early synovitis cohort , 2011, Annals of the rheumatic diseases.

[6]  J. Jaimes-Hernández,et al.  Chronic eosinophilic pneumonia: autoimmune phenomenon or immunoallergic disease? Case report and literature review. , 2012, Reumatologia clinica.

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  A. Ebringer,et al.  Rheumatoid arthritis is caused by a Proteus urinary tract infection , 2014, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[9]  J. Vergnes,et al.  Effect of periodontal treatment on the clinical parameters of patients with rheumatoid arthritis: study protocol of the randomized, controlled ESPERA trial , 2013, Trials.

[10]  S. Easteal,et al.  Predicting the presence of hepatitis B virus surface antigen in Chinese patients by pathology data mining , 2013, Journal of medical virology.

[11]  V. Cruz,et al.  Ulcerative colitis and rheumatoid arthritis: a rare association--case report. , 2012, Revista brasileira de reumatologia.

[12]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[13]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[14]  P. Nyirjesy,et al.  MALASSEZIA FURFUR FOLLICULITIS OF THE VULVA: OLIVE OIL SOLVES THE MYSTERY , 1994, Obstetrics and gynecology.

[15]  M. Turiel,et al.  Is atherosclerosis an autoimmune disease? , 2014, BMC Medicine.

[16]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[17]  T. Fahey,et al.  Diagnostic accuracy of a clinical prediction rule (CPR) for identifying patients with recent-onset undifferentiated arthritis who are at a high risk of developing rheumatoid arthritis: a systematic review and meta-analysis. , 2014, Seminars in arthritis and rheumatism.

[18]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[19]  W. Grassi,et al.  Sonographic assessment of carpal tunnel syndrome in rheumatoid arthritis: prevalence and correlation with disease activity , 2012, Rheumatology International.

[20]  A. Hamdan,et al.  Laryngeal involvement in rheumatoid arthritis. , 2007, Middle East journal of anaesthesiology.

[21]  A. Gibofsky,et al.  Overview of epidemiology, pathophysiology, and diagnosis of rheumatoid arthritis. , 2012, The American journal of managed care.

[22]  A. Dees,et al.  Late Onset Takayasu Arteritis and Rheumatoid Arthritis , 2012, Case reports in medicine.

[23]  E. Choy Understanding the dynamics: pathways involved in the pathogenesis of rheumatoid arthritis. , 2012, Rheumatology.

[24]  Hung-Wen Chiu,et al.  Comorbidity study of ADHD: Applying association rule mining (ARM) to National Health Insurance Database of Taiwan , 2009, Int. J. Medical Informatics.

[25]  K. Oikarinen,et al.  Radiological signs indicating infection of dental origin in elderly Finns , 2013, Acta odontologica Scandinavica.

[26]  J. Manson,et al.  C-reactive protein in the prediction of rheumatoid arthritis in women. , 2006, Archives of internal medicine.

[27]  Jiun-Liang Chen,et al.  Identifying Core Herbal Treatments for Children with Asthma: Implication from a Chinese Herbal Medicine Database in Taiwan , 2013, Evidence-based complementary and alternative medicine : eCAM.

[28]  T. Ng,et al.  A clinical decision support tool to predict survival in cancer patients beyond 120 days after palliative chemotherapy. , 2012, Journal of palliative medicine.

[29]  T. Huizinga,et al.  Prediction and prevention of rheumatoid arthritis , 2007 .

[30]  A. Choudhary,et al.  Development of a 5 year life expectancy index in older adults using predictive mining of electronic health record data. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[31]  P. Elsner,et al.  Erythema multiforme-Like Drug Eruption with Oral Involvement after Intake of Leflunomide , 2003, Dermatology.

[32]  T. Skare,et al.  Anti-CCP in systemic lupus erythematosus patients: a cross sectional study in Brazilian patients , 2013, Clinical Rheumatology.

[33]  Jonathan Kay,et al.  Prevalence of comorbidities in rheumatoid arthritis and evaluation of their monitoring: results of an international, cross-sectional study (COMORA) , 2013, Annals of the rheumatic diseases.

[34]  P. O'connor Crystal Deposition Disease and Psoriatic Arthritis , 2013, Seminars in Musculoskeletal Radiology.

[35]  E. Soriano,et al.  Validation of a Prediction Rule for the Diagnosis of Rheumatoid Arthritis in Patients with Recent Onset Undifferentiated Arthritis , 2013, International journal of rheumatology.

[36]  L. Robertson,et al.  Chronic urticaria and autoimmunity. , 2013, Skin therapy letter.

[37]  Saskia le Cessie,et al.  A prediction rule for disease outcome in patients with recent-onset undifferentiated arthritis: how to guide individual treatment decisions. , 2007, Arthritis and rheumatism.

[38]  Vladimir Naumovich Vapni The Nature of Statistical Learning Theory , 1995 .

[39]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.