Multivariate modeling to identify patterns in clinical data: the example of chest pain

BackgroundIn chest pain, physicians are confronted with numerous interrelationships between symptoms and with evidence for or against classifying a patient into different diagnostic categories. The aim of our study was to find natural groups of patients on the basis of risk factors, history and clinical examination data which should then be validated with patients' final diagnoses.MethodsWe conducted a cross-sectional diagnostic study in 74 primary care practices to establish the validity of symptoms and findings for the diagnosis of coronary heart disease. A total of 1199 patients above age 35 presenting with chest pain were included in the study. General practitioners took a standardized history and performed a physical examination. They also recorded their preliminary diagnoses, investigations and management related to the patient's chest pain. We used multiple correspondence analysis (MCA) to examine associations on variable level, and multidimensional scaling (MDS), k-means and fuzzy cluster analyses to search for subgroups on patient level. We further used heatmaps to graphically illustrate the results.ResultsA multiple correspondence analysis supported our data collection strategy on variable level. Six factors emerged from this analysis: „chest wall syndrome“, „vital threat“, „stomach and bowel pain“, „angina pectoris“, „chest infection syndrome“, and „ self-limiting chest pain“. MDS, k-means and fuzzy cluster analysis on patient level were not able to find distinct groups. The resulting cluster solutions were not interpretable and had insufficient statistical quality criteria.ConclusionsChest pain is a heterogeneous clinical category with no coherent associations between signs and symptoms on patient level.

[1]  François Béland,et al.  Correspondence analysis is a useful tool to uncover the relationships among categorical variables. , 2010, Journal of clinical epidemiology.

[2]  Norbert Donner-Banzhoff,et al.  Accuracy of symptoms and signs for coronary heart disease assessed in primary care. , 2010, The British journal of general practice : the journal of the Royal College of General Practitioners.

[3]  Diane Carroll,et al.  Cluster Analysis of Elderly Cardiac Patients' Prehospital Symptomatology , 2008, Nursing research.

[4]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[5]  G. Eslick Usefulness of chest pain character and location as diagnostic indicators of an acute coronary syndrome. , 2005, The American journal of cardiology.

[6]  Christian Fg Schendera,et al.  Clusteranalyse mit SPSS: Mit Faktorenanalyse , 2010 .

[7]  A. Becker,et al.  Ruling out coronary artery disease in primary care: development and validation of a simple prediction rule , 2010, Canadian Medical Association Journal.

[8]  Abdel-Badeeh M. Salem Case Based Reasoning Technology for Medical Diagnosis , 2007 .

[9]  C. Schmid,et al.  Why clinicians are natural bayesians , 2005, BMJ : British Medical Journal.

[10]  Catherine J Ryan,et al.  Classifying subgroups of patients with symptoms of acute coronary syndromes: A cluster analysis. , 2010, Research in nursing & health.

[11]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[12]  J. Knottnerus,et al.  Assessment of the accuracy of diagnostic tests: the cross-sectional study. , 2003, Journal of clinical epidemiology.

[13]  Seth Bullock,et al.  Simple Heuristics That Make Us Smart , 1999 .

[14]  James Jaccard,et al.  Statistics for the Behavioral Sciences , 1983 .

[15]  Brian Everitt,et al.  Cluster analysis , 1974 .

[16]  Andreas Pöge,et al.  Clusteranalyse. Anwendungsorientierte Einführung in Klassifikationsverfahren , 2010 .

[17]  M. Greenacre,et al.  Multiple Correspondence Analysis and Related Methods , 2006 .

[18]  D. Moser,et al.  Symptom Clusters in Acute Myocardial Infarction: A Secondary Data Analysis , 2007, Nursing research.

[19]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[20]  Alexandra L Hanlon,et al.  Differences in mortality in acute coronary syndrome symptom clusters. , 2010, American heart journal.

[21]  Rudolf Seising,et al.  From vagueness in medical thought to the foundations of fuzzy reasoning in medical diagnosis , 2006, Artif. Intell. Medicine.

[22]  A. Becker,et al.  Chest pain in primary care: Epidemiology and pre-work-up probabilities , 2009, The European journal of general practice.

[23]  F. Rutten,et al.  Electrocardiography in primary care; is it useful? , 2000, International journal of cardiology.

[24]  P. Todd,et al.  Simple Heuristics That Make Us Smart , 1999 .

[25]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[26]  J. Hinde,et al.  Correspondence analysis as a screening method for indicants for clinical diagnosis. , 1989, Statistics in medicine.