Classification of Visit-to-Visit Blood Pressure Variability: A Machine Learning Approach for Data Clustering on Systolic Blood Pressure Intervention Trial (SPRINT)

Background: Blood pressure variability (BPV) is associated with the cardiovascular disease. However, there is no standard risk stratification method to evaluate BPV. Our study aims to cluster BPV into three levels, namely, low, medium and high levels, by a machine learning approach. Methods: The Systolic Blood Pressure Intervention Trial (SPRINT) dataset, which includes patients with hypertension or at risk of cardiovascular diseases, was obtained from a clinical data sharing platform. In the clinical trial, participants with systolic blood pressure (SBP) of at least 130 mmHg and an increased cardiovascular risk were randomized to receive intensive treatment (targeting SBP below 120 mmHg) or standard treatment (targeting SBP below 140 mmHg), and blood pressure (BP) were measured and recorded during the follow-up periods. Visit-to-visit BPV was measured by the deviation between the observed records and the personalized BP trends, and two-dimensional clustering on SBP and diastolic BP were applied. Different curve fitting techniques (linear regression and cubic regression) and clustering methods (K-means and Agglomerative Clustering) were attempted and compared with each other. Results: With 8,092 participants and a median follow-up of 3.26 years, linear regression was a simple and reliable method to capture the BP trend. K-means model showed stable data clustering results. Intensive treatment showed to be effective for participants with a high level of BPV. Conclusion: Machine learning can be used for data clustering on BPV.

[1]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[2]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[3]  Alan D. Lopez,et al.  A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[4]  N J Wald,et al.  Use of blood pressure lowering drugs in the prevention of cardiovascular disease: meta-analysis of 147 randomised trials in the context of expectations from prospective epidemiological studies , 2009, BMJ : British Medical Journal.

[5]  G. Krishna,et al.  Agglomerative clustering using the concept of mutual nearest neighbourhood , 1978, Pattern Recognit..

[6]  J. Manson,et al.  Association Between Annual Visit-to-Visit Blood Pressure Variability and Stroke in Postmenopausal Women: Data From the Women's Health Initiative , 2012, Hypertension.

[7]  Jackson T. Wright,et al.  A Randomized Trial of Intensive versus Standard Blood-Pressure Control. , 2016, The New England journal of medicine.

[8]  G. Mancia,et al.  Total Cardiovascular Risk, Blood Pressure Variability and Adrenergic Overdrive in Hypertension: Evidence, Mechanisms and Clinical Implications , 2012, Current Hypertension Reports.

[9]  Richard J McManus,et al.  Blood pressure variability and cardiovascular disease: systematic review and meta-analysis , 2016, British Medical Journal.

[10]  M. Smits,et al.  A Randomized Trial of Intensive versus Standard Blood-Pressure Control. , 2016, The New England journal of medicine.