One Hundred Years of Hypertension Research: Topic Modeling Study

Background: Due to scientific and technical advancements in the field, published hypertension research has grown substantially during the last decade. Given the volume of scientific material published in this field, identifying the relevant information is difficult. We used topic modeling, a powerful approach for extracting useful information from large amounts of unstructured text.

Objective: This study aims to use a machine learning algorithm to uncover hidden topics and subtopics from 100 years of peer-reviewed hypertension publications and to identify temporal trends.

Methods: The titles and abstracts of hypertension papers indexed in PubMed were examined. We used the latent Dirichlet allocation model to extract 20 primary topics and then ran a trend analysis to see how their popularity changed over time.

Results: We gathered 581,750 hypertension-related research articles published from 1900 to 2018 and divided them into 20 topics. These topics were broadly categorized as preclinical, epidemiology, complications, and therapy studies. Topic 2 (evidence review) and topic 19 (major cardiovascular events) were the key (hot) topics. Most of the cardiopulmonary disease subtopics showed little variation over time and made only a small contribution in terms of proportions. The majority of the articles (414,206/581,750; 71.2%) had a negative valency, followed by positive (119,841/581,750; 20.6%) and neutral valency (47,704/581,750; 8.2%). Between 1980 and 2000, the share of negative-sentiment articles fell somewhat, while positive- and neutral-sentiment articles rose substantially.

Conclusions: The number of publications increased exponentially over the study period. Most of the uncovered topics can be grouped into four categories (ie, preclinical, epidemiology, complications, and treatment-related studies).
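The trend analysis mentioned in the Methods can be illustrated with a minimal sketch. It assumes that each article has already been assigned a single dominant topic by the latent Dirichlet allocation model, and that topic popularity is measured as the share of a year's articles falling under each topic; the abstract does not state how proportions were computed, so both are assumptions. The function name `topic_share_by_year` and the toy records are hypothetical and are not data from the study.

```python
from collections import Counter, defaultdict


def topic_share_by_year(records):
    """Return {year: {topic: share}} from (year, dominant_topic) pairs.

    Assumption: one dominant topic per article; share = count / yearly total.
    """
    counts = defaultdict(Counter)
    for year, topic in records:
        counts[year][topic] += 1

    shares = {}
    for year, topic_counts in sorted(counts.items()):
        total = sum(topic_counts.values())
        shares[year] = {t: n / total for t, n in sorted(topic_counts.items())}
    return shares


# Toy, made-up records (publication year, dominant topic id); illustrative only.
toy_records = [
    (1990, 2), (1990, 19), (1990, 2),
    (2000, 19), (2000, 19), (2000, 2), (2000, 5),
]

shares = topic_share_by_year(toy_records)
for year, dist in shares.items():
    print(year, dist)
```

Plotting each topic's share against the year would then show which topics gain or lose ground over time. Using shares rather than raw counts keeps the comparison meaningful when the total number of publications grows sharply across the period.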
