Umpire 2.0: Simulating realistic, mixed-type, clinical data for machine learning

[1]  Marianthi Markatou,et al.  kamila: Clustering Mixed-Type Data in R and Hadoop , 2018 .

[2]  J A Cook,et al.  The rise of big clinical databases , 2015, The British journal of surgery.

[3]  K. Radhakrishnan,et al.  Using Unsupervised Machine Learning to Identify Subgroups Among Home Health Patients With Heart Failure Using Telehealth , 2018, Computers, informatics, nursing : CIN.

[4]  W. Kim,et al.  Identification of subtypes in subjects with mild-to-moderate airflow limitation and its clinical and socioeconomic implications , 2017, International journal of chronic obstructive pulmonary disease.

[5]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[6]  Brian W. Powers,et al.  Subgroups of High-Cost Medicare Advantage Patients: an Observational Study , 2018, Journal of General Internal Medicine.

[7]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[8]  Jiexin Zhang,et al.  Sources of variation in false discovery rate estimation include sample size, correlation, and inherent differences between groups , 2012, BMC Bioinformatics.

[9]  Jennifer G. Dy,et al.  Do COPD subtypes really exist? COPD heterogeneity and clustering in 10 independent cohorts , 2017, Thorax.

[10]  Stan Pounds,et al.  Estimating the Occurrence of False Positives and False Negatives in Microarray Studies by Approximating and Partitioning the Empirical Distribution of P-values , 2003, Bioinform..

[11]  Brian W. Powers,et al.  Applying Machine Learning Algorithms to Segment High-Cost Patient Populations , 2018, Journal of General Internal Medicine.

[12]  Kevin R Coombes,et al.  Unsupervised machine learning and prognostic factors of survival in chronic lymphocytic leukemia , 2020, J. Am. Medical Informatics Assoc..

[13]  D. Singer,et al.  Association of of Atrial Fibrillation Clinical Phenotypes With Treatment Patterns and Outcomes: A Multicenter Registry Study , 2017, JAMA cardiology.

[14]  Robert A. Davis,et al.  A cluster-based approach for integrating clinical management of Medicare beneficiaries with multiple chronic conditions , 2019, PloS one.

[15]  Benjamin M. Marlin,et al.  Unsupervised pattern discovery in electronic health care data using probabilistic clustering models , 2012, IHI '12.

[16]  Spiros Denaxas,et al.  Identifying clinically important COPD sub-types using data-driven approaches in primary care population based electronic health records , 2019, BMC Medical Informatics and Decision Making.

[17]  Kevin R. Coombes,et al.  The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data , 2009, Cancer informatics.

[18]  D. Caillaud,et al.  Clinical COPD phenotypes: a novel approach using principal component and cluster analyses , 2010, European Respiratory Journal.