Combinatorial Optimization of Clustering Decisions: An Approach to Refine Psychiatric Diagnoses.

Using complete enumeration (e.g., generating all possible subsets of item combinations) to evaluate clustering problems has the benefit of locating globally optimal solutions automatically without the concern of sampling variability. The proposed method is meant to combine clustering variables in such a way as to create groups that are maximally different on a theoretically sound derivation variable(s). After the population of all unique sets is permuted, optimization on some predefined, user-specific function can occur. We apply this technique to optimizing the diagnosis of Alcohol Use Disorder. This is a unique application, from a clustering point of view, in that the decision rule for clustering observations into the "diagnosis" group relies on both the set of items being considered and a predefined threshold on the number of items required to be endorsed for the "diagnosis" to occur. In optimizing diagnostic rules, criteria set sizes can be reduced without a loss of significant information when compared to current and proposed, alternative, diagnostic schemes.

[1]  Lawrence Hubert,et al.  The variance of the adjusted Rand index. , 2016, Psychological methods.

[2]  Jerome C. Wakefield,et al.  How Many People have Alcohol Use Disorders? Using the Harmful Dysfunction Analysis to Reconcile Prevalence Estimates in Two Community Surveys , 2014, Front. Psychiatry.

[3]  Michael J. Brusco,et al.  An Exact Algorithm for Hierarchically Well-Formulated Subsets in Second-Order Polynomial Regression , 2009, Technometrics.

[4]  B. Grant,et al.  A multidimensional assessment of the validity and utility of alcohol use disorder severity as determined by item response theory models. , 2010, Drug and alcohol dependence.

[5]  Kenneth J Sher,et al.  Limits of Current Approaches to Diagnosis Severity Based on Criterion Counts , 2015, Clinical psychological science : a journal of the Association for Psychological Science.

[6]  D. Steinley Properties of the Hubert-Arabie adjusted Rand index. , 2004, Psychological methods.

[7]  J. Wakefield,et al.  DSM‐5 substance use disorder: how conceptual missteps weakened the foundations of the addictive disorders field , 2015, Acta psychiatrica Scandinavica.

[8]  Michael J. Brusco,et al.  Principal Cluster Axes: A Projection Pursuit Index for the Preservation of Cluster Structures in the Presence of Data Reduction , 2012, Multivariate behavioral research.

[9]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[10]  Jerome C Wakefield,et al.  The harmful dysfunction model of alcohol use disorder: revised criteria to improve the validity of diagnosis and prevalence estimates. , 2015, Addiction.

[11]  Kenneth J Sher,et al.  Truth or consequences in the diagnosis of substance use disorders. , 2014, Addiction.

[12]  Kenneth J Sher,et al.  Toward more efficient diagnostic criteria sets and rules: The use of optimization approaches in addiction science. , 2019, Addictive behaviors.

[13]  Fred W. Glover,et al.  Future paths for integer programming and links to artificial intelligence , 1986, Comput. Oper. Res..

[14]  Douglas Steinley,et al.  Local optima in K-means clustering: what you don't know may hurt you. , 2003, Psychological methods.

[15]  Robin L. Cautin,et al.  Why many clinical psychologists are resistant to evidence-based practice: root causes and constructive remedies. , 2013, Clinical psychology review.

[16]  J Rehm,et al.  Reduction of drinking in problem drinkers and all-cause mortality. , 2013, Alcohol and alcoholism.

[17]  L. Cronbach,et al.  Construct validity in psychological tests. , 1955, Psychological bulletin.

[18]  Kenneth J Sher,et al.  It's the algorithm! Why differential rates of chronicity and comorbidity are not evidence for the validity of the abuse-dependence distinction. , 2010, Journal of abnormal psychology.

[19]  Michael J. Brusco,et al.  Exact and approximate algorithms for variable selection in linear discriminant analysis , 2011, Comput. Stat. Data Anal..

[20]  M. Brusco,et al.  Variable Neighborhood Search Heuristics for Selecting a Subset of Variables in Principal Component Analysis , 2009 .

[21]  W. DeSarbo Gennclus: New models for general nonhierarchical clustering analysis , 1982 .

[22]  G. Reed,et al.  Toward ICD-11: Improving the Clinical Utility of WHO's International Classification of Mental Disorders , 2010 .

[23]  K. Bucholz,et al.  DSM-5 criteria for substance use disorders: recommendations and rationale. , 2013, The American journal of psychiatry.

[24]  Douglas Steinley,et al.  K-means clustering: a half-century synthesis. , 2006, The British journal of mathematical and statistical psychology.

[25]  M. Brusco,et al.  Choosing the number of clusters in Κ-means clustering. , 2011, Psychological methods.