A threshold‐free summary index for quantifying the capacity of covariates to yield efficient treatment rules

When data on treatment assignment, outcomes, and covariates from a randomized trial are available, a question of interest is to what extent covariates can be used to optimize treatment decisions. Statistical hypothesis testing of covariate-by-treatment interaction is ill-suited for this purpose. The application of decision theory results in treatment rules that compare the expected benefit of treatment given the patient's covariates against a treatment threshold. However, determining treatment threshold is often context-specific, and any given threshold might seem arbitrary when the overall capacity towards predicting treatment benefit is of concern. We propose the Concentration of Benefit index (Cb ), a threshold-free metric that quantifies the combined performance of covariates towards finding individuals who will benefit the most from treatment. The construct of the proposed index is comparing expected treatment outcomes with and without knowledge of covariates when one of a two randomly selected patients is to be treated. We show that the resulting index can also be expressed in terms of the integrated efficiency of individualized treatment decision over the entire range of treatment thresholds. We propose parametric and semiparametric estimators, the latter being suitable for out-of-sample validation and correction for optimism. We used data from a clinical trial to demonstrate the calculations in a step-by-step fashion https://github.com/msadatsafavi/txBenefit. The proposed index has intuitive and theoretically sound interpretation and can be estimated with relative ease for a wide class of regression models. Beyond the conceptual developments, various aspects of estimation and inference for such a metric need to be pursued in future research. R code that implements the method for a variety of regression models is provided at (https://github.com/msadatsafavi/txBenefit).

[1]  David M Kent,et al.  Assessing and reporting heterogeneity in treatment effects in clinical trials: a proposal , 2010, Trials.

[2]  John Connett,et al.  Azithromycin for prevention of exacerbations of COPD. , 2011, The New England journal of medicine.

[3]  S D Walter,et al.  The partial area under the summary ROC curve , 2005, Statistics in medicine.

[4]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[5]  J. Gastwirth The Estimation of the Lorenz Curve and Gini Index , 1972 .

[6]  D. Rubin Comment: Which Ifs Have Causal Answers , 1986 .

[7]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[8]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[9]  Peter Bühlmann Regression shrinkage and selection via the Lasso: a retrospective (Robert Tibshirani): Comments on the presentation , 2011 .

[10]  I. van der Tweel,et al.  Subgroup analyses in confirmatory clinical trials: time to be specific about their purposes , 2016, BMC Medical Research Methodology.

[11]  J. Wedzicha,et al.  COPD exacerbations: definitions and classifications , 2003, European Respiratory Journal.

[12]  P. Gustafson,et al.  Relative impact characteristic curve: a graphical tool to visualize and quantify the clinical utility and population-level consequences of implementing markers. , 2018, Annals of epidemiology.

[13]  Matthew M. Chingos,et al.  The Review of Economics and Statistics , 2018 .

[14]  Ewout W. Steyerberg,et al.  Overfitting and optimism in prediction models , 2009 .

[15]  S. Greenland,et al.  Limitations of individual causal models, causal graphs, and ignorability assumptions, as illustrated by random confounding and design unfaithfulness , 2015, European Journal of Epidemiology.

[16]  Ewout W Steyerberg,et al.  The proposed 'concordance-statistic for benefit' provided a useful metric when modeling heterogeneous treatment effects. , 2018, Journal of clinical epidemiology.

[17]  Lu Tian,et al.  Effectively Selecting a Target Population for a Future Comparative Study , 2013, Journal of the American Statistical Association.

[18]  Ewout W Steyerberg,et al.  Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting. , 2019, Journal of clinical epidemiology.

[19]  William White,et al.  A Proposal , 2008, Moon, Sun, and Witches.

[20]  Ewout Steyerberg,et al.  Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects , 2018, British Medical Journal.

[21]  C. Begg,et al.  Using the Lorenz Curve to Characterize Risk Predictiveness and Etiologic Heterogeneity. , 2016, Epidemiology.

[22]  S. Taylor,et al.  Azithromycin for the Prevention of COPD Exacerbations: The Good, Bad, and Ugly. , 2015, The American journal of medicine.

[23]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[24]  E. Elkin,et al.  Decision Curve Analysis: A Novel Method for Evaluating Prediction Models , 2006, Medical decision making : an international journal of the Society for Medical Decision Making.

[25]  M. Pencina,et al.  Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond , 2008, Statistics in medicine.

[26]  A. Mushlin,et al.  Focusing Technology Assessment Using Medical Decision Theory , 1988, Medical decision making : an international journal of the Society for Medical Decision Making.

[27]  Stephen W Lagakos,et al.  Statistics in medicine--reporting of subgroup analyses in clinical trials. , 2007, The New England journal of medicine.

[28]  F. Martinez,et al.  American Journal of Epidemiology Practice of Epidemiology the Association between Rate and Severity of Exacerbations in Chronic Obstructive Pulmonary Disease: an Application of a Joint Frailty-logistic Model , 2022 .

[29]  D. Rubin Statistics and Causal Inference: Comment: Which Ifs Have Causal Answers , 1986 .

[30]  N. Obuchowski,et al.  Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures , 2010, Epidemiology.

[31]  K Claxton,et al.  The irrelevance of inference: a decision-making approach to the stochastic evaluation of health care technologies. , 1999, Journal of health economics.

[32]  Tyler J. VanderWeele,et al.  Selecting Optimal Subgroups for Treatment Using Many Covariates , 2018, Epidemiology.

[33]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .