Optimal dynamic treatment regimes

Summary. A dynamic treatment regime is a list of decision rules, one per time interval, for how the level of treatment will be tailored through time to an individual's changing status. The goal of this paper is to use experimental or observational data to estimate decision regimes that result in a maximal mean response. To explicate our objective and to state the assumptions, we use the potential outcomes model. The method proposed makes smooth parametric assumptions only on quantities that are directly relevant to the goal of estimating the optimal rules. We illustrate the methodology proposed via a small simulation.

[1]  R. Bellman Dynamic programming. , 1957, Science.

[2]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[3]  D. Rubin Statistics and Causal Inference: Comment: Which Ifs Have Causal Answers , 1986 .

[4]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[5]  D. Rubin Comment: Which Ifs Have Causal Answers , 1986 .

[6]  P. Hougaard A class of multivanate failure time distributions , 1986 .

[7]  Ross D. Shachter Evaluating Influence Diagrams , 1986, Oper. Res..

[8]  J. Robins Addendum to “a new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect” , 1987 .

[9]  J. Robins Errata to “a new approach to causal intefence in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect” Mathl Modelling 7(9–12), 1393–1512 (1986) , 1987 .

[10]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[11]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[12]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[13]  V. Strecher,et al.  Do tailored behavior change messages enhance the effectiveness of health risk appraisal? Results from a randomized trial. , 1996, Health education research.

[14]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[15]  R. McMahon,et al.  Family-Based Intervention in the Fast Track Program , 1996 .

[16]  James M. Robins,et al.  Causal Inference from Complex Longitudinal Data , 1997 .

[17]  Ross D. Shachter,et al.  Representation and Analysis of Medical Decision Problems with Influence Diagrams , 1997, Medical decision making : an international journal of the Society for Medical Decision Making.

[18]  James M. Robins,et al.  Estimation of Effects of Sequential Treatments by Reparameterizing Directed Acyclic Graphs , 1997, UAI.

[19]  A E Gelfand,et al.  Approaches for optimal sequential decision analysis in clinical trials. , 1998, Biometrics.

[20]  C. Bielza,et al.  Decision Analysis by Augmented Probability Simulation , 1999 .

[21]  V. Strecher,et al.  One size does not fit all: The case for tailoring print materials , 1999, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[22]  James M. Robins,et al.  Causal inference for complex longitudinal data: the continuous case , 2001 .

[23]  K. Dodge,et al.  Initial impact of the Fast Track prevention trial for conduct problems: I. The high-risk sample. Conduct Problems Prevention Research Group. , 1999, Journal of consulting and clinical psychology.

[24]  Laura R. Olevitch,et al.  Tailoring Health Messages: Customizing Communication With Computer Technology , 1999 .

[25]  David J. Spiegelhalter,et al.  Probabilistic Networks and Expert Systems , 1999, Information Science and Statistics.

[26]  K. Dodge,et al.  Initial impact of the Fast Track prevention trial for conduct problems: II. Classroom effects. Conduct Problems Prevention Research Group. , 1999, Journal of consulting and clinical psychology.

[27]  Michael I. Jordan,et al.  Probabilistic Networks and Expert Systems , 1999 .

[28]  Philip W. Lavori,et al.  A design for testing clinical strategies: biased adaptive within‐subject randomization , 2000 .

[29]  John A. Bather,et al.  Decision Theory: An Introduction to Dynamic Programming and Sequential Decisions , 2000 .

[30]  J. Robins,et al.  Sensitivity Analysis for Selection bias and unmeasured Confounding in missing Data and Causal inference models , 2000 .

[31]  Ree Dawson,et al.  Flexible treatment strategies in chronic disease: clinical and research implications , 2000, Biological Psychiatry.

[32]  J M Robins,et al.  Marginal Mean Models for Dynamic Regimes , 2001, Journal of the American Statistical Association.

[33]  Steffen L. Lauritzen,et al.  Representing and Solving Decision Problems with Limited Information , 2001, Manag. Sci..

[34]  S. Murphy,et al.  Design and evaluation of adaptive preventive interventions , 2001 .

[35]  Joseph B. Kadane,et al.  A Gridding Method for Sequential Analysis Problems , 2001 .

[36]  Pedro Larrañaga,et al.  An Introduction to Probabilistic Graphical Models , 2002, Estimation of Distribution Algorithms.