Assessing dynamic treatment strategies

Of interest here are sequential data-gathering and decision-making processes, started in the preceding chapter in this volume. The archetypical context is that of a sequence of medical decisions, taken at different time points during the follow-up of the patient, each decision involving choice of a treatment in the light of any interim responses or adverse reactions to earlier treatments. The problem consists of predicting the consequences that a (possibly new and possibly hypothetical) treatment plan will have on a future patient, on the basis of what we have learnt from the performances of past medical decision makers on past patients. While we make constant reference to a medical application context, the scope of the method is of much broader relevance. Our approach to this problem owes immensely to the seminal work of James Robins. In a rich series of papers, Robins (1986, 1987, 1988, 1989, 1992, 1997, 2000, 2004), Robins and Wasserman (1997) and Robins et al. (1999) introduce the idea of different treatment strategies applied in different hypothetical studies (these studies being analogous to our notion of ?regimes'). These papers examine conditions on the relationships between these studies, under which those treatment strategies can be compared. Among these, the sequential ran-domization condition is closely related to the ?stability' assumption used in this chapter. Furthermore, Robins (1986) introduced the G-computation algorithm (a special version of dynamic programming) to evaluate a sequential treatment strategy, which works where stan-dard regression ? even under the sequential randomization assumption ? fails. Most of the concepts expounded in this chapter have a counterpart in Robins' work, although, in the interest of readability, we shall frequently abstain from making the relationships with his work explicit. We shall assume that we have an idea of which variables informed past decisions, without assuming that these variables have all been observed in the past. Nor shall we assume that the rules that governed past decisions are discernible or similar to those that will guide future actions. In contrast with the previous chapter, we do use (fully stochastic) causal diagrams (more precisely, influence diagrams). In the context of sequential plan identification, graphical rep-resentations of causality were first advocated by Pearl and Robins (1995). In accord with these authors, we assume that the relevant causal knowledge is encoded in the form of a diagram with a completely specified topology, whose associated numerical conditional probabilities are given only for a subset of its variables, so-called observed variables. The remaining variables in the diagram are ?unobserved' or ?latent'. We shall, however, introduce a form of graphical representation of causality that differs from Pearl and Robins's diagrams in some respects. Part of the problem will be to characterize situations in which the estimation of the causal effects of the treatment plan of interest is not invalidated by unobserved confounding introduced by the latent variables. Throughout this chapter we use previous results by Dawid and Didelez (2010). Elja Arjas, both in his chapter (Chapter 7) in this volume and in the joint paper Arjas and Parner (2004), considers two probability models. One (called the ?obs' model) models the observational study that has generated the data. The other (called the ?ex' model) models the consequences of the future (perhaps hypothetical) application of the treatment plan of interest. In our approach, this distinction is embodied in a decision parameter, called the ?regime indicator', which indicates the particular regime, ?obs' or ?ex', in operation. Supplementing the set of domain variables with the mentioned regime indicator results in an ?augmented' set of variables. In our approach, conditions for the equivalence of Arjas's ?obs' and ?ex' distributions are phrased in terms of conditional independence conditions on this augmented set. Both chapters, ours and that of Arjas, avoid formulations based on ideas of potential outcomes or counterfactuals. We illustrate the methods and their motivations with the aid of two examples. Following Robins, we choose the first example in the area of the treatment of HIV infection. The second example will be in the treatment of abdominal aortic aneurysm. This chapter is about the identifiability of treatment plan effects, rather than about the methods of estimation and computation one is supposed to use if the problem turns out to be identifiable. The latter problems are discussed in greater detail in Chapter 17 in this volume, by Daniel, De Stavola and Cousens, who illustrate their method with the aid of the same HIV example we use here.

[1]  J. Robins Structural Nested Failure Time Models , 2005 .

[2]  Tyler VanderWeele,et al.  Causality, 2nd edn , 2011 .

[3]  A. Philip Dawid,et al.  Identifying the consequences of dynamic treatment strategies: A decision-theoretic overview , 2010, ArXiv.

[4]  A. Whyte Beware of the dog. , 2010, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[5]  Jin Tian,et al.  Identifying Dynamic Sequential Plans , 2008, UAI.

[6]  Judea Pearl,et al.  Identification of Joint Interventional Distributions in Recursive Semi-Markovian Causal Models , 2006, AAAI.

[7]  Carlo Berzuini,et al.  Effectiveness of potent antiretroviral therapy on progression of human immunodeficiency virus: Bayesian modelling and model checking via counterfactual replicates , 2004 .

[8]  E. Arjas,et al.  Causal Reasoning from Longitudinal Data * , 2004 .

[9]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[10]  Ap Dawid,et al.  Causal inference using influence diagrams: The problem of partial compliance (with Discussion) , 2003 .

[11]  A. Dawid Influence Diagrams for Causal Modelling and Inference , 2002 .

[12]  J. Robins,et al.  Estimation of the Causal Effect of a Time-Varying Exposure on the Marginal Mean of a Repeated Binary Outcome , 1999 .

[13]  James M. Robins,et al.  Estimation of Effects of Sequential Treatments by Reparameterizing Directed Acyclic Graphs , 1997, UAI.

[14]  James M. Robins,et al.  Causal Inference from Complex Longitudinal Data , 1997 .

[15]  James M. Robins,et al.  Probabilistic evaluation of sequential plans from causal models with hidden variables , 1995, UAI.

[16]  J. Robins Estimation of the time-dependent accelerated failure time model in the presence of confounding factors , 1992 .

[17]  J. Robins Addendum to “a new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect” , 1987 .

[18]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[19]  A. Dawid Conditional Independence in Statistical Theory , 1979 .