Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Uniformly minimum variance conditionally unbiased estimation in multi-arm multi-stage clinical trials

Uniformly minimum variance conditionally unbiased estimation in multi-arm multi-stage clinical... Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Biometrika (2018), 105,2, pp. 495–501 doi: 10.1093/biomet/asy004 Printed in Great Britain Advance Access publication 28 February 2018 Uniformly minimum variance conditionally unbiased estimation in multi-arm multi-stage clinical trials BY NIGEL STALLARD AND PETER K. KIMANI Warwick Medical School, University of Warwick, Coventry CV4 7AL, U.K. [email protected] [email protected] SUMMARY Multi-arm multi-stage clinical trials compare several experimental treatments with a control treatment, with poorly performing treatments dropped at interim analyses. This leads to inferential challenges, includ- ing the construction of unbiased treatment effect estimators. A number of estimators which are unbiased conditional on treatment selection have been proposed, but are specific to certain selection rules, may ignore the comparison to the control and are not all minimum variance. We obtain estimators for treatment effects compared to the control that are uniformly minimum variance unbiased conditional on selection with any specified rule or stopping for futility. Some key words: Adaptive seamless design; Drop-the-loser design; Point estimation; Treatment selection. 1. INTRODUCTION Multi-arm multi-stage clinical trials compare several treatments to a common control treatment in a single trial with treatments dropped at interim analyses if, based on observed data, they are not sufficiently promising. Such designs have been used by, for example, MacArthur et al. (2013) and Barker et al. (2014). The approach can yield sample size reduction and administrative savings relative to running several two- arm trials, but presents challenges in terms of statistical analysis similar to those of post-model selection inference (Efron, 2014). Proposed analysis methods have mainly focused on frequentist hypothesis tests (Thall et al., 1988, 1989; Stallard & Todd, 2003; Stallard & Friede, 2008; Magirr et al., 2012; Wason et al., 2017), with less work on estimation. Cohen & Sackrowitz (1989) consider two-stage designs with the treatment with the highest observed stage 1 average continuing to stage 2, with equal variances for the averages for each treatment in stage 1 and the stage 2 treatment. They derive an estimator of the stage 2 treatment mean that is uniformly minimum variance conditionally unbiased given the observed ordering of stage 1 treatment means. Bowden & Glimm (2008) extend the method to allow different variances for the treatment means and continuation to stage 2 of the s treatments with the largest observed stage 1 means. They provide expressions for uniformly minimum variance conditionally unbiased estimators for the means for these s treatment arms, again conditioning on the ordering of the stage 1 averages. Like Cohen & Sackrowitz (1989), they do not consider estimation relative to a control group. Kimani et al. (2013), Bowden & Glimm (2014) and Robertson et al. (2016) derive conditionally unbiased estimators for the difference between selected treatments and a control in multi-arm multi-stage trials. Kimani et al. (2013) allow for stopping for futility at stage 1 of a two-stage trial assuming common variances for averages in different arms in stage 1, while Bowden & Glimm (2014) and Robertson et al. (2016) allow for different variances in different arms and correlation between these respectively. Bowden & Glimm (2014) also allow for more than two stages. As discussed below, the Kimani et al. (2013) and Robertson et al. (2016) estimators are conditionally unbiased but not generally minimum variance. We obtain minimum variance conditionally unbiased estimators in these settings. 2018 Biometrika Trust This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/ licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 496 N. STALLARD AND P. K. KIMANI These methods present estimators that are unbiased conditional on selection of the treatment arms with the largest observed stage 1 averages, possibly with the additional condition that these means are sufficiently larger than that for the control. In practice, other selection rules may be used, such as in § 3 below. We show how uniformly minimum variance conditionally unbiased estimators may be obtained for comparisons with the control conditioning on a treatment not being dropped from the trial using any specified rule for selection or stopping for futility. 2. UNIFORMLY MINIMUM VARIANCE CONDITIONALLY UNBIASED ESTIMATION 2·1. A uniformly minimum variance conditionally unbiased estimator Consider a multi-arm multi-stage clinical trial with up to r stages where, in stage j, patients are random- ized to a control treatment, treatment 0, or experimental treatments, labelled i (i ∈  ), with  ⊇ ··· ⊇  . j 1 r Without loss of generality, label treatments such that  ={1, ... , s } for some s ( j = 1, ... , r). Denoting j j j the total number of experimental treatments, s ,by k, let r = max{ j : s  i} (i = 1, ... , k), so that 1 i j treatment i is included in stages 1, ... , r , and set r = r. i 0 Let X denote the stagewise average for treatment i in stage j (i = 0, ... , k; j = 1, ... , r ), with ij i T T T T T X = (X , ... , X ) , X = (X , ... , X ) and X = (X , ... , X ) . Assume that the X are jointly i· i1 ir ·j 0j s j j ij ·1 ·j i j −1 sufficient for μ , ... , μ , with X ∼ N (μ , τ ) independent with τ known. Other cases, such as in § 3 0 k ij i ij ij below, may use normal approximations or estimated variances. Experimental treatments are selected to continue, along with the control, to stage j + 1 depending onX according to some prespecified rule. Some possible rules are discussed below. Define θ = μ − μ (i = 1, ... , k). We wish to estimate θ (i = 1, ... , s ). In particular, we will obtain i i 0 i r uniformly minimum variance conditionally unbiased estimators of θ , ... , θ conditional on the event, 1 s Q, that treatments 1, ... , s are selected to continue to the end of the trial according to the prespecified selection rule. T T −1 T Let τ = (τ , ... , τ ) , τ = τ + ··· + τ , Z = X τ τ (i = 0, ... , k), Z = (Z , ... , Z ) and i· i1 ir i i1 ir i i· i 0 k i i i· k k θ = τ μ / τ . The Appendix shows that θ is orthogonal to θ , in that the information matrix k+1 i i i k+1 i i=0 i=0 term i = 0(Cox & Reid, 1987) (i = 1, ... , k) and Z is complete and sufficient for θ = (θ , ... , θ ) . θ θ 1 k+1 k+1 Let Y = X − X (i = 1, ... , s ); then Y is unbiased for θ and, since Y is independent ofX and hence i ir 0r r i i i r−1 of Q, is also conditionally unbiased for θ given Q. Thus, by the Rao–Blackwell theorem, E(Y | Z, Q) is i i a uniformly minimum variance conditionally unbiased estimator for θ given Q (i = 1, ... , s ). i r Since, given the specified selection rule, the event Q depends only on X , r−1 E(Y | Z,X )f (X | Z ) dX i r−1 r−1 r−1 E(Y | Z, Q) = , (1) f (X | Z ) dX r−1 r−1 with the integrals taken over the region Q corresponding to thoseX for which treatments 1, ... , s will r−1 r be selected to continue to stage r. The denominator is the probability of Q given Z, which will be denoted by pr(Q | Z ). As ⎧ ⎛ ⎞ ⎫ −1 −1 τ 0 ··· 0 τ ⎪ i1 i ⎪ ⎪ ⎛ ⎞ ⎪ ⎪ ⎪ ⎪ ⎜ ⎟ ⎪ ⎛ ⎞ μ ⎪ i · · ⎪ X ⎪ ⎜ ⎟ ⎪ i1 −1 · · · ⎪ ⎪ ⎪ ⎜ ⎟ ⎜ 0 τ · · ⎟ ⎪ ⎪ · i2 ⎪ ⎜ ⎟ ⎨ ⎜ ⎟ ⎜ ⎟ ⎬ ⎜ ⎟ ⎜ · ⎟ ⎜ ⎟ · · · · ⎜ ⎟ ∼ N ⎜ ⎟ ,⎜ ⎟ · · · · · · · 0 · ⎪ ⎜ ⎟ ⎜ ⎟ ⎪ ⎝ ⎠ ⎪ ⎪ ⎪ · ⎜ ⎟ ⎪ i,r −1 ⎝ ⎠ ⎪ ⎪ ⎪ ⎜ ⎟ ⎪ ⎪ ⎪ Z · ⎪ −1 ⎪ i ⎝ ⎠ ⎪ ⎪ μ 0 ··· 0 τ · ⎪ i i,r −1 ⎪ ⎩ ⎭ −1 −1 τ ··· ··· ··· τ i i and, for i = | i and any j, j , X is independent of X   and Z  , we have that ij i j i f (X | Z ) = φ {(X , ... , X ) ; μ ˜ ,  }, (2) r−1 r −1 i1 ir −1 i i i i i=0 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 497 where φ (X ; μ, ) denotes the n-dimensional multivariate normal density with mean μ and variance matrix −1 −1 −1 −1 , evaluated at X , with μ ˜ = (Z , ... , Z ) and  having diagonal elements (τ − τ , ... , τ − τ ) i i i i i1 i i,r −1 i −1 and all off-diagonal elements equal to −τ .If r = 1, X given Z has a degenerate distribution, since i i1 X = Z and τ = τ . i1 i i1 i r−1 −1 Since Y = X − X and X = τ (τ Z − X τ )(i = 0, ... , s ), i ir 0r ir i i ij ij r ir j=1 r−1 −1 −1 −1 −1 E(Y | Z,X ) = τ τ Z − τ τ Z − τ τ X − τ τ X . i r−1 i i 0 0 ij ij 0j 0j ir 0r ir 0r j=1 Thus (1)gives r−1 −1 −1 −1 −1 −1 E(Y | Z, Q) = τ τ Z − τ τ Z − pr(Q | Z ) τ τ X − τ τ X f (X | Z ) dX . (3) i i i 0 0 ij ij 0j 0j r−1 r−1 ir 0r ir 0r j=1 An important special case arises when the selection rule does not depend on X , the observed averages 0· for the control arm. In the integral over the event Q, the range of integration with respect to X (i = ij 1, ... , k; j = 1, ... , r ) then does not depend on X and the integration with respect to the elements of X , i 0· 0· i.e., X , ... , X , is over the whole real line. Thus 01 0,r−1 r−1 r−1 −1 −1 −1 pr(Q | Z ) τ X τ f (X | Z ) dX = τ τ E(X | Z ), 0j 0j j−1 r−1 0j 0j 0r 0r j=1 j=1 and since E(X | Z ) = Z (j = 1, ... , r − 1),wehave 0j 0 r−1 −1 −1 −1 E(Y | Z, Q) = τ τ Z − pr(Q | Z ) τ X τ f (X | Z ) dX − Z . (4) i i i ij ij r−1 r−1 0 ir ir j=1 In this case the uniformly minimum variance conditionally unbiased estimator for θ is the difference between the uniformly minimum variance conditionally unbiased estimator of μ , which can be calculated ignoring the observed value of X , and Z , the usual uniformly minimum variance unbiased estimator of 0· 0 μ , which does not depend on the selection. The integrals in (3) generally cannot be evaluated using standard functions. A numerical approach is Monte Carlo integration with rejection sampling, simulating X from (2), its conditional distribution r−1 given Z, accepting those in Q for the specified selection rule. This approach can be used for any selection rule with treatments proceeding to stage j + 1 dependent on X . 2·2. Selection of the best-performing treatment in a two-stage trial Much previous work on conditionally unbiased estimation in clinical trials with treatment selection has focused on the special case of r = 2 and s = 1, with the experimental treatment having the largest observed stage 1 mean continuing along with the control to stage 2. Thus Q is the event X > m with m = max(X , ... , X ).As Q is independent of X , the uniformly minimum variance conditionally 21 k1 0· unbiased estimator for θ is (4). Since treatments 2, ... , k are observed in stage 1 only, we have Z = 1 i X (i = 2, ... , k) so that the numerator and denominator in the fractional term in (4) are respectively i1 −1 −1 −1 τ τ X f (X | Z ) dX and pr(X > m | Z ).As X | Z ∼ N (Z , v ) with v = τ − τ , this 11 11 11 1 11 11 1 11 1 1 1 1 12 11 1 1/2 ˜ ˜ ˜ integral and probability are Z {1 − (Z )}+ v φ(Z ) (Todd et al., 1996) and 1 − (Z ) respectively, 1 1 1 1 −1/2 1/2 −1 ˜ ˜ ˜ where Z = (m − Z )v . Thus E(Y | Z, Q) = Z − Z − τ τ v φ(Z )/{1 − (Z )}, confirming 1 1 1 1 0 11 1 1 1 12 1 this to be the difference between the uniformly minimum variance conditionally unbiased estimator for μ ignoring the control treatment given by Bowden & Glimm (2008) and Z . Kimani et al. (2013) consider a similar two-stage trial in which treatment 1 is selected if X > m but continues to the second stage only if X > X + c for some specified c, the trial otherwise stopping 11 01 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 498 N. STALLARD AND P. K. KIMANI early. In this case, Q is the event X > max(m, X + c). Since this depends on X , the form (3) must 11 01 01 be used rather than (4) to give the uniformly minimum variance conditionally unbiased estimator of θ . The integral in (3) is taken over X and X , but may be rewritten in terms of X and (X − X ), 11 01 11 11 01 noting that X Z v v 11 1 1 1 Z ∼ N , X − X Z − Z v v 11 01 1 0 1 2 −1 −1 with v as above and v = v + τ − τ . Since Q is the rectangular region {X > m, (X − X )> c}, 1 2 1 11 11 01 01 0 (3) is a truncated bivariate normal expectation and can be evaluated using Rosenbaum (1961) to obtain the uniformly minimum variance conditionally unbiased estimator −1 −1 −1 −1 τ τ Z − τ τ E(X | Z, Q) − τ τ Z − τ τ E(X | Z, Q), 1 1 11 11 0 0 01 01 12 12 02 02 1/2 1/2 1/2 where E(X | Z, Q) = v A + Z and E(X | Z, Q) = v A − v B + Z with 11 1 01 0 1 1 2 ˜ ˜ φ(a){1 − (b)}+ ρφ(b){1 − (a ˜ )} ρφ(a){1 − (b)}+ φ(b){1 − (a ˜ )} A = , B = ; U (a, b; ρ) U (a, b; ρ) −1/2 −1/2 2 −1/2 2 −1/2 where a = (m − Z )v , b = (c − Z + Z )v , a ˜ = (a − ρb)(1 − ρ ) , b = (b − ρa)(1 − ρ ) , 1 1 0 1 2 2 T ρ = v /v and U (a, b; ρ) = pr(u > a, u > b) for (u , u ) standard bivariate normal with correlation 1 2 1 2 1 2 ρ. This is not the estimator proposed by Kimani et al. (2013), which is conditionally unbiased but not minimum variance. Robertson et al. (2016) also consider two-stage trials with early stopping for futility, though they do not assume equal variances and consider ranking by standardized observed stage 1 treatment effect estimates, −1 −1 −1/2 R = (X − X )(τ + τ ) , assuming treatment 1 is selected if R > max{R , ... , R , c} for specified i i1 01 1 2 k i1 01 c. In constructing their estimators, they condition on statistics based on the observed treatment effects, X − X (i = 1, ... , k; j = 1, 2). Since these may not be sufficient, for example in the case of normal data ij 0j with a common control, their estimator is also conditionally unbiased but not minimum variance; indeed they show that in the common variance case it has larger variance than the estimator proposed by Kimani et al. (2013). 3. NUMERICAL EXAMPLE The ADVENT trial (MacArthur et al., 2013) was a two-stage study comparing 125 mg, 250 mg and 500 mg doses of crofelemer with a placebo in noninfectious chronic diarrhoea in HIV-seropositive patients. The primary endpoint was clinical response, defined as at most two watery stools per week during at least two of four weeks of treatment. At the end of stage 1, based on data from 200 patients randomized equally between the four treatment arms, a single dose of crofelemer would continue with the control to stage 2, with a further 150 patients randomized equally between these two groups. In the absence of safety concerns, the dose selected would be the lowest dose with an observed clinical response rate within two percentage points of the best-performing dose. Although not explicitly stated by MacArthur et al. (2013), we assume that the trial would have stopped at the first stage if the best-performing dose did not have observed clinical response rate at least two percentage points above the placebo. The trial was analysed using the method of Posch et al. (2005) to control the familywise Type I error rate, but apparently no attempt was made to obtain unbiased estimators of the treatment effect for the selected dose. The U.S. Food and Drug Administration (2012) report gives results of the two stages of the study. In stage 1, 50, 44, 54 and 46 patients received the placebo and the three doses respectively, with 1, 9, 5 and 9 patients in these groups showing clinical response. In stage 2, 88 and 92 further patients received the placebo and 125 mg crofelemer respectively, with 10 and 15 demonstrating clinical response. A naive estimate of the effect of the 125 mg dose relative to placebo is thus 24/136 − 11/138 = 0·097. Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 499 Table 1. Simulated probability of selection and bias (with root mean squared error in paren- theses), with all values multiplied by 100, for the estimator given by (3) and the naive estimator for θ conditional on selection of treatment i (i = 1, ... ,3) forarangeof μ values (μ = 0·02 in all cases) (μ , μ , μ ) pr(selected) Estimator from (3) Naive estimator 1 2 3 12 3 θ θ θ θ θ θ 1 2 3 1 2 3 (0·02, 0·02, 0·02) 85 4 −0·2 (1·5) −0·2 (1·5) −0·1 (1·5) 1·6 (2·0) 1·7 (2·2) 1·9 (2·3) (0·02, 0·02, 0·2)<1 <199 −0·4 (1·3) −0·4 (1·4) −0·7 (3·9) 1·7 (2·1) 1·7 (2·1) −0·4 (3·7) (0·02, 0·1, 0·2)<114 86 −0·2 (1·5) −0·7 (3·3) −0·6 (3·9) 2·0 (2·6) 1·4 (3·1) 0·0 (3·6) (0·02, 0·2, 0·2)<161 39 −0·2 (0·9) −0·7 (4·1) −0·5 (4·2) 2·1 (2·4) 0·5 (3·6) 1·2 (3·7) (0·1, 0·2, 0·2) 558 37 −0·6 (3·3) −0·7 (4·1) −0·5 (4·2) 2·4 (3·6) 0·6 (3·6) 1·2 (3·7) (0·2, 0·2, 0·2) 46 31 23 −0·7 (4·2) −0·5 (4·2) −0·5 (4·3) 1·0 (3·6) 1·5 (3·8) 1·9 (3·9) For illustrative purposes, we treat estimated event rates as asymptotically normally distributed with variances based on the observed responses in each group and stage and set (X , ... , X ) = 01 31 (0·02, 0·2045, 0·0926, 0·1956), (τ , ... , τ ) = (2551, 270·4, 642·1, 292·3), (X , X ) = (0·1136, 01 31 02 12 0·1630) and (τ , τ ) = (873·7, 674·2). Based on these results, we obtained an estimate from (3), calcu- 02 12 lating the integrals using 100 000 simulations conditional on 125 mg being the lowest dose with observed clinical response rate within two percentage points of the best-performing dose, giving an estimate of 0·114. Properties of the estimator (3) were assessed in a realistic setting via simulations based on the ADVENT trial. For a range of μ values, for each of 100 000 simulated trials, we simulated B ∼ B(50, μ )(i = i1 i 0, ... ,3), adding or subtracting 1 if B = 0 or 50 to enable variance estimation. We then set X = B /50 i1 i1 i1 −1 and τ = X (1 − X )/50 (i = 0. ... ,3).For I = min{i ∈{0, ... ,3} : X  max  {X  }− 0·02}, i1 i1 i1 i =0,...,3 i 1 i1 the trial was assumed to stop at the end of stage 1 if I = 0; otherwise, we simulated B ∼ B (75, μ ) i2 i i −1 (i = 0, I ), again adding or subtracting 1 if B = 0 or 75, and set X = B /75 and τ = X (1 − X )/75. i2 i2 i2 i2 i2 i2 The estimate from (3) for θ , conditional on treatment I being selected, was obtained using 10 000 Monte Carlo simulations to evaluate the integrals, and was compared with the naive estimator (B + B )/125 − I 1 I 2 (B + B )/125. Table 1 gives simulated selection probabilities and the bias and root mean squared error 01 02 for the naive estimator and estimator (3). Bias estimates have standard error of at most 0·0002, with the T T exception of those for θ when μ = (0·02, 0·02, 0·2, 0·2) and for θ when μ = (0·02, 0·2, 0·2, 0·2) , with 1 3 standard errors of 0·04 and 0·03 respectively, and estimators for treatments selected with a probability of less than 0·01. Recall that if observed differences between treatments are small, the selection rule favours treatments with the lowest indices, explaining the differences in selection probabilities for treatments with the same mean. The naive estimator is conditionally biased, overestimating the true effect. The bias is relatively small, but in some cases it is, like the difference between the naive and new estimates for the ADVENT data reported above, close to the difference in clinical response rates of two percentage points considered T T important in the trial design. Only settings with μ = (0·02, 0·02, 0·02, 0·2) and (0·02, 0·02, 0·1, 0·2) , where the highest dose is nearly always selected, have bias near zero. The estimator (3) has bias near zero, though given that the derivation is based on assumed normality with known variance, it may not be unbiased; the simulated bias is negative in all cases, suggesting that the true treatment effect is slightly underestimated. The root mean squared errors of the two estimators are similar, though with some sug- gestion that the naive estimator has slightly smaller mean squared error in situations in which it has larger bias, consistent with theoretical results in settings in which the single most promising treatment is selected (Stallard et al., 2008; Bauer et al., 2010). Given the simulation results, it is interesting that the estimate from (3) using the observed ADVENT trial data is larger than the naive estimate. This appears to be due to the large difference between X and X , the placebo arm means in the two stages. A full analysis might 01 02 involve investigation of possible causes for this difference. The estimators proposed by Kimani et al. (2013) and Robertson et al. (2016) were not included in the simulations as they are not applicable for the selection rule used. We conducted 10 additional simulations Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 500 N. STALLARD AND P. K. KIMANI with r = k = 2 and X ∼ N (0, 1)(i = 0, 1, 2; j = 1, 2), selecting the treatment with the largest X ij i1 to continue to stage 2 provided X − X > c, when the Kimani et al. (2013) estimator is conditionally i1 01 unbiased and has smaller mean squared error than that of Robertson et al. (2016). If c →−∞, so there is no stopping for futility, the Kimani et al. (2013) estimator and (3) correspond, with root mean squared error of 1·08. For c = 0, the two estimators have root mean squared error of 1·23 and 1·21 respectively, with these increasing to 1·34 and 1·31 respectively for c = 2. The probabilities of stopping for futility in these two cases are 0·64 and 0·87. 4. DISCUSSION The method proposed gives uniformly minimum variance conditionally unbiased estimators in multi-arm multi-stage clinical trials. Estimators are obtained for treatment effects relative to the control for treatments continuing to the end of the trial conditional on the event, Q, that these treatments are selected to do so. The conditioning event is weaker than that considered by Bowden & Glimm (2008) and Cohen & Sackrowitz (1989), who condition on the ordering of stage 1 treatment means. We consider conditioning on Q to be appropriate since decisions regarding effectiveness of treatments are likely to depend on treatments continuing to the end of the trial but not on the ordering of continuing treatments at earlier analyses. The conditionally unbiased estimators are identical for a two-stage design with only the best-performing treatment continuing along with the control to stage 2, as shown in § 2·2 above. We have considered the total number of treatments and selection rule to be specified in advance. Others have proposed more flexible approaches, such as in the STAMPEDE trial (Sydes et al., 2012), where treatment arms were added during the trial. Our method might not yield conditionally unbiased estimators in such a trial. In a fully flexible approach, it is not clear that it is even possible to define the bias, as this would be an expectation over an unspecified sample space. ACKNOWLEDGEMENT We thank the referees for their comments and the Medical Research Council for funding. APPENDIX Construction of sufficient statistics for θ , ... , θ 1 k+1 Let θ be orthogonal to θ , that is, the information matrix term i = 0 (i = 1, ... , k) (Cox & Reid, k+1 i θ θ k+1 k+1 1987), and write μ = d θ for some d ( j = 1, ... , k + 1). Since we may scale θ arbitrarily and 0 j j j k+1 j=1 retain orthogonality, set d = 1toget μ = d θ + θ and k+1 0 j j k+1 j=1 μ = d θ + θ + θ (i = 1, ... , k). (A1) i j j k+1 i j=1 T T T Let X = (X , ... , X ) ; then E(X ) = θ where is a ( r ) × (k + 1) matrix with blocks 0· k· i=0 of r rows equal to (d , ... , d ,1) + 1 (i = 0, ... , k); here 1 denotes the (k + 1) row vector with i 1 k i i element i equal to 1 (i = 1, ... , k) and other elements equal to 0, and 1 = (0, ... ,0) .We have −1 −1 −1 −1 var(X ) = T , diagonal with elements (τ , ... , τ , ... , τ , ... , τ ). The likelihood is proportional to 01 0r k1 kr 0 k T −1 T −1 T −1 exp (X − θ ) T (X − θ ) = g(X )h(θ ) exp −2X T θ , for some g(X ) and h(θ ). Thus X T T −1 T is complete and sufficient for θ (Lehmann & Romano, 2005, § 4.3). Denoting X T by (W , ... , W ) , 1 k+1 k k we have W = X τ = Z τ and k+1 i· i i i=0 i· i=0 k k T T W = d X τ + X τ = d Z τ + Z τ (i = 1, ... , k). (A2) i i  i · i· i i i i i i · i· i =0 i =0 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 501 k k −1 For orthogonality (∂ θ /∂θ )T (∂ θ /∂θ ) = d τ + τ = 0, so d =−τ / τ and, from i k+1 i  i i i i  i i =0 i =0 k k k k k (A1) and (A2), θ = τ μ / τ , W = Z τ and W = τ Z − τ Z τ / τ k+1 i i i k+1 i i i i i i  i i  i i=0 i=0 i=0 i =0 i =0 (i = 1, ... , k),so Z is complete and sufficient. With no selection, i.e., s = k, Z and Z − Z are the usual uniformly minimum variance unbiased r i i 0 estimators for μ and θ (i = 1, ... , k). By the factorization theorem, sufficient statistics are the same for i i the conditional and unconditional likelihoods, so Z is also sufficient in this case. REFERENCES BARKER, K. L., JAVAID, M. K., NEWMAN, M., LOWE, C. M., STALLARD, N., CAMPBELL, H., GANDHI, V.&LAMB,S. (2014). Physiotherapy rehabilitation for osteoporotic vertebral fracture (PROVE): Study protocol for a randomised controlled trial. Trials 15, 22. BAUER, P., KÖNIG, F., BRANNATH,W. & POSCH, M. (2010). Selection and bias — two hostile brothers. Statist. Med. 29, 1–13. BOWDEN,J.&GLIMM, E. (2008). Unbiased estimation of selected means in two-stage trials. Biomet. J. 50, 515–27. BOWDEN,J.&GLIMM, E. (2014). Conditionally unbiased and near unbiased estimation of the selected treatment mean for multistage drop-the-loser trials. Biomet. J. 56, 332–49. COHEN, A.&SACKROWITZ, H. B. (1989). Two stage conditionally unbiased estimators of the selected mean. Statist. Prob. Lett. 8, 273–8. COX,D.&REID, N. (1987). Parameter orthogonality and approximate conditional inference (with Discussion). J. R. Statist. Soc. B 49, 1–39. EFRON, B. (2014). Estimation and accuracy after model selection. J. Am. Statist. Assoc. 109, 991–1007. KIMANI, P., TODD,S.&STALLARD, N. (2013). Conditionally unbiased estimation in phase II/III clinical trials with early stopping for futility. Statist. Med. 32, 2893–910. LEHMANN,E.&ROMANO, J. (2005). Testing Statistical Hypotheses. New York: Springer, 3rd ed. MACARTHUR, R. D., HAWKINS, T. N., BROWN, S. J., LAMARCA, A., CLAY, P. G., BARRETT, A. C., BORTEY, E., PATERSON, C., GOLDEN,P.L.&FORBES, W. P. (2013). Efficacy and safery of crofelemer for noninfectious diarrhea in HIV- seropositive individuals (ADVENT trial): A randomized, double-blind, placebo-controlled, two-stage study. HIV Clin. Trials 14, 261–73. MAGIRR, D., JAKI,T. & WHITEHEAD, J. (2012). A generalised Dunnett test for multi-arm, multi-stage clinical studies with treatment selection. Biometrika 99, 494–501. POSCH, M., KOENIG, F., BRANSON, M., BRANNATH, W., DUNGER-BALDAUF,C.&BAUER, P. (2005). Testing and estimation in flexible group sequential designs with adaptive treatment selection. Statist. Med. 24, 3697–714. ROBERTSON, D. S., PREVOST,A. T.&BOWDEN, J. (2016). Unbiased estimation in seamless phase II/III trials with unequal treatment effect variances and hypothesis-driven selection rules. Statist. Med. 35, 3907–22. ROSENBAUM, S. (1961). Moments of a truncated bivariate normal distribution. J. R. Statist. Soc. B 23, 405–8. STALLARD,N.&FRIEDE, T. (2008). A group-sequential design for clinical trials with treatment selection. Statist. Med. 27, 6209–27. STALLARD,N.& TODD, S. (2003). Sequential designs for phase III clinical trials incorporating treatment selection. Statist. Med. 22, 689–703. STALLARD, N., TODD,S.& WHITEHEAD, J. (2008). Estimation following selection of the largest of two normal means. J. Statist. Plan. Infer. 138, 1629–38. SYDES, M. R., PARMAR,M.K.B., MASON, M. D., CLARKE, N. W., AMOS, C., ANDERSON, J., DE BONO, J., DEARNALEY, D. P., DWYER, J., GREEN, C. et al. (2012). Flexible trial design in practice - stopping arms for lack-of-benefit and adding research arms mid-trial in STAMPEDE: A multi-arm multi-stage randomized controlled trial. Trials 13, THALL,P.F., SIMON,R.&ELLENBERG, S. S. (1988). Two-stage selection and testing designs for comparative clinical trials. Biometrika 75, 303–10. THALL,P.F., SIMON,R.&ELLENBERG, S. S. (1989). A two-stage design for choosing among several experimental treatments and a control in clinical trials. Biometrics 45, 537–47. TODD, S., WHITEHEAD,J.&FACEY, K. M. (1996). Point and interval estimation following a sequential clinical trial. Biometrika 83, 453–61. U.S. FOOD AND DRUG ADMINISTRATION (2012). Application Number 202292Orig1s000 Statistical Review. https://www.accessdata.fda.gov/drugsatfda_docs/nda/2012/202292Orig1s000StatR.pdf. Accessed 1 Feb 2017. WASON, J., STALLARD, N., BOWDEN,J.&JENNISON, C. (2017). A multi-stage drop-the-losers design for multi-arm clinical trials. Statist. Meth. Med. Res. 26, 508–24. [Received on 13 September 2017. Editorial decision on 7 January 2018] http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Biometrika Oxford University Press

Uniformly minimum variance conditionally unbiased estimation in multi-arm multi-stage clinical trials

Biometrika , Volume 105 (2) – Jun 1, 2018

Loading next page...
 
/lp/ou_press/uniformly-minimum-variance-conditionally-unbiased-estimation-in-multi-Zt06VLT3fU

References (24)

Publisher
Oxford University Press
Copyright
Copyright © 2022 Biometrika Trust
ISSN
0006-3444
eISSN
1464-3510
DOI
10.1093/biomet/asy004
Publisher site
See Article on Publisher Site

Abstract

Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Biometrika (2018), 105,2, pp. 495–501 doi: 10.1093/biomet/asy004 Printed in Great Britain Advance Access publication 28 February 2018 Uniformly minimum variance conditionally unbiased estimation in multi-arm multi-stage clinical trials BY NIGEL STALLARD AND PETER K. KIMANI Warwick Medical School, University of Warwick, Coventry CV4 7AL, U.K. [email protected] [email protected] SUMMARY Multi-arm multi-stage clinical trials compare several experimental treatments with a control treatment, with poorly performing treatments dropped at interim analyses. This leads to inferential challenges, includ- ing the construction of unbiased treatment effect estimators. A number of estimators which are unbiased conditional on treatment selection have been proposed, but are specific to certain selection rules, may ignore the comparison to the control and are not all minimum variance. We obtain estimators for treatment effects compared to the control that are uniformly minimum variance unbiased conditional on selection with any specified rule or stopping for futility. Some key words: Adaptive seamless design; Drop-the-loser design; Point estimation; Treatment selection. 1. INTRODUCTION Multi-arm multi-stage clinical trials compare several treatments to a common control treatment in a single trial with treatments dropped at interim analyses if, based on observed data, they are not sufficiently promising. Such designs have been used by, for example, MacArthur et al. (2013) and Barker et al. (2014). The approach can yield sample size reduction and administrative savings relative to running several two- arm trials, but presents challenges in terms of statistical analysis similar to those of post-model selection inference (Efron, 2014). Proposed analysis methods have mainly focused on frequentist hypothesis tests (Thall et al., 1988, 1989; Stallard & Todd, 2003; Stallard & Friede, 2008; Magirr et al., 2012; Wason et al., 2017), with less work on estimation. Cohen & Sackrowitz (1989) consider two-stage designs with the treatment with the highest observed stage 1 average continuing to stage 2, with equal variances for the averages for each treatment in stage 1 and the stage 2 treatment. They derive an estimator of the stage 2 treatment mean that is uniformly minimum variance conditionally unbiased given the observed ordering of stage 1 treatment means. Bowden & Glimm (2008) extend the method to allow different variances for the treatment means and continuation to stage 2 of the s treatments with the largest observed stage 1 means. They provide expressions for uniformly minimum variance conditionally unbiased estimators for the means for these s treatment arms, again conditioning on the ordering of the stage 1 averages. Like Cohen & Sackrowitz (1989), they do not consider estimation relative to a control group. Kimani et al. (2013), Bowden & Glimm (2014) and Robertson et al. (2016) derive conditionally unbiased estimators for the difference between selected treatments and a control in multi-arm multi-stage trials. Kimani et al. (2013) allow for stopping for futility at stage 1 of a two-stage trial assuming common variances for averages in different arms in stage 1, while Bowden & Glimm (2014) and Robertson et al. (2016) allow for different variances in different arms and correlation between these respectively. Bowden & Glimm (2014) also allow for more than two stages. As discussed below, the Kimani et al. (2013) and Robertson et al. (2016) estimators are conditionally unbiased but not generally minimum variance. We obtain minimum variance conditionally unbiased estimators in these settings. 2018 Biometrika Trust This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/ licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 496 N. STALLARD AND P. K. KIMANI These methods present estimators that are unbiased conditional on selection of the treatment arms with the largest observed stage 1 averages, possibly with the additional condition that these means are sufficiently larger than that for the control. In practice, other selection rules may be used, such as in § 3 below. We show how uniformly minimum variance conditionally unbiased estimators may be obtained for comparisons with the control conditioning on a treatment not being dropped from the trial using any specified rule for selection or stopping for futility. 2. UNIFORMLY MINIMUM VARIANCE CONDITIONALLY UNBIASED ESTIMATION 2·1. A uniformly minimum variance conditionally unbiased estimator Consider a multi-arm multi-stage clinical trial with up to r stages where, in stage j, patients are random- ized to a control treatment, treatment 0, or experimental treatments, labelled i (i ∈  ), with  ⊇ ··· ⊇  . j 1 r Without loss of generality, label treatments such that  ={1, ... , s } for some s ( j = 1, ... , r). Denoting j j j the total number of experimental treatments, s ,by k, let r = max{ j : s  i} (i = 1, ... , k), so that 1 i j treatment i is included in stages 1, ... , r , and set r = r. i 0 Let X denote the stagewise average for treatment i in stage j (i = 0, ... , k; j = 1, ... , r ), with ij i T T T T T X = (X , ... , X ) , X = (X , ... , X ) and X = (X , ... , X ) . Assume that the X are jointly i· i1 ir ·j 0j s j j ij ·1 ·j i j −1 sufficient for μ , ... , μ , with X ∼ N (μ , τ ) independent with τ known. Other cases, such as in § 3 0 k ij i ij ij below, may use normal approximations or estimated variances. Experimental treatments are selected to continue, along with the control, to stage j + 1 depending onX according to some prespecified rule. Some possible rules are discussed below. Define θ = μ − μ (i = 1, ... , k). We wish to estimate θ (i = 1, ... , s ). In particular, we will obtain i i 0 i r uniformly minimum variance conditionally unbiased estimators of θ , ... , θ conditional on the event, 1 s Q, that treatments 1, ... , s are selected to continue to the end of the trial according to the prespecified selection rule. T T −1 T Let τ = (τ , ... , τ ) , τ = τ + ··· + τ , Z = X τ τ (i = 0, ... , k), Z = (Z , ... , Z ) and i· i1 ir i i1 ir i i· i 0 k i i i· k k θ = τ μ / τ . The Appendix shows that θ is orthogonal to θ , in that the information matrix k+1 i i i k+1 i i=0 i=0 term i = 0(Cox & Reid, 1987) (i = 1, ... , k) and Z is complete and sufficient for θ = (θ , ... , θ ) . θ θ 1 k+1 k+1 Let Y = X − X (i = 1, ... , s ); then Y is unbiased for θ and, since Y is independent ofX and hence i ir 0r r i i i r−1 of Q, is also conditionally unbiased for θ given Q. Thus, by the Rao–Blackwell theorem, E(Y | Z, Q) is i i a uniformly minimum variance conditionally unbiased estimator for θ given Q (i = 1, ... , s ). i r Since, given the specified selection rule, the event Q depends only on X , r−1 E(Y | Z,X )f (X | Z ) dX i r−1 r−1 r−1 E(Y | Z, Q) = , (1) f (X | Z ) dX r−1 r−1 with the integrals taken over the region Q corresponding to thoseX for which treatments 1, ... , s will r−1 r be selected to continue to stage r. The denominator is the probability of Q given Z, which will be denoted by pr(Q | Z ). As ⎧ ⎛ ⎞ ⎫ −1 −1 τ 0 ··· 0 τ ⎪ i1 i ⎪ ⎪ ⎛ ⎞ ⎪ ⎪ ⎪ ⎪ ⎜ ⎟ ⎪ ⎛ ⎞ μ ⎪ i · · ⎪ X ⎪ ⎜ ⎟ ⎪ i1 −1 · · · ⎪ ⎪ ⎪ ⎜ ⎟ ⎜ 0 τ · · ⎟ ⎪ ⎪ · i2 ⎪ ⎜ ⎟ ⎨ ⎜ ⎟ ⎜ ⎟ ⎬ ⎜ ⎟ ⎜ · ⎟ ⎜ ⎟ · · · · ⎜ ⎟ ∼ N ⎜ ⎟ ,⎜ ⎟ · · · · · · · 0 · ⎪ ⎜ ⎟ ⎜ ⎟ ⎪ ⎝ ⎠ ⎪ ⎪ ⎪ · ⎜ ⎟ ⎪ i,r −1 ⎝ ⎠ ⎪ ⎪ ⎪ ⎜ ⎟ ⎪ ⎪ ⎪ Z · ⎪ −1 ⎪ i ⎝ ⎠ ⎪ ⎪ μ 0 ··· 0 τ · ⎪ i i,r −1 ⎪ ⎩ ⎭ −1 −1 τ ··· ··· ··· τ i i and, for i = | i and any j, j , X is independent of X   and Z  , we have that ij i j i f (X | Z ) = φ {(X , ... , X ) ; μ ˜ ,  }, (2) r−1 r −1 i1 ir −1 i i i i i=0 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 497 where φ (X ; μ, ) denotes the n-dimensional multivariate normal density with mean μ and variance matrix −1 −1 −1 −1 , evaluated at X , with μ ˜ = (Z , ... , Z ) and  having diagonal elements (τ − τ , ... , τ − τ ) i i i i i1 i i,r −1 i −1 and all off-diagonal elements equal to −τ .If r = 1, X given Z has a degenerate distribution, since i i1 X = Z and τ = τ . i1 i i1 i r−1 −1 Since Y = X − X and X = τ (τ Z − X τ )(i = 0, ... , s ), i ir 0r ir i i ij ij r ir j=1 r−1 −1 −1 −1 −1 E(Y | Z,X ) = τ τ Z − τ τ Z − τ τ X − τ τ X . i r−1 i i 0 0 ij ij 0j 0j ir 0r ir 0r j=1 Thus (1)gives r−1 −1 −1 −1 −1 −1 E(Y | Z, Q) = τ τ Z − τ τ Z − pr(Q | Z ) τ τ X − τ τ X f (X | Z ) dX . (3) i i i 0 0 ij ij 0j 0j r−1 r−1 ir 0r ir 0r j=1 An important special case arises when the selection rule does not depend on X , the observed averages 0· for the control arm. In the integral over the event Q, the range of integration with respect to X (i = ij 1, ... , k; j = 1, ... , r ) then does not depend on X and the integration with respect to the elements of X , i 0· 0· i.e., X , ... , X , is over the whole real line. Thus 01 0,r−1 r−1 r−1 −1 −1 −1 pr(Q | Z ) τ X τ f (X | Z ) dX = τ τ E(X | Z ), 0j 0j j−1 r−1 0j 0j 0r 0r j=1 j=1 and since E(X | Z ) = Z (j = 1, ... , r − 1),wehave 0j 0 r−1 −1 −1 −1 E(Y | Z, Q) = τ τ Z − pr(Q | Z ) τ X τ f (X | Z ) dX − Z . (4) i i i ij ij r−1 r−1 0 ir ir j=1 In this case the uniformly minimum variance conditionally unbiased estimator for θ is the difference between the uniformly minimum variance conditionally unbiased estimator of μ , which can be calculated ignoring the observed value of X , and Z , the usual uniformly minimum variance unbiased estimator of 0· 0 μ , which does not depend on the selection. The integrals in (3) generally cannot be evaluated using standard functions. A numerical approach is Monte Carlo integration with rejection sampling, simulating X from (2), its conditional distribution r−1 given Z, accepting those in Q for the specified selection rule. This approach can be used for any selection rule with treatments proceeding to stage j + 1 dependent on X . 2·2. Selection of the best-performing treatment in a two-stage trial Much previous work on conditionally unbiased estimation in clinical trials with treatment selection has focused on the special case of r = 2 and s = 1, with the experimental treatment having the largest observed stage 1 mean continuing along with the control to stage 2. Thus Q is the event X > m with m = max(X , ... , X ).As Q is independent of X , the uniformly minimum variance conditionally 21 k1 0· unbiased estimator for θ is (4). Since treatments 2, ... , k are observed in stage 1 only, we have Z = 1 i X (i = 2, ... , k) so that the numerator and denominator in the fractional term in (4) are respectively i1 −1 −1 −1 τ τ X f (X | Z ) dX and pr(X > m | Z ).As X | Z ∼ N (Z , v ) with v = τ − τ , this 11 11 11 1 11 11 1 11 1 1 1 1 12 11 1 1/2 ˜ ˜ ˜ integral and probability are Z {1 − (Z )}+ v φ(Z ) (Todd et al., 1996) and 1 − (Z ) respectively, 1 1 1 1 −1/2 1/2 −1 ˜ ˜ ˜ where Z = (m − Z )v . Thus E(Y | Z, Q) = Z − Z − τ τ v φ(Z )/{1 − (Z )}, confirming 1 1 1 1 0 11 1 1 1 12 1 this to be the difference between the uniformly minimum variance conditionally unbiased estimator for μ ignoring the control treatment given by Bowden & Glimm (2008) and Z . Kimani et al. (2013) consider a similar two-stage trial in which treatment 1 is selected if X > m but continues to the second stage only if X > X + c for some specified c, the trial otherwise stopping 11 01 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 498 N. STALLARD AND P. K. KIMANI early. In this case, Q is the event X > max(m, X + c). Since this depends on X , the form (3) must 11 01 01 be used rather than (4) to give the uniformly minimum variance conditionally unbiased estimator of θ . The integral in (3) is taken over X and X , but may be rewritten in terms of X and (X − X ), 11 01 11 11 01 noting that X Z v v 11 1 1 1 Z ∼ N , X − X Z − Z v v 11 01 1 0 1 2 −1 −1 with v as above and v = v + τ − τ . Since Q is the rectangular region {X > m, (X − X )> c}, 1 2 1 11 11 01 01 0 (3) is a truncated bivariate normal expectation and can be evaluated using Rosenbaum (1961) to obtain the uniformly minimum variance conditionally unbiased estimator −1 −1 −1 −1 τ τ Z − τ τ E(X | Z, Q) − τ τ Z − τ τ E(X | Z, Q), 1 1 11 11 0 0 01 01 12 12 02 02 1/2 1/2 1/2 where E(X | Z, Q) = v A + Z and E(X | Z, Q) = v A − v B + Z with 11 1 01 0 1 1 2 ˜ ˜ φ(a){1 − (b)}+ ρφ(b){1 − (a ˜ )} ρφ(a){1 − (b)}+ φ(b){1 − (a ˜ )} A = , B = ; U (a, b; ρ) U (a, b; ρ) −1/2 −1/2 2 −1/2 2 −1/2 where a = (m − Z )v , b = (c − Z + Z )v , a ˜ = (a − ρb)(1 − ρ ) , b = (b − ρa)(1 − ρ ) , 1 1 0 1 2 2 T ρ = v /v and U (a, b; ρ) = pr(u > a, u > b) for (u , u ) standard bivariate normal with correlation 1 2 1 2 1 2 ρ. This is not the estimator proposed by Kimani et al. (2013), which is conditionally unbiased but not minimum variance. Robertson et al. (2016) also consider two-stage trials with early stopping for futility, though they do not assume equal variances and consider ranking by standardized observed stage 1 treatment effect estimates, −1 −1 −1/2 R = (X − X )(τ + τ ) , assuming treatment 1 is selected if R > max{R , ... , R , c} for specified i i1 01 1 2 k i1 01 c. In constructing their estimators, they condition on statistics based on the observed treatment effects, X − X (i = 1, ... , k; j = 1, 2). Since these may not be sufficient, for example in the case of normal data ij 0j with a common control, their estimator is also conditionally unbiased but not minimum variance; indeed they show that in the common variance case it has larger variance than the estimator proposed by Kimani et al. (2013). 3. NUMERICAL EXAMPLE The ADVENT trial (MacArthur et al., 2013) was a two-stage study comparing 125 mg, 250 mg and 500 mg doses of crofelemer with a placebo in noninfectious chronic diarrhoea in HIV-seropositive patients. The primary endpoint was clinical response, defined as at most two watery stools per week during at least two of four weeks of treatment. At the end of stage 1, based on data from 200 patients randomized equally between the four treatment arms, a single dose of crofelemer would continue with the control to stage 2, with a further 150 patients randomized equally between these two groups. In the absence of safety concerns, the dose selected would be the lowest dose with an observed clinical response rate within two percentage points of the best-performing dose. Although not explicitly stated by MacArthur et al. (2013), we assume that the trial would have stopped at the first stage if the best-performing dose did not have observed clinical response rate at least two percentage points above the placebo. The trial was analysed using the method of Posch et al. (2005) to control the familywise Type I error rate, but apparently no attempt was made to obtain unbiased estimators of the treatment effect for the selected dose. The U.S. Food and Drug Administration (2012) report gives results of the two stages of the study. In stage 1, 50, 44, 54 and 46 patients received the placebo and the three doses respectively, with 1, 9, 5 and 9 patients in these groups showing clinical response. In stage 2, 88 and 92 further patients received the placebo and 125 mg crofelemer respectively, with 10 and 15 demonstrating clinical response. A naive estimate of the effect of the 125 mg dose relative to placebo is thus 24/136 − 11/138 = 0·097. Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 499 Table 1. Simulated probability of selection and bias (with root mean squared error in paren- theses), with all values multiplied by 100, for the estimator given by (3) and the naive estimator for θ conditional on selection of treatment i (i = 1, ... ,3) forarangeof μ values (μ = 0·02 in all cases) (μ , μ , μ ) pr(selected) Estimator from (3) Naive estimator 1 2 3 12 3 θ θ θ θ θ θ 1 2 3 1 2 3 (0·02, 0·02, 0·02) 85 4 −0·2 (1·5) −0·2 (1·5) −0·1 (1·5) 1·6 (2·0) 1·7 (2·2) 1·9 (2·3) (0·02, 0·02, 0·2)<1 <199 −0·4 (1·3) −0·4 (1·4) −0·7 (3·9) 1·7 (2·1) 1·7 (2·1) −0·4 (3·7) (0·02, 0·1, 0·2)<114 86 −0·2 (1·5) −0·7 (3·3) −0·6 (3·9) 2·0 (2·6) 1·4 (3·1) 0·0 (3·6) (0·02, 0·2, 0·2)<161 39 −0·2 (0·9) −0·7 (4·1) −0·5 (4·2) 2·1 (2·4) 0·5 (3·6) 1·2 (3·7) (0·1, 0·2, 0·2) 558 37 −0·6 (3·3) −0·7 (4·1) −0·5 (4·2) 2·4 (3·6) 0·6 (3·6) 1·2 (3·7) (0·2, 0·2, 0·2) 46 31 23 −0·7 (4·2) −0·5 (4·2) −0·5 (4·3) 1·0 (3·6) 1·5 (3·8) 1·9 (3·9) For illustrative purposes, we treat estimated event rates as asymptotically normally distributed with variances based on the observed responses in each group and stage and set (X , ... , X ) = 01 31 (0·02, 0·2045, 0·0926, 0·1956), (τ , ... , τ ) = (2551, 270·4, 642·1, 292·3), (X , X ) = (0·1136, 01 31 02 12 0·1630) and (τ , τ ) = (873·7, 674·2). Based on these results, we obtained an estimate from (3), calcu- 02 12 lating the integrals using 100 000 simulations conditional on 125 mg being the lowest dose with observed clinical response rate within two percentage points of the best-performing dose, giving an estimate of 0·114. Properties of the estimator (3) were assessed in a realistic setting via simulations based on the ADVENT trial. For a range of μ values, for each of 100 000 simulated trials, we simulated B ∼ B(50, μ )(i = i1 i 0, ... ,3), adding or subtracting 1 if B = 0 or 50 to enable variance estimation. We then set X = B /50 i1 i1 i1 −1 and τ = X (1 − X )/50 (i = 0. ... ,3).For I = min{i ∈{0, ... ,3} : X  max  {X  }− 0·02}, i1 i1 i1 i =0,...,3 i 1 i1 the trial was assumed to stop at the end of stage 1 if I = 0; otherwise, we simulated B ∼ B (75, μ ) i2 i i −1 (i = 0, I ), again adding or subtracting 1 if B = 0 or 75, and set X = B /75 and τ = X (1 − X )/75. i2 i2 i2 i2 i2 i2 The estimate from (3) for θ , conditional on treatment I being selected, was obtained using 10 000 Monte Carlo simulations to evaluate the integrals, and was compared with the naive estimator (B + B )/125 − I 1 I 2 (B + B )/125. Table 1 gives simulated selection probabilities and the bias and root mean squared error 01 02 for the naive estimator and estimator (3). Bias estimates have standard error of at most 0·0002, with the T T exception of those for θ when μ = (0·02, 0·02, 0·2, 0·2) and for θ when μ = (0·02, 0·2, 0·2, 0·2) , with 1 3 standard errors of 0·04 and 0·03 respectively, and estimators for treatments selected with a probability of less than 0·01. Recall that if observed differences between treatments are small, the selection rule favours treatments with the lowest indices, explaining the differences in selection probabilities for treatments with the same mean. The naive estimator is conditionally biased, overestimating the true effect. The bias is relatively small, but in some cases it is, like the difference between the naive and new estimates for the ADVENT data reported above, close to the difference in clinical response rates of two percentage points considered T T important in the trial design. Only settings with μ = (0·02, 0·02, 0·02, 0·2) and (0·02, 0·02, 0·1, 0·2) , where the highest dose is nearly always selected, have bias near zero. The estimator (3) has bias near zero, though given that the derivation is based on assumed normality with known variance, it may not be unbiased; the simulated bias is negative in all cases, suggesting that the true treatment effect is slightly underestimated. The root mean squared errors of the two estimators are similar, though with some sug- gestion that the naive estimator has slightly smaller mean squared error in situations in which it has larger bias, consistent with theoretical results in settings in which the single most promising treatment is selected (Stallard et al., 2008; Bauer et al., 2010). Given the simulation results, it is interesting that the estimate from (3) using the observed ADVENT trial data is larger than the naive estimate. This appears to be due to the large difference between X and X , the placebo arm means in the two stages. A full analysis might 01 02 involve investigation of possible causes for this difference. The estimators proposed by Kimani et al. (2013) and Robertson et al. (2016) were not included in the simulations as they are not applicable for the selection rule used. We conducted 10 additional simulations Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 500 N. STALLARD AND P. K. KIMANI with r = k = 2 and X ∼ N (0, 1)(i = 0, 1, 2; j = 1, 2), selecting the treatment with the largest X ij i1 to continue to stage 2 provided X − X > c, when the Kimani et al. (2013) estimator is conditionally i1 01 unbiased and has smaller mean squared error than that of Robertson et al. (2016). If c →−∞, so there is no stopping for futility, the Kimani et al. (2013) estimator and (3) correspond, with root mean squared error of 1·08. For c = 0, the two estimators have root mean squared error of 1·23 and 1·21 respectively, with these increasing to 1·34 and 1·31 respectively for c = 2. The probabilities of stopping for futility in these two cases are 0·64 and 0·87. 4. DISCUSSION The method proposed gives uniformly minimum variance conditionally unbiased estimators in multi-arm multi-stage clinical trials. Estimators are obtained for treatment effects relative to the control for treatments continuing to the end of the trial conditional on the event, Q, that these treatments are selected to do so. The conditioning event is weaker than that considered by Bowden & Glimm (2008) and Cohen & Sackrowitz (1989), who condition on the ordering of stage 1 treatment means. We consider conditioning on Q to be appropriate since decisions regarding effectiveness of treatments are likely to depend on treatments continuing to the end of the trial but not on the ordering of continuing treatments at earlier analyses. The conditionally unbiased estimators are identical for a two-stage design with only the best-performing treatment continuing along with the control to stage 2, as shown in § 2·2 above. We have considered the total number of treatments and selection rule to be specified in advance. Others have proposed more flexible approaches, such as in the STAMPEDE trial (Sydes et al., 2012), where treatment arms were added during the trial. Our method might not yield conditionally unbiased estimators in such a trial. In a fully flexible approach, it is not clear that it is even possible to define the bias, as this would be an expectation over an unspecified sample space. ACKNOWLEDGEMENT We thank the referees for their comments and the Medical Research Council for funding. APPENDIX Construction of sufficient statistics for θ , ... , θ 1 k+1 Let θ be orthogonal to θ , that is, the information matrix term i = 0 (i = 1, ... , k) (Cox & Reid, k+1 i θ θ k+1 k+1 1987), and write μ = d θ for some d ( j = 1, ... , k + 1). Since we may scale θ arbitrarily and 0 j j j k+1 j=1 retain orthogonality, set d = 1toget μ = d θ + θ and k+1 0 j j k+1 j=1 μ = d θ + θ + θ (i = 1, ... , k). (A1) i j j k+1 i j=1 T T T Let X = (X , ... , X ) ; then E(X ) = θ where is a ( r ) × (k + 1) matrix with blocks 0· k· i=0 of r rows equal to (d , ... , d ,1) + 1 (i = 0, ... , k); here 1 denotes the (k + 1) row vector with i 1 k i i element i equal to 1 (i = 1, ... , k) and other elements equal to 0, and 1 = (0, ... ,0) .We have −1 −1 −1 −1 var(X ) = T , diagonal with elements (τ , ... , τ , ... , τ , ... , τ ). The likelihood is proportional to 01 0r k1 kr 0 k T −1 T −1 T −1 exp (X − θ ) T (X − θ ) = g(X )h(θ ) exp −2X T θ , for some g(X ) and h(θ ). Thus X T T −1 T is complete and sufficient for θ (Lehmann & Romano, 2005, § 4.3). Denoting X T by (W , ... , W ) , 1 k+1 k k we have W = X τ = Z τ and k+1 i· i i i=0 i· i=0 k k T T W = d X τ + X τ = d Z τ + Z τ (i = 1, ... , k). (A2) i i  i · i· i i i i i i · i· i =0 i =0 Downloaded from https://academic.oup.com/biomet/article/105/2/495/4913762 by DeepDyve user on 18 July 2022 Miscellanea 501 k k −1 For orthogonality (∂ θ /∂θ )T (∂ θ /∂θ ) = d τ + τ = 0, so d =−τ / τ and, from i k+1 i  i i i i  i i =0 i =0 k k k k k (A1) and (A2), θ = τ μ / τ , W = Z τ and W = τ Z − τ Z τ / τ k+1 i i i k+1 i i i i i i  i i  i i=0 i=0 i=0 i =0 i =0 (i = 1, ... , k),so Z is complete and sufficient. With no selection, i.e., s = k, Z and Z − Z are the usual uniformly minimum variance unbiased r i i 0 estimators for μ and θ (i = 1, ... , k). By the factorization theorem, sufficient statistics are the same for i i the conditional and unconditional likelihoods, so Z is also sufficient in this case. REFERENCES BARKER, K. L., JAVAID, M. K., NEWMAN, M., LOWE, C. M., STALLARD, N., CAMPBELL, H., GANDHI, V.&LAMB,S. (2014). Physiotherapy rehabilitation for osteoporotic vertebral fracture (PROVE): Study protocol for a randomised controlled trial. Trials 15, 22. BAUER, P., KÖNIG, F., BRANNATH,W. & POSCH, M. (2010). Selection and bias — two hostile brothers. Statist. Med. 29, 1–13. BOWDEN,J.&GLIMM, E. (2008). Unbiased estimation of selected means in two-stage trials. Biomet. J. 50, 515–27. BOWDEN,J.&GLIMM, E. (2014). Conditionally unbiased and near unbiased estimation of the selected treatment mean for multistage drop-the-loser trials. Biomet. J. 56, 332–49. COHEN, A.&SACKROWITZ, H. B. (1989). Two stage conditionally unbiased estimators of the selected mean. Statist. Prob. Lett. 8, 273–8. COX,D.&REID, N. (1987). Parameter orthogonality and approximate conditional inference (with Discussion). J. R. Statist. Soc. B 49, 1–39. EFRON, B. (2014). Estimation and accuracy after model selection. J. Am. Statist. Assoc. 109, 991–1007. KIMANI, P., TODD,S.&STALLARD, N. (2013). Conditionally unbiased estimation in phase II/III clinical trials with early stopping for futility. Statist. Med. 32, 2893–910. LEHMANN,E.&ROMANO, J. (2005). Testing Statistical Hypotheses. New York: Springer, 3rd ed. MACARTHUR, R. D., HAWKINS, T. N., BROWN, S. J., LAMARCA, A., CLAY, P. G., BARRETT, A. C., BORTEY, E., PATERSON, C., GOLDEN,P.L.&FORBES, W. P. (2013). Efficacy and safery of crofelemer for noninfectious diarrhea in HIV- seropositive individuals (ADVENT trial): A randomized, double-blind, placebo-controlled, two-stage study. HIV Clin. Trials 14, 261–73. MAGIRR, D., JAKI,T. & WHITEHEAD, J. (2012). A generalised Dunnett test for multi-arm, multi-stage clinical studies with treatment selection. Biometrika 99, 494–501. POSCH, M., KOENIG, F., BRANSON, M., BRANNATH, W., DUNGER-BALDAUF,C.&BAUER, P. (2005). Testing and estimation in flexible group sequential designs with adaptive treatment selection. Statist. Med. 24, 3697–714. ROBERTSON, D. S., PREVOST,A. T.&BOWDEN, J. (2016). Unbiased estimation in seamless phase II/III trials with unequal treatment effect variances and hypothesis-driven selection rules. Statist. Med. 35, 3907–22. ROSENBAUM, S. (1961). Moments of a truncated bivariate normal distribution. J. R. Statist. Soc. B 23, 405–8. STALLARD,N.&FRIEDE, T. (2008). A group-sequential design for clinical trials with treatment selection. Statist. Med. 27, 6209–27. STALLARD,N.& TODD, S. (2003). Sequential designs for phase III clinical trials incorporating treatment selection. Statist. Med. 22, 689–703. STALLARD, N., TODD,S.& WHITEHEAD, J. (2008). Estimation following selection of the largest of two normal means. J. Statist. Plan. Infer. 138, 1629–38. SYDES, M. R., PARMAR,M.K.B., MASON, M. D., CLARKE, N. W., AMOS, C., ANDERSON, J., DE BONO, J., DEARNALEY, D. P., DWYER, J., GREEN, C. et al. (2012). Flexible trial design in practice - stopping arms for lack-of-benefit and adding research arms mid-trial in STAMPEDE: A multi-arm multi-stage randomized controlled trial. Trials 13, THALL,P.F., SIMON,R.&ELLENBERG, S. S. (1988). Two-stage selection and testing designs for comparative clinical trials. Biometrika 75, 303–10. THALL,P.F., SIMON,R.&ELLENBERG, S. S. (1989). A two-stage design for choosing among several experimental treatments and a control in clinical trials. Biometrics 45, 537–47. TODD, S., WHITEHEAD,J.&FACEY, K. M. (1996). Point and interval estimation following a sequential clinical trial. Biometrika 83, 453–61. U.S. FOOD AND DRUG ADMINISTRATION (2012). Application Number 202292Orig1s000 Statistical Review. https://www.accessdata.fda.gov/drugsatfda_docs/nda/2012/202292Orig1s000StatR.pdf. Accessed 1 Feb 2017. WASON, J., STALLARD, N., BOWDEN,J.&JENNISON, C. (2017). A multi-stage drop-the-losers design for multi-arm clinical trials. Statist. Meth. Med. Res. 26, 508–24. [Received on 13 September 2017. Editorial decision on 7 January 2018]

Journal

BiometrikaOxford University Press

Published: Jun 1, 2018

There are no references for this article.