zoukankan html css js c++ java

BAYESIAN STATISTICS AND CLINICAL TRIAL CONCLUSIONS: WHY THE OPTIMSE STUDY SHOULD BE CONSIDERED POSITIVE（转）

Statistical approaches to randomised controlled trial analysis

The statistical approach used in the design and analysis of the vast majority of clinical studies is often referred to as classical or frequentist. Conclusions are made on the results of hypothesis tests with generation of p-values and confidence intervals, and require that the correct conclusion be drawn with a high probability among a notional set of repetitions of the trial.

Bayesian inference is an alternative, which treats conclusions probabilistically and provides a different framework for thinking about trial design and conclusions. There are many differences between the two, but for this discussion there are two obvious distinctions with the Bayesian approach. The first is that prior knowledge can be accounted for to a greater or lesser extent, something life scientists sometimes have difficulty reconciling. Secondly, the conclusions of a Bayesian analysis often focus on the decision that requires to be made, e.g. should this new treatment be used or not.

There are pros and cons to both sides, nicely discussed here, but I would argue that the results of frequentist analyses are too often accepted with insufficient criticism. Here’s a good example.

OPTIMSE: Optimisation of Cardiovascular Management to Improve Surgical Outcome

Optimising the amount of blood being pumped out of the heart during surgery may improve patient outcomes. By specifically measuring cardiac output in the operating theatre and using it to guide intravenous fluid administration and the use of drugs acting on the circulation, the amount of oxygen that is delivered to tissues can be increased.

It sounds like common sense that this would be a good thing, but drugs can have negative effects, as can giving too much intravenous fluid. There are also costs involved, is the effort worth it? Small trials have suggested that cardiac output-guided therapy may have benefits, but the conclusion of a large Cochrane review was that the results remain uncertain.

A well designed and run multi-centre randomised controlled trial was performed to try and determine if this intervention was of benefit (OPTIMSE: Optimisation of Cardiovascular Management to Improve Surgical Outcome).

Patients were randomised to a cardiac output–guided hemodynamic therapy algorithm for intravenous fluid and a drug to increase heart muscle contraction (the inotrope, dopexamine) during and 6 hours following surgery (intervention group) or to usual care (control group).

The primary outcome measure was the relative risk (RR) of a composite of 30-day moderate or major complications and mortality.

OPTIMSE: reported results

Focusing on the primary outcome measure, there were 158/364 (43.3%) and 134/366 (36.6%) patients with complication/mortality in the control and intervention group respectively. Numerically at least, the results appear better in the intervention group compared with controls.

Using the standard statistical approach, the relative risk (95% confidence interval) = 0.84 (0.70-1.01), p=0.07 and absolute risk difference = 6.8% (−0.3% to 13.9%), p=0.07. This is interpreted as there being insufficient evidence that the relative risk for complication/death is different to 1.0 (all analyses replicated below). The authors reasonably concluded that:

In a randomized trial of high-risk patients undergoing major gastrointestinal surgery, use of a cardiac output–guided hemodynamic therapy algorithm compared with usual care did not reduce a composite outcome of complications and 30-day mortality.

A difference does exist between the groups, but is not judged to be a sufficient difference using this conventional approach.

OPTIMSE: Bayesian analysis

Repeating the same analysis using Bayesian inference provides an alternative way to think about this result. What are the chances the two groups actually do have different results? What are the chances that the two groups have clinically meaningful differences in results? What proportion of patients stand to benefit from the new intervention compared with usual care?

With regard to prior knowledge, this analysis will not presume any prior information. This makes the point that prior information is not always necessary to draw a robust conclusion. It may be very reasonable to use results from pre-existing meta-analyses to specify a weak prior, but this has not been done here. Very grateful to John Kruschke for the excellent scripts and book, Doing Bayesian Data Analysis.

The results of the analysis are presented in the graph below. The top panel is the prior distribution. All proportions for the composite outcome in both the control and intervention group are treated as equally likely.

The middle panel contains the main findings. This is the posterior distribution generated in the analysis for the relative risk of the composite primary outcome (technical details in script below).

The mean relative risk = 0.84 which as expected is the same as the frequentist analysis above. Rather than confidence intervals, in Bayesian statistics a credible interval or region is quoted (HDI = highest density interval is the same). This is philosphically different to a confidence interval and says:

Given the observed data, there is a 95% probability that the true RR falls within this credible interval.

This is a subtle distinction to the frequentist interpretation of a confidence interval:

Were I to repeat this trial multiple times and compute confidence intervals, there is a 95% probability that the true RR would fall within these confidence intervals.

This is an important distinction and can be extended to make useful probabilistic statements about the result.

The figures in green give us the proportion of the distribution above and below 1.0. We can therefore say:

The probability that the intervention group has a lower incidence of the composite endpoint is 97.3%.

It may be useful to be more specific about the size of difference between the control and treatment group that would be considered equivalent, e.g. 10% above and below a relative risk = 1.0. This is sometimes called the region of practical equivalence (ROPE; red text on plots). Experts would determine what was considered equivalent based on many factors. We could therefore say:

The probability of the composite end-point for the control and intervention group being equivalent is 22%.

Or, the probability of a clinically relevant difference existing in the composite endpoint between control and intervention groups is 78%

Finally, we can use the 200 000 estimates of the probability of complication/death in the control and intervention groups that were generated in the analysis (posterior prediction). In essence, we can act like these are 2 x 200 000 patients. For each “patient pair”, we can use their probability estimates and perform a random draw to simulate the occurrence of complication/death. It may be useful then to look at the proportion of “patients pairs” where the intervention patient didn’t have a complication but the control patient did:

Using posterior prediction on the generated Bayesian model, the probability that a patient in the intervention group did not have a complication/death when a patient in the control group did have a complication/death is 28%.

Conclusion

On the basis of a standard statistical analysis, the OPTIMISE trial authors reasonably concluded that the use of the intervention compared with usual care did not reduce a composite outcome of complications and 30-day mortality.

Using a Bayesian approach, it could be concluded with 97.3% certainty that use of the intervention compared with usual care reduces the composite outcome of complications and 30-day mortality; that with 78% certainty, this reduction is clinically significant; and that in 28% of patients where the intervention is used rather than usual care, complication or death may be avoided.

  1 # OPTIMISE trial in a Bayesian framework
  2 # JAMA. 2014;311(21):2181-2190. doi:10.1001/jama.2014.5305
  3 # Ewen Harrison
  4 # 15/02/2015
  5  
  6 # Primary outcome: composite of 30-day moderate or major complications and mortality
  7 N1 <- 366
  8 y1 <- 134
  9 N2 <- 364
 10 y2 <- 158
 11 # N1 is total number in the Cardiac Output–Guided Hemodynamic Therapy Algorithm (intervention) group
 12 # y1 is number with the outcome in the Cardiac Output–Guided Hemodynamic Therapy Algorithm (intervention) group
 13 # N2 is total number in usual care (control) group
 14 # y2 is number with the outcome in usual care (control) group
 15  
 16 # Risk ratio
 17 (y1/N1)/(y2/N2)
 18  
 19 library(epitools)
 20 riskratio(c(N1-y1, y1, N2-y2, y2), rev="rows", method="boot", replicates=100000)
 21  
 22 # Using standard frequentist approach
 23 # Risk ratio (bootstrapped 95% confidence intervals) = 0.84 (0.70-1.01)
 24 # p=0.07 (Fisher exact p-value)
 25  
 26 # Reasonably reported as no difference between groups.
 27  
 28 # But there is a difference, it just not judged significant using conventional
 29 # (and much criticised) wisdom.
 30  
 31 # Bayesian analysis of same ratio
 32 # Base script from John Krushcke, Doing Bayesian Analysis
 33  
 34 #------------------------------------------------------------------------------
 35 source("~/Doing_Bayesian_Analysis/openGraphSaveGraph.R")
 36 source("~/Doing_Bayesian_Analysis/plotPost.R")
 37 require(rjags) # Kruschke, J. K. (2011). Doing Bayesian Data Analysis, Academic Press / Elsevier.
 38 #------------------------------------------------------------------------------
 39 # Important
 40 # The model will be specified with completely uninformative prior distributions (beta(1,1,).
 41 # This presupposes that no pre-exisiting knowledge exists as to whehther a difference
 42 # may of may not exist between these two intervention.
 43  
 44 # Plot Beta(1,1)
 45 # 3x1 plots
 46 par(mfrow=c(3,1))
 47 # Adjust size of prior plot
 48 par(mar=c(5.1,7,4.1,7))
 49 plot(seq(0, 1, length.out=100), dbeta(seq(0, 1, length.out=100), 1, 1),
 50 type="l", xlab="Proportion",
 51 ylab="Probability",
 52 main="OPTIMSE Composite Primary Outcome
Prior distribution",
 53 frame=FALSE, col="red", oma=c(6,6,6,6))
 54 legend("topright", legend="beta(1,1)", lty=1, col="red", inset=0.05)
 55  
 56 # THE MODEL.
 57 modelString = "
 58 # JAGS model specification begins here...
 59 model {
 60 # Likelihood. Each complication/death is Bernoulli.
 61 for ( i in 1 : N1 ) { y1[i] ~ dbern( theta1 ) }
 62 for ( i in 1 : N2 ) { y2[i] ~ dbern( theta2 ) }
 63 # Prior. Independent beta distributions.
 64 theta1 ~ dbeta( 1 , 1 )
 65 theta2 ~ dbeta( 1 , 1 )
 66 }
 67 # ... end JAGS model specification
 68 " # close quote for modelstring
 69  
 70 # Write the modelString to a file, using R commands:
 71 writeLines(modelString,con="model.txt")
 72  
 73  
 74 #------------------------------------------------------------------------------
 75 # THE DATA.
 76  
 77 # Specify the data in a form that is compatible with JAGS model, as a list:
 78 dataList = list(
 79 N1 = N1 ,
 80 y1 = c(rep(1, y1), rep(0, N1-y1)),
 81 N2 = N2 ,
 82 y2 = c(rep(1, y2), rep(0, N2-y2))
 83 )
 84  
 85 #------------------------------------------------------------------------------
 86 # INTIALIZE THE CHAIN.
 87  
 88 # Can be done automatically in jags.model() by commenting out inits argument.
 89 # Otherwise could be established as:
 90 # initsList = list( theta1 = sum(dataList$y1)/length(dataList$y1) ,
 91 #                   theta2 = sum(dataList$y2)/length(dataList$y2) )
 92  
 93 #------------------------------------------------------------------------------
 94 # RUN THE CHAINS.
 95  
 96 parameters = c( "theta1" , "theta2" )     # The parameter(s) to be monitored.
 97 adaptSteps = 500              # Number of steps to "tune" the samplers.
 98 burnInSteps = 1000            # Number of steps to "burn-in" the samplers.
 99 nChains = 3                   # Number of chains to run.
100 numSavedSteps=200000           # Total number of steps in chains to save.
101 thinSteps=1                   # Number of steps to "thin" (1=keep every step).
102 nIter = ceiling( ( numSavedSteps * thinSteps ) / nChains ) # Steps per chain.
103 # Create, initialize, and adapt the model:
104 jagsModel = jags.model( "model.txt" , data=dataList , # inits=initsList ,
105 n.chains=nChains , n.adapt=adaptSteps )
106 # Burn-in:
107 cat( "Burning in the MCMC chain...
" )
108 update( jagsModel , n.iter=burnInSteps )
109 # The saved MCMC chain:
110 cat( "Sampling final MCMC chain...
" )
111 codaSamples = coda.samples( jagsModel , variable.names=parameters ,
112 n.iter=nIter , thin=thinSteps )
113 # resulting codaSamples object has these indices:
114 #   codaSamples[[ chainIdx ]][ stepIdx , paramIdx ]
115  
116 #------------------------------------------------------------------------------
117 # EXAMINE THE RESULTS.
118  
119 # Convert coda-object codaSamples to matrix object for easier handling.
120 # But note that this concatenates the different chains into one long chain.
121 # Result is mcmcChain[ stepIdx , paramIdx ]
122 mcmcChain = as.matrix( codaSamples )
123  
124 theta1Sample = mcmcChain[,"theta1"] # Put sampled values in a vector.
125 theta2Sample = mcmcChain[,"theta2"] # Put sampled values in a vector.
126  
127 # Plot the chains (trajectory of the last 500 sampled values).
128 par( pty="s" )
129 chainlength=NROW(mcmcChain)
130 plot( theta1Sample[(chainlength-500):chainlength] ,
131 theta2Sample[(chainlength-500):chainlength] , type = "o" ,
132 xlim = c(0,1) , xlab = bquote(theta[1]) , ylim = c(0,1) ,
133 ylab = bquote(theta[2]) , main="JAGS Result" , col="skyblue" )
134  
135 # Display means in plot.
136 theta1mean = mean(theta1Sample)
137 theta2mean = mean(theta2Sample)
138 if (theta1mean > .5) { xpos = 0.0 ; xadj = 0.0
139 } else { xpos = 1.0 ; xadj = 1.0 }
140 if (theta2mean > .5) { ypos = 0.0 ; yadj = 0.0
141 } else { ypos = 1.0 ; yadj = 1.0 }
142 text( xpos , ypos ,
143 bquote(
144 "M=" * .(signif(theta1mean,3)) * "," * .(signif(theta2mean,3))
145 ) ,adj=c(xadj,yadj) ,cex=1.5  )
146  
147 # Plot a histogram of the posterior differences of theta values.
148 thetaRR = theta1Sample / theta2Sample # Relative risk
149 thetaDiff = theta1Sample - theta2Sample # Absolute risk difference
150  
151 par(mar=c(5.1, 4.1, 4.1, 2.1))
152 plotPost( thetaRR , xlab= expression(paste("Relative risk (", theta[1]/theta[2], ")")) ,
153 compVal=1.0, ROPE=c(0.9, 1.1),
154 main="OPTIMSE Composite Primary Outcome
Posterior distribution of relative risk")
155 plotPost( thetaDiff , xlab=expression(paste("Absolute risk difference (", theta[1]-theta[2], ")")) ,
156 compVal=0.0, ROPE=c(-0.05, 0.05),
157 main="OPTIMSE Composite Primary Outcome
Posterior distribution of absolute risk difference")
158  
159 #-----------------------------------------------------------------------------
160 # Use posterior prediction to determine proportion of cases in which
161 # using the intervention would result in no complication/death
162 # while not using the intervention would result in complication death
163  
164 chainLength = length( theta1Sample )
165  
166 # Create matrix to hold results of simulated patients:
167 yPred = matrix( NA , nrow=2 , ncol=chainLength )
168  
169 # For each step in chain, use posterior prediction to determine outcome
170 for ( stepIdx in 1:chainLength ) { # step through the chain
171 # Probability for complication/death for each "patient" in intervention group:
172 pDeath1 = theta1Sample[stepIdx]
173 # Simulated outcome for each intervention "patient"
174 yPred[1,stepIdx] = sample( x=c(0,1), prob=c(1-pDeath1,pDeath1), size=1 )
175 # Probability for complication/death for each "patient" in control group:
176 pDeath2 = theta2Sample[stepIdx]
177 # Simulated outcome for each control "patient"
178 yPred[2,stepIdx] = sample( x=c(0,1), prob=c(1-pDeath2,pDeath2), size=1 )
179 }
180  
181 # Now determine the proportion of times that the intervention group has no complication/death
182 # (y1 == 0) and the control group does have a complication or death (y2 == 1))
183 (pY1eq0andY2eq1 = sum( yPred[1,]==0 & yPred[2,]==1 ) / chainLength)
184 (pY1eq1andY2eq0 = sum( yPred[1,]==1 & yPred[2,]==0 ) / chainLength)
185 (pY1eq0andY2eq0 = sum( yPred[1,]==0 & yPred[2,]==0 ) / chainLength)
186 (pY10eq1andY2eq1 = sum( yPred[1,]==1 & yPred[2,]==1 ) / chainLength)
187  
188 # Conclusion: in 27% of cases based on these probabilities,
189 # a patient in the intervention group would not have a complication,
190 # when a patient in control group did.

转自：http://www.datasurg.net/2015/02/16/bayesian-statistics-and-clinical-trial-conclusions-why-the-optimse-study-should-be-considered-positive/

---------------------------------------------------------------------------------- 数据和特征决定了效果上限，模型和算法决定了逼近这个上限的程度 ----------------------------------------------------------------------------------

查看全文

相关阅读:
Codeforces Round #256 (Div. 2/B)/Codeforces448B_Suffix Structures(字符串处理)
【android】优秀的UI资源站点集合
 升级iOS8系统后，保险箱Pro、私人保险箱、私密相冊打开就闪退的官方解决方式
 js产生随机数
 java实现各种数据统计图（柱形图，饼图，折线图）
Matlab画图-非常具体，非常全面
 Lucene教程具体解释
 NAND FLASH
Jenkins(二)
iOS 本地通知

原文地址：https://www.cnblogs.com/payton/p/4302632.html