I wonder if any one in the list can please help me with this bootstrapping problem.
I have raw data, in sequence, of failures and success with treatment.
There were 5 failures and 30 successes.
The sequence of failures (F) and success (S) were as follows
SSSSSSSSSSSSSSSSSSSFSSSSSFSSFSSFFSS
For CUSUM calculations each success gets a score of +2/7 and each failure gets a score of -12/7
Using bootstrapping techniques (reordering the sequence 1000 times) I want to calculate the 95% confidence limits for CUSUM
Dear All
2 months back I had posted this bootstrapping problem for a clinical trial we were doing.
We have now developed a software using this for 'CUSUM (cumulative sum) limits' calculations.
The background
RCTs are the best way to study a new intervention. However they are very expensive (and so RCTs are often done by the pharmaceutical industry promoting the new intervention). We wondered if CUSUM as used in industry for quality control, can be used, at least initially, to check if a new intervention does more harm or more good than traditional treatment.
The Proof of Concept
I am attaching a small study as 'proof of concept' When we acquired the data for the study the software had not been developed. The pragmatic stopping rule we adopted while acquiring the data was to temporarily stop the trail (pending full development of the software) if the CUSUM with the new drug exceeded the overall rate of failure with the standard drug (if the CUSUM of failures with the new drug crossed the zero line).
I have uploaded the software at
http://jacob.puliyel.com/foresee/
This now allows the intervention to be compared to 'standard therapy' in real time (meaning a new CUSUM graph can be drawn with each new patient treated) and so the lag phase before an intervention is declared as 'causing more harm,' is minimized.
I would greatly appreciate any feed back (including negative feedback!) on this.
Regards
Jacob Puliyel
Dear Jacob,
Thank you for this interesting post. Some years ago I worked in QA/QC in a
clinical diagnostics environment, so read your proposal closely. Especially
since it seems you are proposing replacing RCT methodology with the CUSUM
approach.
From Page 7, "However, RCT's have inherent problems, especially in the
context of trials in children. According to Mc Culloch and colleagues, RCT's
require large samples, long duration, difficult blinding and are very
expensive10 and it is difficult to recruit cases11. Parents find the concept
of equipoise between trial drugs and the need for blinded randomization
difficult to understand." Are you suggesting that the CUSUM approach
improves on these conditions whilst remaining effective ?
I am suggesting that CUSUM may be a less expensive way to look at a new mode of treatment and compare it to standard therapy. Here of course the we lose the advantage of randomization, blinding etc that RCTs have, and we are looking only at historical controls.
In your latest posting you say, "The pragmatic stopping rule we adopted
while acquiring the data was to temporarily stop the trail (pending fullwith the new drug crossed the zero line)."
development of the software) if the CUSUM with the new drug exceeded the
overall rate of failure with the standard drug (if the CUSUM of failures
RCT's also employ stopping rules.
I'm finding it difficult to understand why the CUSUM approach with a
stopping rule is better than an RCT approach with a stopping rule. This is
worsened because I don't recognise that if "the new drug exceeded the
overall rate of failure with the standard drug" it is equivalent to "if the
CUSUM of failures with the new drug crossed the zero line". This might be
me, I might be rusty, but if for example the new drug scored -0.25, say,
over a number of occasions while the limit was -2, it would pass each time
whilst the CUSUM would steadily decline. I'm wary, in case your definition
of CUSUM (enclosed in your software) is different to mine.
I hope the explanation above answers the question. Ordinarily the trial with the study drug should be stopped when failures crosses the -2SD line.
Page 2:
"What this study adds
Nebulised hypertonic saline is at least as good as standard treatment with
nebulised Epinephrine." In standard equivalence trials, showing this
requires a larger sample size. Whilst open to a new idea, again I wonder if
the CUSUM approach is as valid as RCT, yet achieves this ?
The question of sample size for a valid CUSUM study needs to be looked at. I am not confident our sample size was any where near adequate but I thot it was enough to test the concept of using CUSUM for such comparisons.
Page 6:
"A Cochrane review of the use of Epinephrine found evidence that it was more
effective when used in the outpatient setting but no evidence of benefit
when used in inpatients when compared against either placebo or Salbutamol3.
"
Page 5:
". Nebulised bronchodilators like Salbutamol, Ipravent and Epinephrine have
been used by some in treatment of bronchiolitis. A Cochrane meta-analysis
has not found these drugs to be useful2"
Page 5:
". It is clear that it is for this temporary but perceptible relief of
symptoms that these drugs are used."
You require an active comparator, and Epinephrine seems commonly recognised
as such, but care is required as formal studies suggest it may not be
effective in the long term, in your setting (in patient). So to say that
your drug is at least as effective as Epinephrine might not prove much.
I agree completely and that is why we brought up all those Cochrane meta-analysis. So to say that
your drug is at least as effective as Epinephrine might not prove much. My question is - Can we use this CUSUM analysis tool, to compare a new drug against 'standard therapy'
I wondered if a crossover method might be used ? Or might this limit study
participants to being not too ill.
Analysis of RCT data allows for adjustments to be made (eg age). How could
this be achieved with CUSUM ?
Page 13: a strength of the study is early stoppage in real time if
necessary. What about false negatives ?
The stopping rule will be 'crossing of the -2SD lower line'. The bootstrapping process makes allowance for the random clustering of failures.
Table 1 goes from Score 0 to 3, but at the bottom refers briefly to scores 4
to 9.
There are 5 rows and each row has a maximum score of 3. The highest score one can achieve is 15
Figure 2 at patient 3 shows a sharp drop in the bootstrapping lines, in 8
out of 10, yet no alteration in gradient is seen in the blue, CUSUM
graph....seems strange ?
The blue line is the actual sequence with the standard drug. 10 random reodering of the data is also shown.
Re-reading this, I haven't said much positive about the paper.....sorry.
I
like the idea, and would appreciate in words and diagram how you translate
your data into CUSUM, using a real example. When I used it, I would have
found it straightforward to do so (what does bootstrapping achieve here ? ~
this question probably reflects more on me than the paper !)
You write What does bootstrapping achieve here?