JTCS Sign the Guestbook
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lauer, M. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lauer, M. S.
Related Collections
Right arrow Education
Right arrowRelated Articles

J Thorac Cardiovasc Surg 2006;132:249-251
© 2006 The American Association for Thoracic Surgery


Statistics for the Rest of Us

Believability of clinical trials: A diagnostic testing perspective

Michael S. Lauer, MD, FACC, FAHA *

Departments of Medicine and Epidemiology and Biostatistics, Lerner Medical College of Case Western Reserve University, and the Department of Cardiovascular Medicine, The Cleveland Clinic Foundation, Cleveland, Ohio

Received for publication November 21, 2005; accepted for publication March 3, 2006.

* Address for reprints: Michael S. Lauer, MD, FACC, FAHA, The Cleveland Clinic Foundation, 9500 Euclid Ave/Desk JJ40, Cleveland, OH 44122 (Email: Lauerm{at}ccf.org).

GoClinical trials are the medical community's diagnostic tests. When a physician sends a patient for a conventional diagnostic test, he or she is asking whether a certain disease is likely to be present. Similarly, when the medical community designs and executes a clinical trial, it wants to know whether a certain treatment is likely to work. And just like a diagnostic test, a clinical trial has the potential for yielding an incorrect, erroneous result.

The traditional approach to reporting probability of error in clinical trials includes P values and statements of power. The P value is the probability that a positive trial result is just the result of chance, given an assumed truth of the null hypothesis, which is that the treatment does not work. Power refers to the ability of a trial to show that a treatment works, given that in truth it actually does. Although presenting P values and statements of power has become lore in reporting clinical trials, careful reflection shows that clinicians might not gain as much value from them as they might think.

When a clinician reads a diagnostic test report that is "positive," the next question is, "How likely is it that my patient has the disease?" That is, what is the likelihood that this positive report is in fact correct? As described by Bayesian theory, the likelihood of a correct result given a positive test result, or positive predictive value, is related not only to the sensitivity and specificity of the test but also to the pretest likelihood of disease being present. 1 Go

Similarly, when a clinical trial report is presented to the medical community suggesting that a treatment works, the critical question that clinicians should ask is, "Is the trial likely to be reporting a correct result?" What is the positive predictive value of the trial? To estimate this, we need to know not only the P value and the power but also something about the pretrial likelihood of a positive result, as recently argued by Ioannidis 2 Go and O'brien and Castelloe. 3 Go

Bayesian thinking applied to clinical trials is shown schematically in Table 1 (taken from Ioannides 2 Go), which shows an example of an experimental treatment in which the pretrial likelihood of success is considered to be low (ie, 10%). This might be because we are testing a new and innovative treatment for which there is little prior experience. We imagine a universe in which 1000 trials are performed, one of which happens to be the one reported in the journal resting between our hands. Each trial is small and has a power of only 50% because funding is limited and single centers have few eligible patients. Traditionally, a trial result would be considered positive if the P value is .05 or less. Given the pretrial expectation that 100 of 1000 trial results will be positive and given a power of 50%, one would expect 50 positive trial results, which is shown in the upper left hand box of Table 1. Conversely, among the 900 predicted negative trial results supporting the null hypothesis, we would expect 45 positive trial results, given a P value of .05. Thus of 1000 trials in this imaginary universe, we predict 95 positive trial results, of which 50 are true-positive results and 45 are false-positive results. The positive predictive value for any of the positive trial results turns out to be only 52%.


View this table:
[in this window]
[in a new window]
 
TABLE 1. Application of the Bayesian approach to clinical trials
 
Now let us put ourselves in the position of a clinician reading a positive trial result on this treatment in a medical journal. We might be concerned about low power, but the trial result was positive because the reported P value is .05, which sounds impressive. However, when considering the low pretrial likelihood of a positive result, we must conclude that there is a 48% chance that this positive trial result is actually wrong!

Although this hypothetical example might seem far-fetched, in fact, most medical literature is plagued by problems with low pretest probability and underpowered samples. As summarized by Ioannidis, 2 Go much of what is published in the medical literature might actually be false.

And the problem worsens.

A reported positive trial result might be incorrect not only because of a low pretrial likelihood but also because of bias. Bias refers to a fundamental problem in the way the trial is designed, analyzed, or reported that results in its result being reported as positive when in fact it is not. There are many well-known sources of bias. As mentioned by Tiruvoipati and colleagues, 4 Go bias can result from a lack of unpredictable allocation, inadequate concealment of allocation from investigators, inadequate blinding, inadequate sample size, and failure to use the intent-to-treat method for analyzing results.

Yet another problem arises from performing multiple trials of the same treatment. 2 Go Although it seems to be scientifically intuitive that one wants to reproduce results, performance of multiple trials by multiple groups leads to an increased risk of false-positive findings because of a standard, multiple testing problem. 5,6 Go When one puts together low pretest likelihood, bias, and multiple trials, one is left with a high likelihood of false-positive trial results (Figures 1 and 2). Go


Figure 1
View larger version (21K):
[in this window]
[in a new window]
 
Figure 1. Hypothetical plot showing the probability of a positive trial result reflecting truth (posttrial probability) according to pretrial probability and study power. A low ß value of 0.1 (topmost curve) corresponds to 90% power, whereas a high ß value of 0.7 (bottom curve) corresponds to 30% power; intermediate curves show a ß value of 0.3 (70% power) and a ß value of 0.5 (50% power). Posttrial probability increases as pretrial probability and power increase.

 

Figure 2
View larger version (21K):
[in this window]
[in a new window]
 
Figure 2. Hypothetical plot showing the probability of a positive trial result reflecting truth (posttrial probability) according to pretrial probability and degree of bias. N is the number of trials performed, in this case 4. The {alpha} value is the P value for significance, in this case .05. Also, in this case power is high at 90%. Bias is represented by µ, with values of 0 and 0.1 corresponding to little bias (in design, analysis, and reporting) and 0.4 corresponding to a substantial degree of bias (intermediate curve, µ = 0.2). Note the dramatic decrease in posttrial probability as bias increases and pretrial probability decreases. This type of scenario might be common for small trials of highly innovative treatments.

 
What can the medical community do to ensure that clinical trials that are reported in the medical literature are in fact believable? First, with rare exceptions, trials that are published in high-impact journals and that are likely to affect clinical practice should be those in which there is a reasonably high pretrial probability of success. Exceptions should be explicitly categorized by labeling them, for example, as "preliminary communications." 7 Go Trials should have high power and little bias. I therefore agree with Tiruvoipati and colleague's call 4 Go for widespread adoption of the CONSORT statement 8 Go for reporting trials in cardiac surgery.

Tiruvoipati and colleagues 4 Go are to be congratulated for their systematic and critical evaluation of the cardiac surgery literature and for their challenge to its journal editors. Routine incorporation of the CONSORT guidelines, as exemplified by 5 recently published cardiac surgery trials, 9-13 Go will improve the transparency of reporting and decrease the likelihood of bias, 14 Go thereby increasing the predictive value of one of the medical community's most precious diagnostic tests, the randomized trial.


See related articles on pages 229, 233, 241, 243, 245, and 247.

 


    References
 Top
 References
 

  1. Fletcher RW, Fletcher SW. Clinical epidemiology. the essentials. 4th ed.. Philadelphia: Lippincot, Williams, and Wilkins; 2005.
  2. Ioannidis JP. Why most published research findings are false. PLoS Med 2005;2:e124.[Medline]
  3. O'brien RG, Castelloe JM. Sample size analysis in study planning. concepts and issues, with examples using PROC POWER and PROC GLMPOWER. 2004Paper presented at: Proceedings of the 29th SAS Users Group International Conference; Montreal, Quebec, Canada.
  4. Tiruvoipati R, Balasubramanian SP, Atturu G, Peek GJ, Elbourne D. Improving the quality of reporting randomized controlled trials in cardiothoracic surgery—the way forward. J Thorac Cardiovasc Surg 2006;132:233-240.[Abstract/Free Full Text]
  5. Dmitrienko A, Offen WW, Westfall PH. Gatekeeping strategies for clinical trials that do not require all primary effects to be significant. Stat Med 2003;22:2387-2400.[Medline]
  6. Gong J, Pinheiro JC, DeMets DL. Estimating significance level and power comparisons for testing multiple endpoints in clinical trials. Control Clin Trials 2000;21:313-329.[Medline]
  7. Nissen SE, Tsunoda T, Tuzcu EM, et al. Effect of recombinant ApoA-I Milano on coronary atherosclerosis in patients with acute coronary syndromes. a randomized controlled trial. JAMA 2003;290:2292-2300.[Abstract/Free Full Text]
  8. Moher D, Schulz KF, Altman D. The CONSORT statement. revised recommendations for improving the quality of reports of parallel-group randomized trials. JAMA 2001;285:1987-1991.[Abstract/Free Full Text]
  9. Alexander JH, Hafley G, Harrington RA, Peterson ED, Ferguson TB, Lorenz TJ, et al. Efficacy and safety of edifoligide, an E2F transcription factor decoy, for prevention of vein graft failure following coronary artery bypass graft surgery: PREVENT IV: a randomized controlled trial. JAMA 2005;294:2446-2454.[Abstract/Free Full Text]
  10. Doukas G, Samani NJ, Alexiou C, et al. Left atrial radiofrequency ablation during mitral valve surgery for continuous atrial fibrillation. a randomized controlled trial. JAMA 2005;294:2323-2329.[Abstract/Free Full Text]
  11. Puskas JD, Williams WH, Mahoney EM, et al. Off-pump vs conventional coronary artery bypass grafting: early and 1-year graft patency, cost, and quality-of-life outcomes: a randomized trial. JAMA 2004;291:1841-1849.[Abstract/Free Full Text]
  12. Verrier ED, Shernan SK, Taylor KM, et al. Terminal complement blockade with pexelizumab during coronary artery bypass graft surgery requiring cardiopulmonary bypass. a randomized trial. JAMA 2004;291:2319-2327.[Abstract/Free Full Text]
  13. Mitchell LB, Exner DV, Wyse DG, et al. Prophylactic oral amiodarone for the prevention of arrhythmias that begin early after revascularization, valve replacement, or repair: PAPABEAR: a randomized controlled trial. JAMA 2005;294:3093-3100.[Abstract/Free Full Text]
  14. Moher D, Jones A, Lepage L. Use of the CONSORT statement and quality of reports of randomized trials. a comparative before-and-after evaluation. JAMA 2001;285:1992-1995.[Abstract/Free Full Text]

Related Articles

CONSORT and beyond
Eugene H. Blackstone
J. Thorac. Cardiovasc. Surg. 2006 132: 229-232. [Extract] [Full Text] [PDF]

Improving the quality of reporting randomized controlled trials in cardiothoracic surgery: The way forward
Ravindranath Tiruvoipati, Sabapathy P. Balasubramanian, Gnaneswar Atturu, Giles J. Peek, and Diana Elbourne
J. Thorac. Cardiovasc. Surg. 2006 132: 233-240. [Abstract] [Full Text] [PDF]

Randomized clinical trials in surgery: Why do we need them?
Charles M. Balch
J. Thorac. Cardiovasc. Surg. 2006 132: 241-242. [Extract] [Full Text] [PDF]

Randomized controlled trials in surgery: Comic opera no more?
Neal F. Kassell and Aaron S. Dumont
J. Thorac. Cardiovasc. Surg. 2006 132: 243-244. [Extract] [Full Text] [PDF]

Reports of clinical trials: Ethical aspects
Robert M. Sade
J. Thorac. Cardiovasc. Surg. 2006 132: 245-246. [Extract] [Full Text] [PDF]

Improving the quality of clinical trials in surgery
Steven Piantadosi
J. Thorac. Cardiovasc. Surg. 2006 132: 247-248. [Extract] [Full Text] [PDF]



This article has been cited by other articles:


Home page
Ann. Thorac. Surg.Home page
G. L. Grunkemeier, Y. Wu, and A. P. Furnary
What is the value of a p value?
Ann. Thorac. Surg., May 1, 2009; 87(5): 1337 - 1343.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lauer, M. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lauer, M. S.
Related Collections
Right arrow Education
Right arrowRelated Articles


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ANN THORAC SURG ASIAN CARDIOVASC THORAC ANN EUR J CARDIOTHORAC SURG
J THORAC CARDIOVASC SURG ICVTS ALL CTSNet JOURNALS