|
|
||||||||
J Thorac Cardiovasc Surg 2001;121:454-464
© 2001 The American Association for Thoracic Surgery
General Thoracic Surgery |
From the Departments of Thoracic and Cardiovascular Surgery,a Biostatistics and Epidemiology,b Hematology and Medical Oncology,c Gastroenterology,d and Anatomic Pathology,e The Center for Swallowing and Esophageal Disorders, The Cleveland Clinic Foundation, Cleveland, Ohio.
Received for publication June 15, 2000. Revisions requested Sept 21, 2000; revisions received Oct 13, 2000. Accepted for publication Oct 24, 2000. Address for reprints: Thomas W. Rice, MD, The Cleveland Clinic Foundation, 9500 Euclid Ave, Desk F25, Cleveland, OH 44195 (E-mail: ricet{at}ccf.org).
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Patients clinically staged with cancers metastatic to regional lymph nodes (N1) have a poor prognosis if treated with surgery alone. It is known that patients who have no residual cancer after induction chemoradiation therapy have an improved survival. However, no study has clinically staged patients and quantified the benefit of downstaging from N1 to N0, regardless of primary tumor status. Therefore, the purpose of this study was to evaluate clinical staging and downstaging by induction chemoradiation therapy in patients with clinical N1 (cN1) esophageal carcinoma.
| Patients and methods |
|---|
|
|
|---|
|
The addition of groups 5 through 8 was the strategy used to quantify and adjust for clinical staging inaccuracies.
Of the 293 patients, 240 (82%) were men, 276 (94%) were white, and age ranged from 26 to 84 years (mean ± SD, 62 ± 11). A total of 232 (79%) patients had adenocarcinoma, 57 (19%) patients had squamous cell carcinoma, and 4 (1%) had adenosquamous carcinoma.
Staging
All patients underwent endoscopic esophageal ultrasound (EUS) before treatment to determine clinical stage. Clinical depth of tumor invasion (cT) and clinical regional lymph node status (cN) were recorded by means of the 1997 criteria of the American Joint Committee on Cancer Staging.
3 Ultrasound characteristics were primary determinants of cN and included nodal size, shape, border, internal echo characteristics, and proximity of the regional node to the primary tumor. Distant metastatic disease (cM1b) was identified by computed tomographic scanning.
Pathologic stage (pT and pN) was determined by examination of the resection specimen and regional lymph nodes that were either sampled or included in the lymphadenectomy specimen.
Induction therapy
Since 1991, patients have been offered induction therapy for cN1 and cT3 or cT4 cancers. Induction therapy included two courses of chemotherapy, either cisplatin and 5-fluorouracil
4 or cisplatin and taxol.
5 Concurrent with chemotherapy, accelerated fractionation radiation therapy (1.5 Gy bid to a total dose of 45 Gy) was administered in two courses over 2 months. Surgery was accomplished 4 to 6 weeks later. These studies were approved and reviewed annually by The Cleveland Clinic Foundation Institutional Review Board, and written informed consent was obtained from all patients before beginning treatment.
Surgery
Of 293 patients, 234 (80%) had esophagectomy with thoracotomy and 2-field lymphadenectomy, and 59 (20%) had transhiatal esophagectomy and lymph node sampling. Patients with cN1 and cT3 or cT4 cancers not receiving induction therapy underwent surgery as the primary treatment. Six patients had pT4 tumors, all infracarinal. Ninety-four percent had R0 resections.
Study end point
The end point of the study was all-cause mortality. Patients were followed up by periodic clinic visits; however, a cross-sectional systematic follow-up was made in December 1999. Mean follow-up was 26 ± 15 months (43 ± 35 months in patients alive at last follow-up).
Methodsdata analysis
Descriptive
Descriptive statistics are summarized as the mean and standard deviation for continuous variables and as frequencies and percentages for categorical variables.
Survival
Nonparametric estimates of survival are obtained by the method of Kaplan and Meier.
6 A parametric method was used to resolve the number of phases of instantaneous risk of death (hazard function) and to estimate their shaping parameters.
7 Estimates of survival were considered reliable to a maximum of 10 years.
So that the prognostic relevance of clinical staging could be established, a multivariable time-related analysis of survival was performed of clinical staging, demographic, and surgical experience variables (Appendix I), in the following manner. Initial screening of variables possibly related to survival used the log-rank test and the univariable Cox proportional hazards model. Continuous and ordinal variables were assessed univariably by decile risk analysis to suggest transformations of scale required to assure that the relationships of these measured variables were well calibrated to model of outcome.
Multivariable analysis was performed simultaneously for the two hazard phases identified from the data. We used a directed technique of entry of variables into the multivariable models.
8 A P = .1 criterion for retention of variables in the final models was used. Additionally, bootstrap resampling was used to validate the models.
9
Who will respond to induction therapy?
Among the 69 cN1 patients who received induction therapy, 37 were found to have pN0 at operation and are classified as "responders." Multivariable logistic regression analysis was performed of all pre-induction therapy variables (Appendix I) to identify factors that would predict pN0.
Cost and benefit of therapy
Two avenues of inquiry were taken to investigate the question, "Do patients receiving induction therapy survive as long as their counterparts who did not receive induction therapy?" If they do not, this would suggest a toxic effect of therapy, advancement of the disease during therapy, the biologic process of downstaging, or other therapy-related differences.
Matched analysis
Even in a randomized study, finding counterparts for such a comparison is impossible because pathologic stage is unknown at the time of randomization. Therefore, patients were multivariably propensity-matched for their demography, pathologic stage, operation, and date of operation (Appendix I). Matching factors associated with induction therapy were identified by means of multivariable logistic regression analyses and strategies for model building similar to the survival analysis described above. After these factors had been identified, statistically nonsignificant factors, including some components of pathologic stage, sex, race, cell type, degree of differentiation of the tumor, surgical approach, and date of operation were added to the analyses. The resulting logistic equation yielded a propensity score for each patient. Patients were then sorted according to their score. This revealed that about one third (tercile I) of the patients were unmatched (less than 10% chance of receiving induction therapy). In addition, the 19 patients downstaged by induction therapy to pT0/pN0 were unmatched. The remaining two terciles were well matched (Appendix Table I
). The effect of induction therapy on survival was determined within each tercile as a whole and separately for the pN1 and pN0 cohorts.
|
So that risk-adjusted estimates of the effect of being downstaged by induction therapy to pN0 could be obtained, the final multivariable model was solved for two 62-year-old patients, both pT2 N0 M0 at thoracotomy, one downstaged to that pathology by induction therapy. These estimates are compared with a third 62-year-old patient at thoracotomy with pN1 disease and three positive nodes. The cost of induction therapy in the patient downstaged to pN0 was estimated as the area between the survival curves for the two pN0 patients. The benefit of induction therapy was estimated as the area between the survival curve of the patient downstaged to pN0 and the patient with pN1 disease.
Differences of areas represent the reduced and added years of life within the 10 years after surgery contributed by downstaging. For reasons previously given, these differences in lifetime are presented with 90% confidence limits (CL).
10
Presentation
Confidence limits of survival estimates are equivalent to 1 standard error (68% CL). Tables of multivariable analyses are presented with their regression coefficients, standard errors, and P values, rather than an expression of relative risk. This is because the transformation of scale of continuous and ordinal variables makes conversion of regression coefficients to either odds ratios or hazard ratios uninterpretable. In addition, the hazard function models are not of proportional hazards, further reducing the interpretability of hazard ratios based solely on regression coefficients.
To obtain risk-adjusted estimates of the effect of N1 burden on 1- and 5-year survival, we solved the final multivariable equation for number of nodes from 1 to 10 with all other factors held constant: age 62 years, pN1 disease, no induction therapy, use of thoracotomy, and no metastatic disease.
| Results |
|---|
|
|
|---|
|
|
|
|
|
|
Cost and benefit of therapy
Matched analysis
In general, patients receiving induction therapy were younger, had N1 disease, and greater T. This was true for both clinical stage (Table IV) and pathologic stage (Table V). In well-matched patients, there was a small, consistent, nonsignificant difference in survival for similar pathologic stages in patients receiving induction therapy (Appendix Table IV
). Survival of patients in well-matched terciles II and III (Appendix Table I
) had similar pattern of survival (P = .4). In both terciles II and III, patients receiving induction therapy had a slightly poorer survival at any given point. In both terciles II and III, this pattern was seen in pN0 patients (P = .4 and .6) and pN1 patients (P = .8 and .2).
|
|
|
|
|
|
|
| Discussion |
|---|
|
|
|---|
Once tumors have metastasized to regional lymph nodes, survival is poor. Surgery in these patients is little more than palliation. The identification of N1 disease and downstaging with induction therapy to pN0 are possible. This is evident by the greater discordance of cN and pN in the induction therapy group compared with the surgery groups. The discordance (inaccuracy of staging) seen in the surgery group was 20%. This discordance rose to 45% in the induction therapy groups. However, this is a combination of staging inaccuracy and downstaging. Thus, about 25% to 30% of patients will be downstaged to N0 by induction therapy. Unfortunately, accurate identification of responders to induction therapy is not possible at present.
Survival
Regardless of treatment, patients with pN1 disease (groups 2, 3, 6, and 7) have poor survival. There is slight modulation of survival in these patients by N1 burden. In pN0 disease (groups 1, 4, 5, and 8), survival is intermediate to good and is substantially modulated by induction therapy, pT, and type of operation.
Cost and benefit of therapy
Induction therapy in pathologically matched patient groups produces a small difference in survival and in multivariable analysis a statistically significant difference. This difference is the result of multiple factors, including induction toxicity, the biologic process of downstaging that produces a lower stage that may not be equivalent to that same stage in a patient not receiving induction therapy (hysteresis effect of downstaging), possible progression of disease during therapy, and additional unknown factors.
The cost of induction therapy in responders is considerably outweighed by the survival benefit of conversion to pN0. In the future, if nonresponders can be identified, they can be spared the risks of induction therapy.
Implications
Previous studies of induction therapy have ignored clinical stage and randomized patients into surgery-only and multimodality arms, assuming that randomization will be equal across stages.
11-14 This study illustrates several problems with such trials. For patients with early-stage tumors and excellent survival, small studies would be underpowered to detect a therapeutic effect in this growing segment of patients. In addition, including such patients without accounting for stage may dilute the therapeutic effect of induction therapy. Another problem of randomized trials in this setting is that they may still be susceptible to the very selection bias they are designed to protect against.
15 Patients who are deemed to have either good or bad therapy risk may or may not be included at all in the trial. Further, as shown by this study, there is inaccuracy in clinical staging of about 20%. This is similar in magnitude to the proportion of patients who may respond to induction therapy. Both inaccuracy and lack of response will tend to dilute and hide the beneficial effect of induction therapy (Appendix II).
Presently we depend on clinical stage to direct treatment. The finding of cN1 predicts poor survival, and those patients who are suitable candidates are offered induction chemoradiation therapy and surgery in a protocol setting. Those patients who are understaged and found to have N1 disease at resection are offered postoperative adjuvant chemoradiation therapy. Although this treatment protocol is not perfect, it has allowed stratification of patients for both prognosis and evaluation of therapy. We strongly recommend that all patients with esophageal carcinoma be clinically staged and have their treatment modified accordingly.
| Limitations |
|---|
|
|
|---|
| Appendix I. Variables examined in multivariable analyses |
|---|
|
|
|---|
Pre-induction therapy (or preoperative) clinical staging
Each clinical depth of tumor invasion (cT) stage (0, Tis, 1-4), an ordinal variable representing these (1 = cT0, 2 = cTis, 3 = cT1, 4 = cT2, 5 = cT3, 6 = cT4), and a dichotomous variable for cT > 2; clinical N1 disease.
Pre-esophagectomy therapies
Previous esophagogastric operations
Preoperative induction therapy
Surgical approach
Thoracotomy versus transhiatal
Tumor characteristics
Cell type (adenocarcinoma, squamous cell, adenosquamous); each histologic grade of tumor differentiation and an ordinal variable representing differentiation (0 = well, 1 = moderately well, 2 = moderate, 3 = moderately poor, and 4 = poor); R0 resection; presence of distant metastases, either M1a or M1b.
Pathologic staging
Pathologic depth of tumor invasion (pT) expressed similar to clinical staging; pathologic N1 disease (pN1), number of nodes sampled, number of nodes positive, and percent of nodes sampled found to be positive (both are expressions of N1 burden); pathologic stage (pStage) (ordinal variable, where 0 = stage 0, 1 = stage I, 2 = stage IIA, 3 = stage IIB, 4 = stage III, 5 = stage IVA, 6 = stage IVB).
Downstaging
Difference in T, difference in N.
Postoperative
Use of postoperative chemotherapy and/or radiotherapy
Surgical experience
Date of operation, expressed continuously as years between January 1, 1985, and the date of esophagectomy
Note: All dichotomous variables were coded as 0 for absence of the factor and 1 for presence of the factor. Thus, the factor pN1 disease (Table VI
) is 0 for pN0 disease and 1 for pN1 disease.
| Appendix II: Simulation of randomized trial of induction therapy |
|---|
|
|
|---|
Discussion
Dr Richard Whyte (Stanford, Calif). The objective was to determine the relevance of clinical staging and the importance of downstaging. The results, not surprisingly, showed that the earlier stage patients did the best, the most advanced stage patients did the worst, and the patients who received induction therapy were somewhere in between. Furthermore, of those that received induction therapy, patients with N1 disease did worse than those with N0 disease. This, I thought, was fairly straightforward. The manuscript, on the other hand, was anything but straightforward. Somehow the 1-page abstract was transformed into 44 pages, including 9 tables, 6 figures, 2 appendixes, and enough statistics, including P values, adjusted coefficients, confidence limits, terciles, propensity matching, and mathematical modeling to make my head spin. To be fair, however, much of the text and several of the figures are contained in two appendixes, which detail the complex statistical methods used to analyze the data. Nonetheless, I did indeed find the statistics a challenge. I would like to thank both Craig Miller, my colleague, for convincing me that the statistics are both appropriate and, as he terms it, "the wave of the future," and Tom Rice for a quick tutorial on these statistics on Thursday morning. Fortunately, Tom has chosen not to present all of the statistics today but instead has chosen two key features: the hazard functions and a simulated survival curve of a hypothetical patient based on his mathematical model. This model allows one to perform a simulated randomized trial of two treatment modes. While the model takes into account many of the quantitative aspects of patient selection, it fails to account for some of the more qualitative issues such as patient bias, physician bias, and patient performance status. To borrow from Dr Matloff's presentation, if one were to compare clinical trials to government, I would submit that a randomized prospective study is like democracy: it is not perfect but it is the best system we have.
I have 3 questions for Dr Rice. First, numerous other trials of neoadjuvant therapy for esophageal cancer have shown that complete pathologic response including both tumor and nodal response is a predictor of better outcome. Why did this study instead ignore tumor response and use an assumption of nodal response? I say assumption since there is generally no pretreatment proof of N1 disease. What was the complete pathologic response rate in these patients with post-treatment N0 disease?
Dr Rice.The problem with clinical staging is that the pathologic stage is not known until all the treatment is completed. If induction therapy is successful, then the pathologic stage is changed. I agree that patients who have a complete response, no residual tumor, are the best group of patients. Twenty percent of our patients had complete sterilization of the tumor and the regional lymph nodes with induction therapy. However, those patients who were T0 but still N1 at resection had a poor survival. So sterilization of the tumor is not the key. Regional lymph node downstaging is crucial. We did not include the data on T downstaging because it made the presentation and paper even more complex. I plan to write a companion article about the implications of T staging and downstaging. Twenty percent of our patients had sterilization of the tumor and regional nodes, and approximately 30%, a third of the patients, had sterilization of regional nodes.
Dr Whyte.My second question concerns the clinical staging. You had an error rate of 20% based on clinical stagingincluding both false positives and false negatives. Would this justify more aggressive surgical staging such as more liberal use of needle biopsies, EUS-guided needle biopsies, or thoracoscopic or laparoscopic staging?
Dr Rice.Staging is in its infancy and is certainly evolving. No doubt clinical staging will improve with improved instruments and technology. The addition of EUS-guided fine needle aspiration of regional nodes will improve the determination of regional lymph node status. Thoracoscopic and laparoscopic staging, which is not the equivalent of mediastinoscopy in lung cancer, is hard to do. It is a difficult procedure that is essentially another operation. It should be reserved for those questions that are not answered by clinical staging. There is a role for invasive staging, but not in every patient.
Dr White. Should we now consider a simulated or, to use the computer parlance of the day, a "virtual" randomized study to settle these types of questions, particularly when there are true randomized studies or randomized clinical trials that address the same question yet fail to find significant differences in the two treatment arms?
Dr Rice. Simulated randomization would never replace a truly randomized phase III study. However, the present studies as they were conceived do not randomize patients by stage. This is a major problem, because the small effect of induction therapy on a few patients is hidden by the inclusion of very good risk patients. Not until we have sophisticated studies that randomize patients by stage, are well conducted, and are adequately analyzed are we ever going to have a phase III study that will answer the questions. If we continue to conduct studies of esophageal cancer as we have, every study will be a negative study. The answers will be hidden because of the inadequacies of the preparation and conduct of the study.
Dr Donald Low (Seattle, Wash). Dr Rice, I enjoyed your paper as well. Previous publications, however, have tended to take the nodal status issue and look on it not only positive-negative but number of nodes positive and number of stations positive. Did you substratify the patients who were pN1 positive to the number of nodes and to the number of stations positive?
Dr Rice. We did analyze by number of positive nodes. There is an exponential decreasing survival. The break point is at three positive nodes. After, survival plateaus at a very low level. With one node positive, 5-year survival is about 40%, with two nodes positive it is a bit less than 20%, and once there are three nodes or more positive, 5-year survival is less than 5%. The percentage of lymph nodes positive in resection specimens is also important, and the break point is around 20%. We did not look at the number of perihilar stations. There is no doubt that lymph node burden is crucial in further defining N1 disease.
James Mark (Stanford, Calif). I was struck with the staging question also. You had some cN1 pN0 patients who were operated on alone. In other words, nothing had been given them and they were overstaged before the operation. I would suggest that if you are going to use patients like that as a control or use noninvasive staging to get those answers, you subtract that number from the ones that are downstaged by induction therapy, because some are downstaged with no therapy.
Dr Rice. Understaging/overstaging was included in the analysis. The analysis in the paper includes 8 groups: the 4 additional groups were patients overstaged, patients understaged, N0 patients who received induction therapy because of advanced T, and patients who were cN0 at induction but were pN1 at resection; this represents either misstaging or progression on treatment. The analysis of all 8 groups gets pretty complex. For simplicity, I presented 4 groups that encompass the two main concepts, staging and downstaging. The detailed analysis of all 8 groups is included in the paper.
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Pennathur, J. D. Luketich, R. J. Landreneau, J. Ward, N. A. Christie, M. K. Gibson, M. Schuchert, K. Cooper, S. R. Land, and C. P. Belani Long-Term Results of a Phase II Trial of Neoadjuvant Chemotherapy Followed by Esophagectomy for Locally Advanced Esophageal Neoplasm Ann. Thorac. Surg., June 1, 2008; 85(6): 1930 - 1937. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. P. Rizk, E. Venkatraman, M. S. Bains, B. Park, R. Flores, L. Tang, D. H. Ilson, B. D. Minsky, and V. W. Rusch American Joint Committee on Cancer Staging System Does Not Accurately Predict Survival in Patients Receiving Multimodality Therapy for Esophageal Adenocarcinoma J. Clin. Oncol., February 10, 2007; 25(5): 507 - 512. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Rice, D. P. Mason, S. C. Murthy, G. Zuccaro Jr, D. J. Adelstein, L. A. Rybicki, and E. H. Blackstone T2N0M0 esophageal cancer J. Thorac. Cardiovasc. Surg., February 1, 2007; 133(2): 317 - 324. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Cerfolio, A. S. Bryant, B. Ohja, A. A. Bartolucci, and M. A. Eloubeidi The accuracy of endoscopic ultrasonography with fine-needle aspiration, integrated positron emission tomography with computed tomography, and computed tomography in restaging patients with esophageal cancer after neoadjuvant chemoradiotherapy J. Thorac. Cardiovasc. Surg., June 1, 2005; 129(6): 1232 - 1241. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Kesler, P. R. Helft, E. A. Werner, N. P. Jain, J. A. Brooks, J. M. DeWitt, J. K. Leblanc, N. S. Fineberg, L. H. Einhorn, and J. W. Brown A Retrospective Analysis of Locally Advanced Esophageal Cancer Patients Treated With Neoadjuvant Chemoradiation Therapy Followed by Surgery or Surgery Alone Ann. Thorac. Surg., April 1, 2005; 79(4): 1116 - 1121. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Malaisrie, B. Untch, G. V. Aranha, N. Mohideen, A. Hantel, and J. Pickleman Neoadjuvant Chemoradiotherapy for Locally Advanced Esophageal Cancer: Experience at a Single Institution Arch Surg, May 1, 2004; 139(5): 532 - 539. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Rice, D. J. Adelstein, M. A. Chidel, L. A. Rybicki, M. M. DeCamp, S. C. Murthy, and E. H. Blackstone Benefit of postoperative adjuvant chemoradiotherapy in locoregionally advanced esophageal carcinoma J. Thorac. Cardiovasc. Surg., November 1, 2003; 126(5): 1590 - 1596. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Goldberg, J. Farma, C. Lampert, P. Colarusso, L. Coia, H. Frucht, E. Goosenberg, M. Beard, and L. M. Weiner Survival following intensive preoperative combined modality therapy with paclitaxel, cisplatin, 5-fluorouracil, and radiation in resectable esophageal carcinoma: A phase I report J. Thorac. Cardiovasc. Surg., October 1, 2003; 126(4): 1168 - 1173. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Hagen and T. R. DeMeester Staging of esophageal carcinoma J. Thorac. Cardiovasc. Surg., May 1, 2003; 125(5): 988 - 991. [Full Text] [PDF] |
||||
![]() |
T. W. Rice, E. H. Blackstone, D. J. Adelstein, G. Zuccaro Jr, J. J. Vargo, J. R. Goldblum, S. C. Murthy, M. M. DeCamp, and L. A. Rybicki Role of clinically determined depth of tumor invasion in the treatment of esophageal carcinoma J. Thorac. Cardiovasc. Surg., May 1, 2003; 125(5): 1091 - 1102. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Rice, E. H. Blackstone, L. A. Rybicki, D. J. Adelstein, S. C. Murthy, M. M. DeCamp, and J. R. Goldblum Refining esophageal cancer staging J. Thorac. Cardiovasc. Surg., May 1, 2003; 125(5): 1103 - 1113. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ANN THORAC SURG | ASIAN CARDIOVASC THORAC ANN | EUR J CARDIOTHORAC SURG |
| J THORAC CARDIOVASC SURG | ICVTS | ALL CTSNet JOURNALS |