
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Molecular Oncology, Markers, Clinical Correlates |
1 Gray Cancer Institute and 2 Department of Pathology, Mount Vernon Hospital, London, United Kingdom; 3 Department of Oncology, University College London, London, United Kingdom; and 4 Barbara Ann Karmanos Cancer Institute, Detroit, Michigan
| ABSTRACT |
|---|
|
|
|---|
Experimental Design: Tumor material was obtained from 402 patients. Immunohistochemistry was used to assess Ki-67, CD31, p53, Bcl-2, and cyclin D1 expression. A hierarchical clustering algorithm with a Bayesian information criterion was used to group tumors with similar marker expression; resulting expression profiles were then compared in terms of their difference in outcome after CHART and conventionally fractionated radiotherapy.
Results: Molecular marker profile was an independent prognostic factor for locoregional control. This was confirmed in multivariate analysis, including clinical variables such as tumor and nodal status, primary site, histological grade, age, and gender (P < 0.001 and P = 0.006 for local and nodal relapse, respectively). In particular, Bcl-2-positive tumors responded significantly better than average in both arms of the trial. Tumors negative for p53- and Bcl-2, with high and randomly patterned Ki-67 expression, responded worse than average with no benefit from CHART. Tumors with similarly negative p53 and Bcl-2, but low Ki-67 staining, with an organized pattern, benefit significantly from CHART schedule.
Conclusions: This study demonstrates the potential of molecular profiles to predict radiotherapy response of head and neck squamous cell carcinoma and for treatment stratification. Distinct expression profiles correlate with three distinct clinical phenotypes, including good locoregional control, poor locoregional control, and an outcome strongly dependent upon fractionation schedule.
| INTRODUCTION |
|---|
|
|
|---|
The secondary aim of this study was to investigate the potential and flexibility of hierarchical clustering for clinical outcome analysis of data sets with a high number of factors, or covariates, measured for each case. Cluster analysis is an unsupervised classification method for data mining, directed toward identifying tumors with a similar molecular "fingerprint." Here, the clusters are built in the covariate space, and it is only after the clusters have been identified that they are compared in terms of outcome after a specific therapy, both in uni- and multivariate analysis. This strategy overcomes some of the limitations of survival analysis methods that select the covariates in the model based on their association with outcome in the same series, for example Cox proportional hazards model. This model has been the mainstay of multivariate analysis of the influence of prognostic factors on end points requiring prolonged observation of the patient, such as survival or locoregional control, and it has been used already to analyze the impact of gene expression profiles on the outcome of breast carcinoma (3) and lung adenocarcinoma (4) . However, P-values of the covariates included in the model are biased, and they appear to be more significant than they would in an independent confirmatory study. Also, there is a tendency to overfit the data, particularly when the number of covariates increases. As a result, the parameter estimates are often highly sensitive to the detailed composition of the patient population analyzed. Furthermore, this model is linear in the parameters, and it is not suited for the study of complex association patterns between many covariates and outcome. The use of clustering is based on the biological assumption that the overall biomolecular profile of a tumor will be indicative of its response to a specific treatment; thus, considering recurring biomolecular patterns, rather than the expression of single markers, may be a more powerful approach to identifying specific groups that benefit from a particular treatment.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Qualitative and Quantitative Assessment of Staining.
All of the slides were qualitatively or semiquantitatively analyzed by a consultant pathologist, except in the case of cyclin D1. Ki-67 slides were visually scanned and ascribed to one of three scores, 1 = <20% positive cells, 2 = 2040% positive, and 3 = >40% positivity. Ki-67 proliferative pattern was also assessed, as described previously (5)
, by scoring, 1 = marginal (mostly organized), 2 = intermediate (mainly organized), 3 = mixed (more than one pattern), and 4 = random (diffuse, disorganized staining). Bcl-2 was scored as negative if <5% of cells were stained and positive otherwise. The p53 staining was ascribed to one of three scores, 1 = negative (<5% positive cells), 2 = sporadic (575% cells positive), and 3 = all (>75% cells). The staining intensity also classified by scores, 1 = weak, 2 = intermediate, and 3 = strong. The microvessel count identified by CD31 staining was classified in 10 high-power fields, 1 =
35 microvessels, 2 = 3555 microvessels, and 3 = >55 microvessels. Cyclin D1 was assessed by manual counting aided by an in-house image acquisition system incorporating a CCIR-format 3-CCD chip color camera coupled to a PC-based video-rate frame grabber. Software routines for image acquisition, normalization, and storage have been developed as have grids and manual counting data recording and export to spreadsheets. At least 10 high-power fields (x40 objective) were counted for each specimen.
Clinical End Points.
All of the patients were observed weekly for a period of 6 weeks after the start of treatment. Additional follow-ups were at 8 weeks and 3 months after the first day of treatment, subsequently 3 monthly to 2 years, 6 monthly to 5 years, and annually thereafter. Local or nodal failures were recorded when clinically definitive tumor growth was detected at the primary site or in the nodes, respectively. Distant failure was defined as the appearance of metastases outside of the irradiated volume. For each patient, the time to first failure and its location were recorded. Patients without clinical progression were censored at the time of the last follow-up. The local and nodal failures considered in this study were those within the small volume; that is the volume receiving the full dose and including the tumor and involved nodes with a margin. Locoregional control is defined as local control without any nodal recurrence.
Survival Analysis Methods.
The statistical analyses were performed using the SPSS (SPSS Inc., Chicago, IL), S-PLUS 6.1 (Insightful, Basingstoke, United Kingdom), and R packages.5
Backward stepwise elimination based on the likelihood ratio test was used to identify reduced Cox proportional hazards models including only covariates significant at the 0.05 level. For nominal variables, deviation contrasts were used (each category of the predictor variable except the reference category was compared with the overall effect), and for ordinal variables, indicator contrasts were used (each category was compared with the reference category). The significance of the interaction terms between covariates and nonlinear effects of scale covariates were assessed using the interaction and spline algorithms in the S-PLUS 6.1 function coxphi.The proportional hazards hypothesis was tested by estimating the Schoenfeld residuals correlation with the survival time (cox.zph; S-PLUS 6.1). To evaluate the predictive performance of different survival models, receiving operating characteristic curves and relative areas were calculated, using bootstrapping with replacement.
Hierarchical Clustering Methods.
Cluster analysis was performed using SPSS implementation (TwoStep function) of the balanced iterative reducing and clustering using hierarchies method (6)
. This consists of two iterative steps. In the precluster step, the data are scanned individually, and a modified cluster features tree is constructed. Each new case, starting from the root node, descends along the tree based on its similarity to existing nodes, and it is either added to an existing leaf node or it can form a new leaf node. In the cluster step, the leaf nodes of the cluster feature tree are grouped into the desired number of clusters using an agglomerative clustering algorithm. This produces a range of solutions that are compared to estimate the optimal number of clusters. This method has two advantages compared with other K-mean or hierarchical clustering methods: it has the ability to cluster in a space formed by both scale and categorical variables, and it allows optimization of the number of clusters. It can also handle large data sets more efficiently than conventional hierarchical clustering algorithms. Here, the likelihood distance measure was used as the similarity criterion, because both scale and continuous variables were considered. In particular, the percentage of Ki-67/p53/cyclin D1-stained cells and the number of CD31-stained microvessels were entered as scale variables, because they are semiquantitative variables. An analysis where the percentage of Ki-67/p53-stained cells was entered as categorical variables were also performed, and the main results did not vary significantly from the results shown in this article (data not shown). Ki-67 staining pattern and p53 staining were entered as categorical variables, because they are qualitative parameters. Bcl-2 staining was entered as a binary variable. Schwarzs Bayesian criterion was used as the clustering optimization criterion. In a test on simulated data sets, the balanced iterative reducing and clustering using hierarchies algorithm was found superior to other K-mean and hierarchical clustering algorithms in identifying the simulated clusters, with a very low misclassification frequency (data not shown). In-house routines for bootstrapping with replacement were developed within the SPSS software to assess the robustness of cluster structure obtained with the balanced iterative reducing and clustering using hierarchies algorithm and the strength of the correlation of the resulting profiles with clinical end points in Cox regression analysis.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
Cox Proportional Hazards Analysis of Molecular Markers and Clinical Variables.
In a conventional Cox proportional hazards model, including molecular markers, tumor T and N class, histological grade, tumor site, treatment received, patient age, and gender, Bcl-2 status tended to dominate the results (data not shown). Bcl-2-positive patients showed lower hazard of local failure (0.416; P = 0.008) and nodal relapse (0.551; P = 0.006) and higher survival (P < 0.001). Cyclin D1 was positively correlated with nodal relapse hazard (P = 0.027) and negatively correlated with survival rate (P = 0.012). The p53 showed borderline correlation with local failure (P = 0.085) and significant correlation with survival (P = 0.009); tumors with low and high intensities had worse local control and survival than those with intermediate staining. Ki-67 staining pattern was positively correlated with the risk of distant failure (P = 0.001), with a 6.45 hazard risk of mostly organized versus random pattern. Interaction terms between the different markers and quadratic terms did not show significance in these Cox models. The Cox proportional hazards hypothesis was statistically met in all of the cases, apart from the local control model (P = 0.035). However, when single covariates in this model were considered, T class only did not meet the hypothesis (P = 0.003). The area under the receiving operating characteristic curves was higher for Cox models including the clusters than for Cox models including separate biological or clinical characteristics only, but this was not significant.
Distinct Molecular Marker Profiles Are Associated with a Benefit from CHART.
Treatment was found to be a significant prognostic factor for local control only (Fig. 4)
. When a Cox analysis for local control, including T and N class, treatment, histological grade, gender, and age, was stratified by cluster membership, treatment and T class were significant (P = 0.045 and 0.001, respectively). Log-rank test in each cluster (Fig. 5)
showed that patients in clusters 2 and 7 had a significant benefit in local control from CHART (P = 0.002 and P = 0.044, respectively). After bootstrapping with 1000 samples, these results were retained in >95% of the samples. Similar results were obtained for nodal control and survival, although in this case, the significance was seen only for cluster 2 (data not shown). Clearly, comparing the effect of randomization to CHART in each of the eight clusters involves a risk of false-positive findings because of multiple comparisons. However, after application of the Bonferroni correction for multiple tests, the difference in local control between patients in cluster 2 receiving CHART versus CRT was still significant (P = 0.0136).
|
| DISCUSSION |
|---|
|
|
|---|
Cluster analysis is an unsupervised classification method for data mining; it has been used recently in the analysis of gene expression data sets, showing the importance of gene expression profiles in relation to survival and treatment outcome (7 , 8) . However, there are fundamental biological differences in the information provided by molecular markers and gene expression microarrays, and histological data often include very different types of data, from quantitative data (such as the percentage of staining) to purely qualitative data (such as staining pattern); thus, algorithms allowing for different data types are needed. The balanced iterative reducing and clustering using hierarchies algorithm (6) , with a Bayesian information criterion, was used in this study. This hierarchical clustering method allows for the presence of different types of data and can handle large data sets. Furthermore, the Bayesian information criterion selects the most informative number of clusters by finding an optimum balance between the two extreme scenarios, the few poorly defined clusters with many patients in each cluster and the many very well-defined clusters with few patients in each cluster. In particular, the latter case tends toward the extreme scenario of grouping patients on the basis of all of the possible marker expression combinations. This approach has been used in multiple marker studies, but it is mostly ineffective, because when the number of markers increases, the groups contain too few patients to allow a powerful comparison in terms of treatment outcome. Another important aspect of the strategy used in this study is that the profiles are clustered on their molecular similarity, and only after the formation process has ended are the clusters compared in terms of treatment outcome. This overcomes problems with biased significance of the parameters and overfitting, which are typical of survival analysis methods, where the covariates are selected based on their association with treatment outcome in the same series, such as Cox proportional hazards model.
This study focused on biological factors known to affect tumor response to radiotherapy and to fractionation schedule, such as proliferation (Ki-67), cell cycle deregulation (p53) and progression (cyclin D1), apoptosis (Bcl-2 and p53), and vascularity (CD31). Profile membership showed significant correlation with tumor site and histological grade (Fig. 2)
, and with the considered clinical end points (Figs. 3
and 4
). In particular, patients in cluster 4 performed significantly better than patients in the other clusters. This cluster was fully characterized by Bcl-2-positive tumors, and, accordingly, Bcl-2 expression has been shown previously to be associated with higher locoregional control in these patients (9)
. Although Bcl-2 appears to have a specific role in promoting the initiation and progression of numerous cancers, it has shown very controversial results in relation to radiotherapy outcome (10)
. In these patients, Bcl-2-negativity within tumors may simply reflect deregulation of this pathway (9)
. Furthermore, there is evidence that the expression ratio of Bcl-2 to Bax could be more informative than Bcl-2 absolute value (10)
. However, few patients (Table 1)
had Bcl-2-positive tumors, and, thus, the Bcl-2:Bax ratio may be less informative here than in other tumor types.
Patients in cluster 3 showed a significantly worse response to treatment than patients in any other clusters (Figs. 3
and 4
), with significantly lower local control (P < 0.001) and survival (P = 0.001). Tumors in this cluster were Bcl-2- and p53-negative and showed a high percentage of Ki-67 stained cells with a random proliferation pattern. Although Bcl-2 negativity is one of the possible causes for the poor outcome, it was not a unique feature of this cluster (Figs. 1
and 2
). The high percentage of Ki-67-stained cells with a random pattern of staining implies a group of highly proliferative tumors, which might be more aggressive and, hence, have a worse overall treatment outcome. However, clusters 68 show a moderate to high percentage of Ki-67-stained tumor cells, with mixed to random patterns (Figs. 1
and 2
), and they do not show poorer than average treatment outcome (Figs. 3
and 4
). Like Bcl-2, Ki-67 is a marker that has generated studies also with conflicting results. For example, a study of larynx tumors from a large cohort of patients concluded that Ki-67 expression was not predictive of prognosis (11)
, whereas a recent study of oral cavity tumors has argued that a high percentage of Ki-67-stained cells may correlate with an improved radiotherapy response (12)
. However, in contrast to our study, these oral cavity tumors were mainly T1 tumors. The p53 status of tumors in cluster 3 is difficult to establish, because the antibody used was not specific for wild-type or mutated p53. It is likely that high p53 staining intensity is attributable to the accumulation of mutated p53, whereas intermediate p53 staining intensity might be attributable to environmental stress-related induction of wild-type p53. Negative staining could indicate wild-type, deleted, or truncated p53 not recognized by the antibody. However, tumors with a negative and high p53 staining intensity experienced significantly poorer prognosis than tumors with intermediate intensity (P = 0.039, log-rank test), suggesting that tumors with negative or high level of intensity might have similar p53 status. This hypothesis could be tested by staining for target molecules of p53 such as p21, although this has the limitation that it will suggest only some of the possible mutations or, ideally, by looking for p53 mutations using DNA sequencing in a series where frozen material is available. Mutated p53 has been shown to correlate with increased resistance to radiotherapy in animal models (13
, 14) and in humans (12)
and could contribute to the poor prognosis in this cluster. However, low p53 intensity is not a unique feature of this cluster. All of the above considerations indicate that the particular combination of all of these factors, rather than one specific factor alone, determines the poor prognosis of patients in cluster 3.
Although cyclin D1 had no major role in the formation of the clusters, it was still a significant variable for outcome in a Cox model, independently from the observed profiles, and correlated negatively with nodal failure and survival (see "Results"). This is in agreement with the previous finding that cyclin D1 overexpression is an indicator of poor prognosis in HNSCC (15 , 16) .
Randomization to CHART was a significant factor for only local control (Fig. 4)
, with a 0.7 hazard ratio of CHART versus CRT. Kaplan-Meier analyses in the various clusters showed that treatment was a significant variable for local control in cluster 2 and cluster 7 (Fig. 5)
. Multiple paired comparisons of CHART versus CRT in each of the eight clusters may lead to spuriously significant difference. However, the difference in local control in cluster 2 between patients receiving CHART versus CRT was still significant after a Bonferroni correction for multiple testing (P = 0.0136). Bcl-2 and p53 negativity and low Ki-67 expression, with mostly organized staining pattern, characterized these tumors (Fig. 1)
. Cluster 2 shows also a higher percentage of well-differentiated tumors with respect to the other clusters (Fig. 2)
; this agrees with a previous study suggesting that well-differentiated tumors have a more pronounced treatment-time dependency (17)
. Profile 7 showed benefit also from CHART, but this result was not significant after a multiple test correction. This cluster is characterized by intermediate values of p53, Bcl-2 negativity, and low organization of the Ki-67 staining, suggesting that highly and randomly proliferative tumors might benefit also from CHART in the case of intermediate p53 levels.
Although these results are interesting, additional investigation is needed, because identification of tumor characteristics that are predictive for a benefit from acceleration would allow both selection of patients for a specific fractionation schedule and development of radiation modifiers targeted at specific pathways or molecules. When comparing CHART with CRT, the benefit from CHART will be a trade-off between a decreased likelihood of locoregional control because of a lower dose and an increased likelihood of locoregional control because of a shorter treatment time. However, the CHART trial has found no overall benefit from accelerated schedule with respect to CRT (2) ; this is thought to reflect either a lower-than-expected overall accelerated repopulation response in these tumors, or, perhaps, that the advantage from a short schedule has been counteracted by a shorter time (12 days) to reoxygenate the most hypoxic of these tumors (18) . Thus far, attempts to relate cellular proliferation directly to the benefit from accelerated fractionation have failed, and it is likely that the putative link involves multiple biological factors (19) . Here, we have shown that tumors with negative p53 and Bcl-2, and with a mostly organized Ki-67 pattern, did benefit from CHART. These tumors could have a greater tendency to respond to radiation damage with accelerated repopulation, as seen in normal squamous epithelium (20 , 21) , and this type of response could occur less frequently or be less pronounced in tumors showing a proliferation pattern with lower organization, or tumors with a different p53 or Bcl-2 status. This would make these tumors most sensitive to changes in fractionation schedule. This hypothesis is under study, both using biomodeling simulations and by assessing the expression of molecules believed to be involved or associated with the occurrence of accelerated repopulation after irradiation, such as epithelial growth factor receptor.
An alternative hypothesis is that these tumors have a more pronounced or faster reoxygenation, which would overcome the disadvantage of the shortened time for reoxygenation of very hypoxic tumors undergoing CHART. The expression of CD31 before therapy (and thus microvessel density) in these tumors is not different from other clusters (Fig. 1)
, suggesting that the initial hypoxic status is not different. However, a Kaplan-Meier analysis of local control, comparing patients according to treatment and stratifying them by CD31 expression (dichotomized to gain some statistical power), showed a significant benefit from CHART both for the low and the high CD31 tumors (P = 0.01), with a very similar hazard for CHART versus CRT between the low and high CD31 expression groups (0.28 and 0.26, respectively). This supports the hypothesis that the hypoxic tumors in this cluster are able to reoxygenate during treatment. In contrast, when the same analysis was performed in cluster 7 (the other cluster showing a benefit from CHART), significance was seen only for the high CD31 group (CHART/CRT hazard = 0.14; P = 0.004); no benefit was seen for the low CD31 group. This suggests that hypoxic tumors limit the benefit from CHART for patients in cluster 7, or, in other words, that for these tumors, 12 days are not sufficient for reoxygenation. Clinical support for the hypothesis that impeded reoxygenation may play a role in the CHART regimen comes from a randomized controlled trial showing a benefit when the bioreductive drug mitomycin C was added to CHART radiotherapy (22)
. The validation of both the repopulation and reoxygenation hypotheses would benefit from an independent prospective series of patients, where proliferation and oxygenation could be measured before, during, and after radiotherapy, using either histological molecular markers or molecular imaging techniques such as positron emission tomography, where new biomarkers are being found and tested for targeting hypoxia and proliferation (23)
.
In conclusion, this study identifies the potential of molecular marker expression profiles to predict radiotherapy response of HNSSC and treatment stratification. In particular, three distinct expression profiles were identified that correlate with three distinct clinical phenotypes; one group of patients responded particularly well to radiotherapy, a second group showed poor radiotherapy response, and a third group showed a significant benefit from a strongly accelerated radiotherapy. Additional staining of these tumors, to study additional markers, may help to identify patients who are at risk of locoregional HNSCC failure and particularly sensitive to changes in fractionation schedules. Prospective studies, where immunohistochemical staining, molecular imaging, and gene expression data can be collected at different time points, may help to correlate distinct radiation responses with specific aberrant proliferation, apoptotic and/or hypoxic pathways, and to refine radiotherapy fractionation and chemoradiation strategies in HNSCC.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Requests for reprints: Francesca M. Buffa, Gray Cancer Institute, P. O. Box 100, Northwood, London, HA6 2JR, United Kingdom. E-mail: buffa{at}gci.ac.uk
5 Internet address: http://www.r-project.org. ![]()
Received 9/17/03; revised 2/ 6/04; accepted 2/13/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. C. Winter, F. M. Buffa, P. Silva, C. Miller, H. R. Valentine, H. Turley, K. A. Shah, G. J. Cox, R. J. Corbridge, J. J. Homer, et al. Relation of a Hypoxia Metagene Derived from Head and Neck Cancer to Prognosis of Multiple Cancers Cancer Res., April 1, 2007; 67(7): 3441 - 3449. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. M. Erovic, M. Pelzmann, M. Ch. Grasl, J. Pammer, G. Kornek, W. Brannath, E. Selzer, and D. Thurnher Mcl-1, Vascular Endothelial Growth Factor-R2, and 14-3-3{sigma} Expression Might Predict Primary Response against Radiotherapy and Chemotherapy in Patients with Locally Advanced Squamous Cell Carcinomas of the Head and Neck Clin. Cancer Res., December 15, 2005; 11(24): 8632 - 8636. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Bentzen, B. M. Atasoy, F. M. Daley, S. Dische, P. I. Richman, M. I. Saunders, K. R. Trott, and G. D. Wilson Epidermal Growth Factor Receptor Expression in Pretreatment Biopsies From Head and Neck Squamous Cell Carcinoma As a Predictive Factor for a Benefit From Accelerated Radiation Therapy in a Randomized Controlled Trial J. Clin. Oncol., August 20, 2005; 23(24): 5560 - 5567. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |