
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Molecular Oncology, Markers, Clinical Correlates |
Departments of 1 Urology, 2 Pathology and Laboratory Medicine, 3 Biostatistics, 4 Human Genetics and Biostatistics, and 5 Medicine and Urology, University of California School of Medicine, Los Angeles, California
| ABSTRACT |
|---|
|
|
|---|
Experimental Design: A custom tissue array was constructed using clear cell RCC from 318 patients, representing all stages of localized and metastatic RCC, and immunohistochemically stained for molecular markers Ki67, p53, gelsolin, CA9, CA12, PTEN, EpCAM, and vimentin. We present a strategy for evaluating individual candidate markers for prognostic information and integrating informative markers into a multivariate prognostic system.
Results: The overall median follow-up and the median follow-up for surviving patients were 28 and 55 months, respectively. A prognostic model based primarily on molecular markers included metastasis status, p53, CA9, gelsolin, and vimentin as predictors and had high discriminatory power: its statistically validated concordance index (C-index) was found to be 0.75. A prognostic model based on a combination of clinical and molecular predictors included metastasis status, T stage, Eastern Cooperative Oncology Group PS, p53, CA9, and vimentin as predictors and had a C-index of 0.79, which was significantly higher (P < 0.05) than that of prognostic models based on grade alone (C = 0.65), TNM stage alone (C = 0.73), or the University of California Los Angeles integrated staging system (C = 0.76).
Conclusions: Protein expressions obtained using widely available technology can complement standard clinical predictors such as TNM stage, histological grade, and PS.
| INTRODUCTION |
|---|
|
|
|---|
In general, prior attempts to predict patient survival have relied on traditional clinical parameters such as tumor stage and grade (3 , 8) . More recently, methods based on gene arrays, which screen for differential expression of thousands of genes, have identified large numbers of potential prognostic markers (9 , 10) . Our study, evaluating protein expression in a high-throughput tissue array, is a natural extension to the efforts for molecular staging. All of the markers examined in this study were selected based on previous reports linking the markers to the development of malignancies. Ki67 and p53 are related to cellular proliferation (11, 12, 13) . In RCC, Ki67 is an independent predictor of survival (11 , 12) , and a mutated p53 has been shown to be an independent predictor of survival in some studies (14 , 15) .
Gelsolin, EpCAM (epithelial cell adhesion molecule), and vimentin may be involved in cell motility and cancer progression. Gelsolin is the most potent protein known that functions to sever actin during cell motility (16) , and it has been described as a highly significant indicator of poor prognosis in non-small-cell lung cancer (17) . EpCAM is widely expressed on the surface of many carcinomas (18 , 19) . Vimentin, an intermediate filament, has previously been identified as an independent predictor of poor prognosis in RCC (20 , 21) .
CA9, CA12, and PTEN (phosphatase and tensin homologue deleted from chromosome 10) are critical components of the hypoxia pathway, which allows enlarging tumors to adapt to an oxygen-poor microenvironment. In addition, overexpression of CA9 and CA12 is a direct consequence of a VHL mutation, found in over 75% of sporadic clear cell RCC (22 , 23) . Decreased expression of CA9 has previously been shown to predict worse survival (24) . PTEN is a phosphatase that regulates cellular migration, proliferation, and apoptosis (25 , 26) . Although PTEN mutation may be a rare event in RCC (27 , 28) , PTEN deletion has been shown to correlate with poor prognosis (27) .
With the explosive growth of available genomic and proteomic data, the number of markers that have been correlated with prognosis is continuously increasing. We present a strategy for evaluating individual candidate markers for prognostic information and integrating informative markers into a multivariate prognostic system. Levels of marker expression were used to develop two prognostic models for predicting disease-specific survival (DSS) in patients with clear cell RCC. The accuracy of these models was compared with traditional, clinical parameters used for predicting survival.
| MATERIALS AND METHODS |
|---|
|
|
|---|
After surgery, patients were evaluated for disease recurrence by physical examination, liver function tests, chest X-ray, and computerized tomography of the abdomen/pelvis every 612 months. The primary outcome of interest was DSS. Patients with metastatic RCC were treated with cytoreductive nephrectomy (n = 155) followed by interleukin-2-based immunotherapy (n = 116) and then evaluated radiologically in 3 months to determine the response to therapy. A positive response was defined as a complete or partial response.
Tissue Array Construction.
Archival tumor specimens from the cohort of 318 patients were obtained from the Department of Pathology. All tumors were reviewed to confirm clear cell histology. Three core tissue biopsies, each 0.6 mm in diameter, were taken from selected, morphologically representative regions of each paraffin-embedded renal tumor and precisely arrayed using a custom-built instrument as described previously (32)
.
Immunohistochemistry.
Array sections were deparaffinized and immunostained using either the DAKO Envision Plus (DAKO, Carpinteria, CA) or Vector ABC Elite (Vector Laboratories, Burlingame, CA) staining systems as described previously (33)
. Antigen retrieval involved pepsin digestion for 10 min for EpCAM staining, treatment in a pressure cooker for 3 min for Ki67, and heat treatment for 25 min for all other markers. Ki67 staining was performed with mouse monoclonal antibody MIB-1 (DAKO) at 0.5 µg/ml. Gelsolin staining was performed with monoclonal mouse antibody GS-2C4 (Sigma, St. Louis, MO) at 3.8 µg/ml. Immunostaining for PTEN was performed using rabbit polyclonal antibody PN37 (Zymed, South San Francisco, CA) at 2 µg/ml. Immunostaining for EpCAM was performed using monoclonal mouse antibody KS1/4BD (PharMingen, San Diego, CA) at 20 µg/ml. Immunostaining for p53 was performed with monoclonal mouse antibody D07 (DAKO) at a 1:100 dilution. CA9 staining was performed with monoclonal mouse antibody M75 (a gift from Dr. Eric Stanbridge, University of California at Irvine, Irvine, CA) at a 1:25,000 dilution of stock. CA12 staining was performed with a rabbit polyclonal antibody (a gift from Dr. Michael Lerman, National Cancer Institute, Bethesda, MD) at a 1:450 dilution. Vimentin staining was performed with mouse monoclonal antibody V9 (DAKO) at a 1:1000 dilution.
Immunostaining was scored by recording the percentage of cells staining and scoring the area of maximum staining on a 4-point scale, with 0 representing no staining and 3 representing the highest staining. All three cores that were arrayed per tumor were scored and pooled to yield the mean, median, minimum, and maximum values for the three cores.
Statistical Analysis.
Length of follow-up was the time from nephrectomy to date of death or last contact. The end point of interest was DSS. Kaplan-Meier curves were generated to visualize survival rates, and we used Cox proportional hazards models to relate DSS to molecular and clinical predictors. The Schöenfeld residual test was used to evaluate the proportional hazard assumptions. All computations were performed with the R statistical software using the Design, Hmisc, Rpart, and RandomForest libraries. P < 0.05 was considered significant, and P > 0.10 was the criterion when performing Cox regression backward step-down variable deletions.
The predictive accuracy of various Cox regression models was quantified by calculating the concordance index (C-index), which provides the area under the receiver operating characteristics curve for censored data (34 , 35) . A C-index of 0.5 indicates that outcomes are completely random, whereas a C-index of 1 indicates that the model is a perfect predictor. To protect against overfitting during stepwise regression, we used a bootstrap procedure as implemented in the "validate" function of the Design library (34) , which allowed for computation of an unbiased estimate of the C-index. We used 500 bootstrap samples.
To test whether the difference in statistical accuracy between non-nested Cox regression models is significant, we used the rcorrp.cens function in the Design library (34) . This computes U-statistics for testing whether the predictions of one model are more concordant than those of another model, extending the C-index. Specifically, the fraction of pairs was determined for which one model correctly selects the patient with the longer DSS when the competing model did not. Before calculating the C-index or using the rcorrp.cens function, TNM staging system (T stage, N stage, and M stage) and UISS (T stage, N stage, M stage, ECOG PS, and grade) were fitted to our data.
To account for potential overfitting resulting from selection of the staining criterion and cutoff used to stratify DSS, we used a prevalidation method described in Liu et al. (36) Briefly, the prevalidation method is carried out in three steps for each marker. First, an intercept only Cox regression model is fit to the survival times to arrive at deviance residuals, which are normalized transforms of martingale residuals. Second, random regression forest predictors are fit where the outcome is the deviance residual and the covariates are the pooled marker stainings (37) . Random regression forest predictors work well in the setting of many covariates but relatively few observations. Third, the random forest construction is used to define a tumor marker hazard score M as the out-of-bag estimate of the deviance residual. The hazard score M is a real number.
Liu et al. (36) have shown that when M is used as a covariate in a Cox regression model involving the same survival outcome that was used to construct it, it does not lead to an inflation of the false positive rate of the one-sided Wald test that tests whether coefficient is positive. Therefore, we used the corresponding M for each marker as the bias-corrected marker covariate in all univariate and multivariate Cox regression analyses in computing the C-index and in the rcorrp.cens function.
The Cox proportional hazards regression analysis was used to construct two predictive models. One model was primarily based on molecular markers (marker model). A second predictive model was based on a combination of molecular markers and clinical variables (clinical/marker model). We showed previously (24) that CA9 is an independent predictor for survival in patients with metastatic RCC, but not localized RCC. Therefore, an interaction term for CA9 and metastasis status (Met*CA9) was used.
To visualize the relationship between clinical predictors and DSS, we constructed a nomogram for a Cox model that only contains significant terms and dichotomized marker stainings. Although dichotomized marker values were used, only markers shown to contain prognostic information using the prevalidation method were included in the nomogram. This nomogram should be considered as an approximation of the validated models described above. In particular, the nomogram is not used for model comparisons and inferences as described above. Rather, the nomogram is a descriptive tool that needs further validation with new and independent data.
For each molecular marker, the optimal cutoff for the staining scores to stratify DSS was determined using the default settings of the recursive partitioning function (RPART) in the freely available R statistical software.6 To avoid cutoffs that are overly sensitive to assay conditions, the optimal cutoffs were confirmed to be robust before they were used to construct the prognostic nomograms. Ki67 was considered positive if there was >15% nuclear staining at any intensity in the core with median staining, and p53 was positive if there was >15% staining in the core with maximum staining. Gelsolin was positive if there was any level of cytoplasmic staining in any core. PTEN was positive if there was >50% cytoplasmic staining in the core with lowest staining. EpCAM was considered positive if there was any staining in any core. CA9 and CA12 were considered positive if there was 100% staining in all cores and the mean percentage staining in the three cores was >80%, respectively. Vimentin was considered positive if the mean of the maximum staining intensity score in the three cores was <2. Below, the use of dichotomized staining scores is discussed in more detail.
| RESULTS |
|---|
|
|
|---|
|
|
Using a similar approach, a prognostic model was constructed using a combination of clinical variables and marker data (clinical/marker model). In a multivariable Cox analysis, when each of the markers were controlled for T stage, Met, ECOG PS, and grade, we found that CA9, vimentin, and p53 were statistically significant predictors independent of the clinical variables. (Table 2
, right). These three markers were combined in a multivariable Cox analysis that included Met, T stage, ECOG PS, and grade. Covariates retained in a Cox step-down analysis (Table 3)
were used to construct a second prognostic model (clinical/marker model).
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
The predictive accuracy of our marker model for RCC was comparable with UISS, which is a model combining standard clinical predictors; and the clinical/marker model was significantly more accurate than UISS. Molecular information may reduce some of the subjectivity involved in current clinical methods for predicting survival. Although models based on gene expression profiles have been described (9 , 38) , the technology for gene expression profiling is costly and not yet widely available. The hope is that protein expression profiling can be more rapidly incorporated into a clinical setting using resources available at many centers. Immunohistochemical staining is already widely used for pathological analysis of breast and colon cancer tissue. However, careful multicenter clinical studies involving standardized marker staining protocols should be used to validate our findings.
Our study shows that accurate models for molecular staging of a solid tumor can be developed using a very limited number of markers. We demonstrate the use of a statistical technique for constructing prognostic models and comparing them with established systems for predicting survival. This approach can be applied to building and evaluating predictive models for other solid tumors. In this study, the various models were ranked based on predictive accuracy as measured by the concordance index. The accuracy of the predictive models was also compared by examining the concordance of the models for combinations of patient pairs. Using this approach, both the clinical/marker model and the marker model were significantly more accurate than histological grade, and the clinical/marker model was significantly more accurate than grade, TNM stage, and UISS.
Overfitting occurs when random patterns in the data are incorporated into a prognostic model as meaningful information, resulting in what appears to be a more accurate model. However, any apparent improvement in model performance due to overfitting is lost when the model is applied to an independent dataset. The complexity of the training dataset used to build the model correlates with the risk of overfitting. In our study, every step of the modeling process was validated and adjusted for overfitting. Therefore, when the various models are ranked according to predictive accuracy or compared to obtain P values, the significance of our results is not overstated. This helps to ensure that our models will perform similarly in an external dataset as it did in our dataset.
When evaluating a marker by immunohistochemistry, it is common to dichotomize the staining data using a single cutoff to determine positive and negative staining. Dichotomizing the staining simplifies the staining analysis and enhances interobserver reproducibility. However, when performing the statistical analysis, the selection of cutoff criterion can introduce bias and result in overfitting. Therefore, we describe a prevalidation technique for each marker that takes into account all measures of staining (such as intensity and percentage of staining), corrects for overfitting, and quantifies the true prognostic information contained in the staining. This prevalidation procedure leads to unbiased P values and C-indices. Only markers that were significant predictors of survival on prevalidation were dichotomized and used to construct nomograms. Finally, to avoid cutoffs that are overly sensitive to the staining conditions, only robust cutoffs were used.
We propose two nomograms that can be used to predict DSS. One nomogram is based on metastasis status and three molecular markers (CA9, p53, and vimentin). The second nomogram combines clinical and molecular variables (Met, T stage, ECOG PS, CA9, p53, and vimentin). By including metastasis status in the nomogram, we found that the nomogram accurately predicts DSS in both patients with localized RCC and those with metastatic RCC. Both nomograms can be used to calculate 2- and 4-year DSS rates as well as median survival. Although these nomograms are useful for visualizing our predictive models, they need to be tested on independent patient populations before being applied to patient care.
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: H. Kim, D. Seligson, S. Horvath, and A. Belldegrun contributed equally to this work.
Requests for reprints: Arie S. Belldegrun, University of California School of Medicine, Department of Urology, 10833 Le Conte Avenue, Room 66-118 CHS, Los Angeles, CA 90095-1738. Phone: (310) 206-1434; Fax: (310) 206-5343; E-mail: Abelldegrun{at}mednet.ucla.edu
Received 3/10/04; revised 5/ 3/04; accepted 5/10/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
V. Mah, D. B. Seligson, A. Li, D. C. Marquez, I. I. Wistuba, Y. Elshimali, M. C. Fishbein, D. Chia, R. J. Pietras, and L. Goodglick Aromatase Expression Predicts Survival in Women with Early-Stage Non Small Cell Lung Cancer Cancer Res., November 1, 2007; 67(21): 10484 - 10490. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Choueiri, B. Rini, J. Garcia, R. Baz, R. Abou-Jawde, S. Thakkar, P Elson, T. Mekhail, M Zhou, and R. Bukowski Prognostic factors associated with long-term survival in previously untreated metastatic renal cell carcinoma Ann. Onc., February 1, 2007; 18(2): 249 - 255. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Lam, A. J. Pantuck, A. S. Belldegrun, and R. A. Figlin Protein Expression Profiles in Renal Cell Carcinoma: Staging, Prognosis, and Patient Selection for Clinical Trials Clin. Cancer Res., January 15, 2007; 13(2): 703s - 708s. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Lam, A. Breda, A. S. Belldegrun, and R. A. Figlin Evolving Principles of Surgical Management and Prognostic Factors for Outcome in Renal Cell Carcinoma J. Clin. Oncol., December 10, 2006; 24(35): 5565 - 5575. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Sourbier, V. Lindner, H. Lang, A. Agouni, E. Schordan, S. Danilin, S. Rothhut, D. Jacqmin, J.-J. Helwig, and T. Massfelder The Phosphoinositide 3-Kinase/Akt Pathway: A New Target in Human Renal Cell Carcinoma Therapy. Cancer Res., May 15, 2006; 66(10): 5130 - 5142. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Cell Growth & Differentiation |