
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
1 Mayo Clinic Cancer Center, Rochester, Minnesota; 2 University of Texas M.D. Anderson Cancer Center, Houston, Texas; and 3 Millennium Pharmaceuticals, Cambridge, Massachusetts
Requests for reprints: Lynn C. Hartmann, Division of Medical Oncology, Mayo Clinic Cancer Center, 200 First Street Southwest, Rochester, MN 55905. Phone: 507-284-3731; Fax: 507-538-0823; E-mail: hartmann.lynn{at}mayo.edu.
| ABSTRACT |
|---|
|
|
|---|
20% achieve long-term disease-free survival. We hypothesized that differences in gene expression before treatment could distinguish patients with short versus long time to recurrence after administration of platinum-paclitaxel combination chemotherapy.
Experimental Design: To test this hypothesis, gene expression profiling of 79 primary surgically resected tumors from women with advanced-stage, high-grade epithelial ovarian cancer was done using cDNA microarrays containing 30,721 genes. Supervised learning algorithms were applied in an effort to develop a binary classifier that could discriminate women at risk for early (
21 months) versus late (>21 months) relapse after initial chemotherapy.
Results: A 14-gene predictive model was developed using a set of training samples (n = 51) and subsequently tested using an independent set of test samples (n = 28). This model correctly predicted the outcome of 24 of the 28 test samples (86% accuracy) with 95% positive predictive value for early relapse.
Conclusions: Predictive markers for early recurrence can be identified for platinum-paclitaxel combination chemotherapy in primary ovarian carcinoma. The proposed 14-gene model requires further validation.
Key Words: ovarian cancer DNA microarray biomarkers pharmacogenomics
| INTRODUCTION |
|---|
|
|
|---|
20%) are cured with the standard combination. If those women who will derive little benefit from standard platinum-paclitaxel could be identified before therapy (i.e., those with early recurrence), then early intervention with alternative approaches, including novel agents targeted to molecular pathways that are aberrantly activated in ovarian cancer (6), could be used in them. Numerous prognostic factors have been studied in advanced ovarian cancer, but none are predictive of outcome after platinum-paclitaxel chemotherapy (710). Recent studies in several malignancies suggest that unique (RNA) gene expression patterns are often associated with prognosis (11). Emerging data in non-Hodgkin's lymphoma, for example, suggest that gene expression profiling can predict disease-free survival and overall survival in patients treated with anthracycline-based combination chemotherapy (12, 13) . The application of whole-genome technologies, such as gene expression profiling, allows one to generate a multivariate predictor based on the measurement of 30,000 (or more) genes in each sample. Through statistical testing and selection of an analytic gene set for the entire sample group, hierarchical clustering can be applied to find previously unrecognized relationships for the purpose of class comparison (14). This has implications for molecular classification (diagnosis) as well as better prognostic classification (good versus poor). However, the generation of a multigene classifier could also be tested as a predictive model for various outcomes, such as response rate, TTR, and overall survival, after drug administration (15).
To test the hypothesis that gene expression profiling of primary surgically resected ovarian cancers (obtained before chemotherapy) could identify a molecular signature that would distinguish women at risk for early relapse after platinum-paclitaxel chemotherapy, we analyzed 79 ovarian cancer specimens using cDNA microarrays corresponding to 30,721 transcripts. Our results identify a 14-gene predictive model that has 86% accuracy when tested on an independent series of patients treated in a similar manner. Interestingly, of the 19 patients that had early relapse in our test set, 18 patients were correctly identified based on the gene expression profile obtained from the primary tumor.
| MATERIALS AND METHODS |
|---|
|
|
|---|
1 cm in greatest diameter. No patient had received chemotherapy before surgery. All patients were treated with paclitaxel plus either carboplatin or cisplatin as initial therapy after surgery. Because most patients were optimally debulked and had no measurable disease at the initiation of chemotherapy, TTR was selected as the primary end point for the classifier. TTR was the time from diagnosis to recurrence and initiation of second-line therapy. The collection of tumor samples and clinical data was done under institutional review boardapproved protocols. For the present study, we studied two sample sets. A training set of 51 samples was used for marker identification. The median TTR in these patients was 21 months. Because 21 months is the median TTR for optimally debulked, advanced-stage ovarian patients treated with carboplatinum-paclitaxel in the recent Gynecologic Oncology Group phase III trial (5), we used 21 months as the cutoff between early and late relapse. A second test set contained 28 samples. For women who had a positive second look laparotomy after completion of their chemotherapy (n = 11 of 79), the date of their second look was not considered their date of recurrence. These women were included in the early relapse group, but TTR was not available for them.
As noted, the patient samples for the training data were selected to carry out a two-group analysis with a 21 month cutoff. As a test of sensitivity of this selection, we carried out analyses using different cut points of TTR, including patients who had very early relapse (<12 months). The results of those analyses showed no gain in prediction accuracy. We also considered TTR as a continuous variable. However, no single marker or set of markers was significantly associated with TTR after correction for multiple comparison artifacts.
cDNA Arrays and Transcriptional Profiling. RNA was isolated from primary ovarian tumors using the RNeasy kit (Qiagen, Valencia, CA) and quality controlled as described (17). Probes for profiling were generated by reverse transcribing RNA with SuperScript II (Invitrogen, Carlsbad, CA) into cDNA in the presence of [
-33P]dCTP (100 mCi/mL, Amersham, Little Chalfont, United Kingdom) from 2 µg total RNA. The cDNA probes were hybridized (without further amplification) to high-density nylon cDNA arrays as described (18). The arrays used in this study contained 30,721 human sequence clones corresponding to 21,594 unique UniGene clusters. Array hybridizations were done in duplicate and were highly reproducible as reflected by the average coefficient of variation (SD/mean) of 0.2.
Expression results were normalized to the median of all genes in each array. After normalization, expression data were transformed using the expression log (expression + 1) and used in all subsequent analyses.
Data Analysis. The strategy used for data analysis was similar to the one described in our earlier report on building a predictive model for neoadjuvant chemotherapy in breast cancer (15). A multiple random sampling methodology was used to determine the best model (i.e., the model with the lowest prediction error rate based on the 51 training samples). A simpler approach was considered initially that uses all the data for model selection, which is then tested on an independent validation data set. However, simulation studies (data not shown) showed superior performance of the multiple random selection scheme for model selection in data sets with relatively low signal-to-noise levels such as reported in this study. For the random sampling method, a 70% training fraction was sampled, blocking on outcome, debulking status, histology, and tumor stage, from the 51 training samples (see Fig. 1). Signal-to-noise ratio (SNR) [absolute (µlate µearly) / (
late +
early) where µ and
represent the mean and SD of expression for each class, respectively] was calculated for each potential marker. The top SNR markers, up to 500, were then used in a support vector machine (SVM) using a second-order polynomial for classification (19). This process was repeated 100 times resulting in 100 possible models. Each of the 100 models was then refit using all 51 training samples and studied against the 28 test samples. The model with the fewest prediction errors was selected as the best model.
|
Kaplan-Meier curves, stratified by predicted class from the selected model, were constructed based on TTR or follow-up time. Statistically significant differences between predicted class strata were assessed using the log-rank test. To assess general patterns in the data, average linkage clustering was done with CLUSTER and TREEVIEW software (20). Expression data were log (expression + 1) transformed, features were median centered and a Pearson correlation was used.
| RESULTS |
|---|
|
|
|---|
|
Gene Expression Profiling. Individual features ("genes") whose expression correlated with a given sample belonging to one class or the other were ranked using the SNR metric. We then developed a SVM algorithm using the SNR features and built a binary model to predict recurrence (early versus late recurrence defined by TTR <21 versus >21 months). With the SVM algorithm, we identified a top model that contained 14 markers. Each gene with annotation is listed in Table 2 and box plots of gene expression by early versus late recurrence are displayed in Fig. 2. In 4-fold cross-validation studies, the model correctly classified 40 of the 51 training samples (78% accuracy; P < 0.001; see Fig. 3B).
|
|
|
|
Subset Analyses. We applied the top 14-gene model to the following sample subsets: stage III disease patients, optimally debulked patients, and, finally, patients with serous histology tumors. There was no significant improvement in the performance of the model in these patient subgroups (data not shown). Moreover, we generated a new model using only the 37 optimally debulked training samples, which was then tested in the 25 optimally debulked test samples using cross-validation. The best performing SVM-derived model had an overall accuracy (84%) similar to the top model constructed using all the 51 training set patient samples.
| DISCUSSION |
|---|
|
|
|---|
20% remain free of recurrence at 5 years (5). Using gene expression profiling, we sought to develop a predictive model that would identify women with ovarian carcinoma destined for early relapse if treated with platinum-paclitaxel chemotherapy. If such individuals could be identified reliably at diagnosis, then investigational agents, or alternative strategies with existing chemotherapy regimens, could be offered as initial therapy.
The 14-gene model developed in the present study accurately predicted early or late recurrence in 86% of patients in the test set (P < 0.05). Significantly, the positive predictive value of the proposed model is 95% (of the 19 patients who tested positive, 18 relapsed at
21 months). To our knowledge, this study represents the first multigene predictive model for the platinum-paclitaxel regimen in ovarian cancer.
The present study differs from previous reports in which gene expression has been profiled in ovarian cancer. We have previously used gene expression profiling with cDNA microarrays in an attempt to elucidate early events in ovarian carcinogenesis and to distinguish early-stage from late-stage disease (22). Interestingly, none of the 14 genes in the current model were listed in our earlier report. Several other groups have employed gene expression profiling to analyze epithelial ovarian carcinoma. Collectively, these reports describe differences in gene expression between normal ovarian epithelium and epithelial ovarian cancers and among stages, grades, histologic subtypes, and BRCA1/2 status of ovarian cancers (2332). Two groups have also previously correlated gene expression profiles with patient outcomes in ovarian cancer. Lancaster et al. recently analyzed 31 advanced-stage serous cancers using oligonucleotide arrays with unsupervised learning approaches (33). Fourteen of the patients survived >7 years and 17 died within 2 years after initial diagnosis. When hierarchical clustering was done using 2,655 genes that were differentially expressed among the 31 tumor samples, a cluster of 43 genes seemed to distinguish the long-term (>84 months) from the short-term (<24 months) survivors. It is important to note, however, that there were significant differences between debulking status (78% optimally debulked in the good outcome group versus 29% optimally debulked in the poor outcome group) that could have affected the results of this earlier study. Moreover, formal class prediction by supervised learning approaches was not reported. In the second report, methylation microarray analysis was done on 18 stage III to IV epithelial ovarian cancers in an attempt to identify markers for prognosis (34). Notably, five of the samples were obtained at relapse, and progression-free survival data were not available for all the patients. Nevertheless, hierarchical clustering revealed that the group with high levels of tumor methylation had an inferior outcome.
In the present study, there were high rates of optimal debulking surgery in both outcome groups (Table 1), all samples were obtained before chemotherapy, and outcome data were available for all patients. Using this sample set, supervised learning approaches identified a set of 14 differentially expressed genes that distinguish patients based on TTR (Table 2).
Interestingly, the list of 14 differentially expressed genes does not include polypeptides such as ATP binding cassette transporters, metallothionein, DNA repair components, or apoptotic regulators (e.g., Bcl-2 family members), which have been implicated in drug resistance based on studies in model systems. In this regard, the present results parallel those of Holleman et al., who identified a gene expression profile in pediatric acute lymphoblastic leukemia that correlated with drug sensitivity and treatment outcome but also contained a paucity of transcripts commonly implicated in resistance (35). Nonetheless, several markers contained within the 14-gene model identified in our study are plausible determinants of treatment outcome in ovarian carcinoma. Protein kinase C
has been implicated previously as a poor prognostic factor in ovarian carcinoma (36, 37) and its overexpression in the present study is observed in tumors that recur early after initial therapy (Table 2). Protein kinase C family members are known to affect diverse signaling pathways that could affect not only cell cycle progression and differentiation (38) but also response to immune system death signals (39). Another of the top markers, inhibitor of DNA binding 4, negatively regulates the expression of BRCA1 (40, 41). Interestingly, two major studies have shown longer TTR (42) and overall survival (43) in BRCA-linked cancers possibly because of enhanced sensitivity of the tumor cells to DNA damaging agents. In our study, elevated expression of inhibitor of DNA binding 4, which would be expected to down-regulate BRCA1, also correlated with longer TTR. In addition, FLJ20241 showing higher expression in early recurrence in our study, has been identified as a nuclear factor-
B-activating protein in a large-scale experimental study (44). Interestingly, Mabuchi et al. (45) recently linked nuclear factor-
B inhibition to an increase in cisplatin efficacy in in vitro and in vivo ovarian cancer models. Together, our data and these findings suggest that higher FLJ20241expression would lead to nuclear factor-
B activation and consequently decreased cisplatin efficacy, resulting in early recurrence of ovarian cancer after therapy. The exact role of the remaining genes in the biology of ovarian cancer requires further investigation.
In summary, application of gene expression profiling with supervised learning algorithms to a set of ovarian cancers obtained before chemotherapy allowed development of a 14-gene profile that identified patients at risk for early recurrence after combination chemotherapy. The current model predicted outcome in an independent set of patients (n = 28) with an overall accuracy of 86%. These observations need to be validated independently with other sample sets obtained from women with ovarian cancer given platinum-paclitaxel as initial therapy. A second validation attempt could be made using primary ovarian tumors from women treated with other platinum-based regimens to determine the specificity of the 14-gene model. If future studies validate this model, stratification based on pretreatment gene expression would be feasible in patients with ovarian cancer, thereby allowing women who are destined for early relapse to receive alternative primary treatment strategies.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Received 8/18/04; revised 12/15/04; accepted 12/28/04.
| REFERENCES |
|---|
|
|
|---|
B and MAPK signaling pathways. Oncogene 2003;22:330718.[CrossRef][Medline]
B increases the efficacy of cisplatin in in vitro and in vivo ovarian cancer models. J Biol Chem 2004;279:2347785.This article has been cited by other articles:
![]() |
A. Berchuck, E. S. Iversen, J. Luo, J. P. Clarke, H. Horne, D. A. Levine, J. Boyd, M. A. Alonso, A. A. Secord, M. Q. Bernardini, et al. Microarray Analysis of Early Stage Serous Ovarian Cancers Shows Profiles Predictive of Favorable Outcome Clin. Cancer Res., April 1, 2009; 15(7): 2448 - 2455. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. W. Tothill, A. V. Tinker, J. George, R. Brown, S. B. Fox, S. Lade, D. S. Johnson, M. K. Trivett, D. Etemadmoghadam, B. Locandro, et al. Novel Molecular Subtypes of Serous and Endometrioid Ovarian Cancer Linked to Clinical Outcome Clin. Cancer Res., August 15, 2008; 14(16): 5198 - 5208. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Mezzanzanica, M. Fabbi, M. Bagnoli, S. Staurengo, M. Losa, E. Balladore, P. Alberti, L. Lusa, A. Ditto, S. Ferrini, et al. Subcellular Localization of Activated Leukocyte Cell Adhesion Molecule Is a Molecular Predictor of Survival in Ovarian Carcinoma Patients Clin. Cancer Res., March 15, 2008; 14(6): 1726 - 1733. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. N. Fehrmann, X.-y. Li, A. G. J. van der Zee, S. de Jong, G. J. te Meerman, E. G. E. de Vries, and A. P. G. Crijns Profiling Studies in Ovarian Cancer: A Review Oncologist, August 1, 2007; 12(8): 960 - 966. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Aletti, M. M. Gallenberg, W. A. Cliby, A. Jatoi, and L. C. Hartmann Current Management Strategies for Ovarian Cancer Mayo Clin. Proc., June 1, 2007; 82(6): 751 - 770. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Komatsu, K. Hiyama, K. Tanimoto, M. Yunokawa, K. Otani, M. Ohtaki, E. Hiyama, J. Kigawa, M. Ohwada, M. Suzuki, et al. Prediction of individual response to platinum/paclitaxel combination using novel marker genes in ovarian cancers. Mol. Cancer Ther., March 1, 2006; 5(3): 767 - 775. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Spentzos, D. A. Levine, S. Kolia, H. Otu, J. Boyd, T. A. Libermann, and S. A. Cannistra Unique Gene Expression Profile Based on Pathologic Response in Epithelial Ovarian Cancer J. Clin. Oncol., November 1, 2005; 23(31): 7911 - 7918. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. De Smet, N. L.M.M. Pochet, B. L.R. De Moor, T. Van Gorp, D. Timmerman, I. B. Vergote, L. C. Hartmann, A. I. Damokosh, and S. Hoersch Independent Test Set Performance in the Prediction of Early Relapse in Ovarian Cancer with Gene Expression Profiles Clin. Cancer Res., November 1, 2005; 11(21): 7958 - 7959. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |