| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
1 Molecular Profiling and Biomarker Discovery, 2 Clinical Biostatistics, and 3 Bioinformatics, Wyeth Research, Cambridge, Massachusetts; 4 Clinical Research and Development, Wyeth Research, Collegeville, Pennsylvania; 5 University of Texas Health Science Center, San Antonio, Texas; 6 University of Chicago, Chicago, Illinois; 7 Indiana University, Indianapolis, Indiana; 8 Our Lady of Mercy Medical Center, New York Medical College, Bronx, New York; 9 Fox Chase Cancer Center, Philadelphia, Pennsylvania; 10 Millenium Pharmaceuticals, Cambridge, Massachusetts; and 11 Biometrics Research, Wyeth Research, Pearl River, New York
Requests for reprints: Michael E. Burczynski, Wyeth Research, 1 Burtt Road, Andover, MA 01810. Phone: 978-247-1156; Fax: 978-247-1133; Email: mburczynski{at}wyeth.com.
| ABSTRACT |
|---|
|
|
|---|
Experimental Design: In this study, we evaluated the association of expression profiles in PBMCs with clinical outcomes in patients with advanced renal cell cancer. Transcriptional patterns in PBMCs of 45 renal cell cancer patients were compared with clinical outcome data at the conclusion of a phase II study of the mTOR kinase inhibitor CCI-779 to determine whether pretreatment transcriptional patterns in PBMCs were correlated with eventual patient outcomes.
Results: Unsupervised hierarchical clustering of the PBMC profiles using all expressed genes identified clusters of patients with significant differences in survival. Cox proportional hazards modeling showed that the expression levels of many PBMC transcripts were predictors for the patient outcomes of time to progression and overall survival (time to death). Supervised class prediction approaches identified multivariate expression patterns in PBMCs capable of assigning favorable outcomes of time to death and time to progression in a test set of renal cancer patients, with overall performance accuracies of 72% and 85%, respectively.
Conclusions: The present study provides the first example of gene expression profiling in peripheral blood, a clinically accessible surrogate tissue, for identifying patterns of gene expression associated with higher likelihoods of positive outcome in patients with a solid tumor.
Key Words: CCI-779 gene expression profiling peripheral blood profiling pharmacogenomics surrogate biomarkers
| INTRODUCTION |
|---|
|
|
|---|
Because of their greater accessibility, expression profiles in surrogate tissues, such as peripheral blood mononuclear cells (PBMC), are also of interest for determining whether expression patterns may predict clinical outcomes in cancer patients. We have reported recently that baseline expression profiles of PBMC from renal cell cancer (RCC) patients are significantly distinct from those of disease-free subjects (9). Many of the expressed transcripts were nonetheless highly variable and heterogeneously expressed across the RCC PBMCs, suggesting that subsets of patients with distinct transcriptional profiles may exist in this disease setting.
Following the conclusion of the present clinical trial (10), we compared expression patterns in PBMCs of these RCC patients with various clinical variables to determine whether expression patterns in circulating mononuclear cells were correlated with eventual patient outcomes in this study. The results of both unsupervised and supervised analyses suggest that transcriptional profiles in PBMCs from patients with advanced RCC are not only distinct from profiles in disease-free individuals but also significantly correlated with overall survival and progression-free survival in this disease setting.
| PATIENTS AND METHODS |
|---|
|
|
|---|
|
Clinical Procedures. Patients with advanced cases of RCC were randomized to receive treatment with one of three doses of CCI-779 (25, 75, and 250 mg) administered as a 30-minute i.v. infusion once weekly for the duration of the trial. Clinical staging and extent of residual, recurrent, or metastatic disease were recorded before treatment and every 8 weeks following initiation of CCI-779 therapy. Tumor size was measured (in cm) and reported as the product of the longest diameter and its perpendicular. Measurable disease was defined as any bidimensionally measurable lesion with both diameters >1.0 cm by computed tomography scan, X-ray, or palpation. Tumor responses (complete response, partial response, minor response, stable disease, or progressive disease) were determined by the sum of the products of the perpendicular diameters of all measurable lesions, with progression defined as a 25% increase in tumor size over baseline or nadir or the presence of new lesions. The two main clinical outcome measures used in the pharmacogenomic analysis were time to progression (TTP) and survival or time to death (TTD). TTP was defined as the interval from the date of initial CCI-779 treatment until the first day of measurement of progressive disease or censored at the last date known as progression free. Survival or TTD was defined as the interval from date of initial CCI-779 treatment to the time of death or censored at the last date known alive. In this clinical trial, three doses of CCI-779 were evaluated (25, 75, and 250 mg), but tumor response rates and median survival times were comparable among the three dose groups (10). For these reasons, dose level was not considered in subsequent pharmacogenomic analyses.
Peripheral Blood Mononuclear Cell Preparation, Isolation of RNA, and Hybridization of Targets to Microarrays. Before initiation of therapy, peripheral blood samples (8 mL) were collected into Vacutainer sodium citrate cell purification tubes and PBMCs were isolated according to the manufacturer's protocol (Becton Dickinson, Franklin Lakes, NJ). All blood samples were shipped in cell purification tubes overnight before PBMC processing. Total RNA was isolated from PBMC pellets using the RNeasy mini kit (Qiagen, Valencia, CA) and biotinylated cRNA was prepared using a modification of the procedure described by Lockhart et al. (12). Labeled probes were hybridized to oligonucleotide arrays composed of >12,600 human sequences (HgU95A, Affymetrix, Santa Clara, CA) according to the Affymetrix Expression Analysis Technical Manual (Affymetrix).
Gene Expression Data Reduction. Data analysis and absent/present call determination were done on raw fluorescent intensity values using Genechip 3.2 software (Affymetrix). "Present" calls were calculated using Genechip 3.2 software by estimating whether a transcript is detected in a sample based on the strength of the signal of the gene compared with background. The "average difference" values for each transcript were normalized to "frequency" values using the scaled frequency normalization method (13) in which the average differences for 11 control cRNAs with known abundance spiked into each hybridization solution were used to generate a global calibration curve. This calibration was then used to convert average difference values for all transcripts to frequency estimates (in ppm) ranging from 1:300,000 (
3 ppm) to 1:1,000 (1,000 ppm).
Statistical Analyses. Cox proportional hazards models, which account for censoring in time-to-event outcomes, were used to evaluate the relationships between expression and TTP and between expression and TTD. The models were calculated using log-transformed expression data, and separate models were fit for each transcript. Survival data were assessed by Kaplan Meier analysis, and statistical significance was established using a Wilcoxon test.
Unsupervised hierarchical clustering of genes and/or arrays based on similarity of their expression profiles was done using the procedure of Eisen et al. (14). In these analyses, 5,424 transcripts meeting a nonstringent data reduction filter were used (at least one present call and at least one frequency >10 ppm). Expression data were log transformed and standardized to have a mean value of 0 and a variance of 1, and hierarchical clustering results were generated using average linkage clustering with an uncentered correlation similarity metric.
Gene selection and supervised class prediction were done using GeneCluster version 2.0, which has been described previously (15) and is available from http://www.genome.wi.mit.edu/cancer/software/genecluster2.html. In these analyses, only the 4,022 transcripts meeting a more stringent data reduction filter were used (at least 25% present calls and an average frequency >5 ppm). This more stringent filter was used to avoid including low-level and unreliably detected transcripts in the predictive models. For gene selection, all expression data in training sets and test sets were log transformed before analysis. In training sets of data, models containing increasing numbers of features (transcript sequences) were built using a two-sided approach (equal numbers of features in each class) with a S2N similarity metric that used median values for the class estimate. All comparisons were binary, and predictive gene classifiers containing between 2 and 60 genes in steps of 2 (and 60-200 genes in steps of 10) were evaluated by leave-one-out cross-validation (LOOCV) to identify the smallest predictive model yielding the most accurate class assignments. Prediction of class membership was done using a k nearest-neighbor algorithm also in GeneCluster (16). In these predictions, the number of neighbors was set to k = 3, the cosine distance measure was used, and all k neighbors were given equal weights.
| RESULTS |
|---|
|
|
|---|
|
Identification of Pretreatment Transcript Levels Associated with Patient Outcome. To identify specific transcripts in PBMCs that were correlated with patient outcome, we employed a Cox proportional hazards regression to model outcome as a function of log2-transformed expression levels (in ppm). Cox regression analyses were done on two clinical outcome measures (TTD and TTP) for each of the 5,424 qualifiers that passed the initial filtering criteria (at least one "present" call across the data set and at least one transcript with a frequency of
10 ppm). Of the 45 RCC patients with baseline PBMC expression levels, 10 had censored data for TTD and 4 had censored data for TTP. In the Cox proportional hazards analysis, the risk coefficient associated with each transcript indicates the likelihood of a favorable or nonfavorable outcome, where a risk coefficient <1.0 indicates less risk and a risk coefficient >1.0 indicates higher risk.
For each transcript and outcome measure, risk coefficients were calculated and the P for the hypothesis that the risk coefficient was equal to 1 (i.e., no risk) was calculated. The number of tests that were nominally significant out of the 5,424 tests done for each outcome measure was calculated for five type I (i.e., false-positive) error levels. To adjust for the fact that the 5,424 tests were not independent, a permutation-based approach was then employed to evaluate how often the observed number of significant tests would be found under the null hypothesis of no risk.
The Cox proportional hazards regressions identified transcripts significantly correlated with progression and survival. Permutation analyses confirmed that more genes had statistically significant correlations of gene expression with survival than had significant correlations with disease progression. The 20 genes in PBMCs whose transcript levels possessed a minimal level of significance (P < 0.05) and were most correlated with low risk (risk coefficient <1.0) or high risk (risk coefficient >1.0) for survival (Table 2) or disease progression (Table 3) are presented.
|
|
To evaluate the predictive utility of the profiles correlated with clinical outcomes, we randomly selected 70% of the patient PBMC profiles as a training set, and the remaining 30% of the samples formed the test set. This strategy allows the test set of samples to function similarly to a set of future samples on which the classifications of interest could be predicted. The main benefit of this approach is that it ensures that the test samples are not used in gene selection and it therefore allows a truly independent evaluation of the predictive model discovered in the training set (17,18).
For each outcome measure, we stratified the profiles as originating from patients with poor or favorable outcomes. In this process, we attempted to discover models that could predict either (a) long-term survival or (b) rapid times to disease progression. We established yearlong survival as a favorable outcome for overall survival because this approximated the median survival across all three dose groups for the 45 patients in the pharmacogenomic portion of the trial. We established 106 days as nonfavorable outcome for TTP, because this represented the lower quartile value of disease-free survival for the 45 patients in the pharmacogenomic portion of the trial.
Because it was possible that the observed differences in expression between PBMCs from patients classified into the good and poor outcome categories might be confounded by other differences, such as patient demographics or technical variables, we first compared these characteristics between the poor outcome and the good outcome patient groups defined based on TTD or TTP (Supplementary Data). Groups were tested for differences in continuous variables using a Student's t test or for categorical differences using a likelihood test. Comparison of technical chip variables (raw Q, glyceraldehyde-3-phosphate dehydrogenase 3':5' ratio, scale factors, average frequency, and present calls), demographics (gender, age, and ethnicity), or clinical variables (histologic tumor types, previous surgeries, previous nephrectomies, numbers of metastatic sites, and dose levels) indicated no significant differences between patients in the good and poor outcome categories that might contribute to observed differences in PBMC gene expression. Prognosis by the Motzer-based risk assessment was significantly associated with the groups in the survival comparison as expected but was not significantly associated with the groups in the TTP comparison.
Because these studies used PBMCs as the tissue of interest, we also examined the distribution of cell types (neutrophils, eosinophils, lymphocytes, and monocytes) in the samples of the various groups to determine whether differences in cell populations might be responsible for any observed differences in expression. This analysis showed that the distributions of the various cell subtypes between PBMCs of patients assigned to either good or poor outcome categories for survival and TTP were not significantly different (Supplementary Data). This supports the hypothesis that any observed transcriptional differences between the groups were not the result of altered cell compositions but seem to reflect distinct expression patterns in the PBMCs.
In subsequent analyses, we used a nearest-neighbor prediction algorithm to generate gene classifiers correlated with the groups in the training sets and selected the classifiers that gave the highest accuracy of class assignment by LOOCV. The results of these analyses are depicted for classification based on yearlong survival in Fig. 2 and for classification based on short times to disease progression in Fig. 3. The 20-gene classifier in PBMCs that gave the highest accuracy of class assignment by LOOCV (73%) in the TTD comparison and the 30-gene classifier in PBMCs that displayed the highest accuracy for class assignment by LOOCV in the TTP analysis (74%) are provided in Supplementary Data.
|
|
| DISCUSSION |
|---|
|
|
|---|
Several supervised approaches further strengthened the hypothesis that transcriptional levels of select genes in PBMC profiles of RCC patients are significantly correlated with the clinical outcomes of disease progression and overall survival. Parametric (Cox proportional hazards modeling) univariate analyses identified individual transcripts in PBMCs that were significantly correlated with both disease progression and survival. GeneCluster-based gene selection methods also identified multigene signatures in PBMCs that were predictive of progression and survival.
It should be noted that we did not limit the present analyses to the RCC disease-associated transcripts reported previously (9) and in so doing found that the transcripts most strongly correlated with disease outcome in RCC patients were not necessarily the transcripts most strongly associated with the presence of disease in the comparison of RCC PBMCs and normal volunteers. It seems that some of the differences between transcriptional profiles in PBMCs of RCC patients and healthy subject are driven by differences in cell populations; yet, cell populations were not significantly distinct in poor and favorable outcome groups in the present analysis. This may be one of several plausible reasons that those transcriptional differences in PBMCs that define the presence of RCC relative to the healthy condition are not necessarily those that are most significantly correlated with clinical outcomes within the population of RCC patients evaluated in the present study.
The overall accuracy of the predictive models for TTP and overall survival on test sets of patients was encouraging (85% and 72%, respectively), and overall accuracies in both training set cross-validation and test-set predictions were similar. The model based on TTP displayed higher accuracy mainly because the model based on yearlong survival possessed a higher false-positive rate, because several of the shorter-term survivors were incorrectly classified as long-term survivors. Nonetheless, the gene classifier for yearlong survival exhibited high sensitivity for the patients tested (i.e., correctly assigned the majority of yearlong survivors).
The PBMC-based gene classifier was able to classify short versus long TTP with relatively high accuracy, but the Motzer risk assessment scores, although very well correlated with TTD, were not significantly distinct for patients who exhibited short-term and long-term times to disease progression as defined in this study (see Suplementary Data). This is not surprising because the Motzer risk assessment was developed for overall survival (11). As additional cytostatic cancer therapies are developed, TTP will continue to be an important diagnostic end point in oncology trials. The finding that the PBMC profiles of patients in this study seem prognostic of short versus long TTP provides additional support for continued evaluations of PBMC transcriptional profiles in the context of clinical outcomes in patients with other solid tumors. The results from supervised classification analyses in the present study suggest that it may be possible to use transcriptional profiles in the surrogate tissue of peripheral blood to identify cancer patients with greater chances for long or short times to progression and/or long or short survival.
The results further imply that the circulating mononuclear cells of peripheral blood may serve as a sensitive monitor of the organism's physiologic state. As these cells pass through various tissues, their reaction to the microenvironment is captured in a complex transcriptional response measured through profiling. Surprisingly, such patterns not only seem to be diagnostic of disease state (e.g., RCC) but also may reflect differential responses to variations in clinically indistinguishable disease states (e.g., advanced RCC with different degrees of aggressiveness). This suggests that the PBMCs, due to their transit through the body, may serve as an accessible surrogate monitor of tissues and systems that are not easily obtained by routine biopsies. These changes may be reflective of an ongoing physiologic host response to the tumor, such as an immune or inflammatory response. Concerning this possibility, the elevated expression of immune-associated genes in PBMCs from patients in the good prognosis category is of particular interest.
The functional categories of transcripts in PBMCs associated with low or high risk displayed several interesting trends. First, transcripts elevated in PBMCs of patients with shorter TTP or survival include those involved in cytoskeletal organization/cell motility, associated small GTPases, general pathways of proteasome-dependent catabolism, and general pathways of metabolism. In contrast, transcripts elevated in PBMCs of patients with longer TTP or survival included those involved in mRNA transport, mRNA processing/splicing, and ribosomal protein subunits. Different eukaryotic translation initiation factor isoforms were elevated in patients with poor or favorable outcomes. Because the drug evaluated in this study is a well-characterized inhibitor of eukaryotic translation (19, 20), it is tempting to speculate that elevated transcripts levels of certain eukaryotic translation initiation factor isoforms in PBMCs may represent potential biomarkers of poor or favorable response to treatment with CCI-779 but this is currently unproven.
It is important to note that the present study cannot distinguish whether the profiles in PBMCs discovered here are simply prognostic of outcome in these patients regardless of therapy or whether they are specific to impending treatment with CCI-779. In the absence of a placebo or active control arm, we were unable to determine whether there are patterns in pretreatment PBMCs that are predictive of clinical outcome only in the context of CCI-779 therapy. To address this, we are currently collecting whole blood samples at baseline before patient entry into a phase III oncology trial of CCI-779 in RCC, which includes several different treatment arms. Surrogate tissue analysis in the phase III trial will enable the discrimination between pretreatment transcriptional profiles that are specific to the therapies in question from those that are simply prognostic of disease outcome regardless of therapy. Analysis of the larger phase III trial of CCI-779 in a completely independent set of RCC patients currently under way will allow validation and/or further refinement of the current predictive models identified in the present studies. The findings reported in the present phase II trial support the continued evaluation of surrogate tissue expression profiles to enhance the prediction of clinical outcomes in cancer populations.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Note: Supplementary data for this are available at Clinical Cancer Research Online (http://clincancerres.aacrjournals.org).
Received 7/30/04; revised 10/18/04; accepted 10/26/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
O. Galamb, F. Sipos, N. Solymosi, S. Spisak, T. Krenacs, K. Toth, Z. Tulassay, and B. Molnar Diagnostic mRNA Expression Patterns of Inflamed, Benign, and Malignant Colorectal Biopsy Specimen and their Correlation with Peripheral Blood Results Cancer Epidemiol. Biomarkers Prev., October 1, 2008; 17(10): 2835 - 2845. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Woelfelschneider, O. Popanda, C. Lilla, J. Linseisen, C. Mayer, O. Celebi, J. Debus, H. Bartsch, J. Chang-Claude, and P. Schmezer A distinct ERCC1 haplotype is associated with mRNA expression levels in prostate cancer patients Carcinogenesis, September 1, 2008; 29(9): 1758 - 1764. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bouwens, L. A Afman, and M. Muller Fasting induces changes in peripheral blood mononuclear cell gene expression profiles related to increases in fatty acid {beta}-oxidation: functional role of peroxisome proliferator activated receptor {alpha} in human peripheral blood mononuclear cells Am. J. Clinical Nutrition, November 1, 2007; 86(5): 1515 - 1523. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Learn, P. E. Fecci, R. J. Schmittling, W. Xie, I. Karikari, D. A. Mitchell, G. E. Archer, Z. Wei, H. Dressman, and J. H. Sampson Profiling of CD4+, CD8+, and CD4+CD25+CD45RO+FoxP3+ T Cells in Patients with Malignant Glioma Reveals Differential Expression of the Immunologic Transcriptome Compared with T Cells from Healthy Volunteers Clin. Cancer Res., December 15, 2006; 12(24): 7306 - 7315. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. O'Toole, D. B. Janszen, D. K. Slonim, P. S. Reddy, D. K. Ellis, H. M. Legault, A. A. Hill, M. Z. Whitley, W. M. Mounts, K. Zuberek, et al. Risk Factors Associated With {beta}-Amyloid(1-42) Immunotherapy in Preimmunization Gene Expression Patterns of Blood Cells Arch Neurol, October 1, 2005; 62(10): 1531 - 1536. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |