
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Human Cancer Biology |
Authors' Affiliations: Departments of 1 Pathology and Immunology, 2 Biostatistics, 3 Surgery, and 4 Medicine, Washington University School of Medicine and 5 John Cochran Veterans Administration Hospital, St. Louis, Missouri
Requests for reprints: Rebecca Aft, Department of Surgery, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8109, St. Louis, MO 63110. Phone: 314-747-0063; Fax: 314-454-5509; E-mail: aftr{at}wustl.edu.
| Abstract |
|---|
|
|
|---|
Experimental Design: Enrichment of TACSTD1 (EpCAM)–expressing cells from bone marrow of breast cancer patients was achieved using immunomagnetic beads. Gene expression profiles were compared between enriched cell populations and whole bone marrow from 5 normal volunteers and 23 breast cancer patients after neoadjuvant chemotherapy treatment. Enriched cells from bone marrow samples of breast cancer patients before treatment or at 1 year follow-up were also analyzed (total of 87 data sets). The expression of transcripts specifically detected in enriched cell populations from breast cancer patients was correlated with 1-year clinical outcome using quantitative reverse transcription-PCR in an independent cohort of bone marrow samples.
Results: Analysis of EpCAM-enriched bone marrow cells revealed specific expression of a subgroup of transcripts, including the metastasis regulator, TWIST1. Most transcripts identified, including TWIST1, were not expressed in enriched populations of bone marrow from normal volunteers, suggesting that this expression profile reflects a signature of breast cancer bone marrow micrometastases that persist after chemotherapy. In an independent set of bone marrow samples obtained before any treatment, TWIST1 expression correlated with early disease relapse.
Conclusions: Disseminated breast cancer cells present in bone marrow after chemotherapy possess unique transcriptional signatures. Genes whose expression is overrepresented in these cell populations, such as TWIST1, may prove to be excellent markers of early distant relapse in breast cancer patients.
Several clinical studies have shown that disseminated tumor cells (DTC) in the bone marrow of breast cancer patients are an independent predictor of metastatic disease development and overall poor prognosis (3). The proposed biological basis for this clinical observation is that bone marrow serves as an important reservoir that allows DTCs to adapt and disseminate to other organs (4). DTCs can persist for years and remain a predictor for disease recurrence (5). However, over 60% of patients with detectable DTCs at diagnosis remain relapse-free after a median follow-up of 6 years (6), indicating that not all DTCs have the potential to form metastatic disease. Those patients with detectable DTCs in their bone marrow after chemotherapy have a particularly poor prognosis, indicating that chemotherapy does not eliminate all DTCs and that those subpopulations of DTCs that survive cytotoxic chemotherapy have a high metastatic potential (7). The heterogeneity of DTCs has been confirmed by differential gene and protein expression studies (8), and this heterogeneity is reduced after adjuvant treatment (9).
DTCs are rare; thus, some efficient enrichment scheme is necessary to characterize these cells at the molecular level. Many enrichment schemes rely on positive immunomagnetic cell separation with antibodies to the TACSTD1 antigen (EpCAM), because of its frequent and relatively specific expression on epithelial cells (10, 11). Approximately 68% (range 48-100%) of cytokeratin-positive bone marrow DTCs are EpCAM positive (12). Furthermore, EpCAM overexpression in breast cancers is frequently associated with poor prognosis and metastatic disease development (13). To date, EpCAM seems to be the single best target for immunoselection of micrometastatic cells (11).
In the present study, we hypothesized that gene expression profiling of DTCs isolated from bone marrow of breast cancer patients after treatment with chemotherapy would enrich for chemotherapy persistent DTCs with high metastatic potential and identify novel, biologically relevant gene transcripts associated with early tumor relapse. We have analyzed bone marrow from 23 patients with clinical stage II/III breast cancer who received four cycles of neoadjuvant epirubicin/Taxotere chemotherapy. A set of expressed transcripts in enriched DTC populations, but not present in similarly processed bone marrow from normal volunteers, was identified. Expression of a subset of these transcripts was then correlated with either the presence or absence of disease recurrence within 1 year in an independent set of identically treated patients. One of the transcripts, TWIST1, is a known marker of tumor metastases, suggesting that this approach will yield other potentially useful markers that predict the development of metastatic disease in breast cancer patients.
| Materials and Methods |
|---|
|
|
|---|
|
5 x 107 nucleated cells were incubated with EpCAM immunomagnetic beads (DAKO) for 1 h at 4°C with shaking. The magnetic isolation was repeated five times to optimize capture of cells. Immunomagnetic beads were subsequently washed with cold PBS/1% FCS, as previously reported (11). For some experiments, the remaining effluent was incubated with one of six biotinylated antibodies (CXCR4, osteopontin, SCA-1, CD44, ABCG-2, EMMPRIN), which had been coupled to avidin magnetic beads. The immunomagnetic beads were isolated using a magnetic particle concentrator (Dynal MPC). Captured cells were then snap-frozen for RNA isolation. For cytokeratin staining and qRT-PCR experiments, whole bone marrow was enriched for mononucleated cells using a standard Ficoll-Hypaque gradient technique. RNA isolation and gene expression profiling. Microarray expression profile analysis was initially done on 79 bone marrow samples from 25 patients enrolled on the trial. The characteristics of these patients are summarized in Supplementary Table S1. Total RNA was isolated from whole bone marrow or immunoselected bone marrow cells using TRIzol reagent (Invitrogen). For immunoselected cell samples, the method was scaled down by a factor of 10, and yeast tRNA was added to the cell lysate as a coprecipitant. RNA from total bone marrow samples, processed in parallel with immunoselected samples, was quantified and qualitatively assessed using an Agilent Bioanalyzer and RNA NanoChip assay. Because RNA yields from immunoselected cells were expectedly low, these RNAs were not qualitatively assessed.
Total bone marrow RNA was diluted to 50 ng/µL, and 1 µL was used for two-cycle biotinylated cRNA target synthesis (Affymetrix). The entire RNA yield from immunoselected cell populations was concentrated to 3 µL and similarly used. Biotinylated cRNA targets were synthesized using the Affymetrix two-cycle target preparation protocol and reagents as supplied by the manufacturer (Affymetrix). Resulting biotinylated cRNA was quantified and samples that yielded >15 µg of cRNA were used for GeneChip microarray hybridization. In 17 cases, RNAs from immunoselected cell populations failed to produce labeled cRNA target, presumably due to the absence of immunocaptured cells and corresponding RNA in the sample.
Fragmented, biotinylated cRNAs were hybridized to Affymetrix Human Focus gene expression microarrays following standard protocols. Arrays were hybridized, washed, and scanned following the manufacturer's protocol. Array images were processed using the Affymetrix Microarray Analysis Suite (MAS5) statistical algorithm. All arrays were scaled to a target intensity of 1,500 and data was exported to the Siteman Cancer Center Bioinformatics Core Facility6 for probe set annotation and further analysis. The complete microarray data set can be found at the aforementioned URL.
Quantitative RT-PCR. Two micrograms of each whole bone marrow RNA were converted to first-strand cDNA using Omniscript reverse transcriptase (Qiagen) and random hexamers. Resulting cDNA was diluted to an equivalent of 5 ng/µL of input RNA. Primer/probe sets for indicated transcripts were purchased from Applied Biosystems. Each reaction consisted of 10 µL of cDNA, TaqMan Master Mix (Applied Biosystems), and primer/probe set in a total volume of 50 µL, following the manufacturer's standard protocol. For each transcript/sample, duplicate or triplicate reactions were done in an ABI 7500 FAST Sequence Detection System. Resulting cycle threshold data was exported for further analysis.
Immunohistochemistry. Cytokeratin-positive cells in the bone marrow were independently identified by immunohistochemistry analysis of 2 million mononucleated bone marrow cells stained with the anticytokeratin antibody mixture AE1/AE3 (DAKO).
Data analysis and statistics. Annotated microarray data output was analyzed using a number of approaches as described in Results. Probe sets that were scored as not detectable ("A") by the Affymetrix algorithm in all samples, as well as control probe sets were removed before further analysis. Data filtering, visualization, and ANOVA analysis was done using DecisionSite for Functional Genomics software (Spotfire). Class distinction analyses were done using the Significance Analysis of Microarray algorithm, version 2.21 (14). For hierarchical clustering, normalized microarray signal data was first transformed to a mean of 0 and SD of 1 (Z-score calculation).
For qRT-PCR data, duplicate or triplicate reactions with a cycle threshold (CT) value difference of >1.5 were excluded from further analysis. Each run with a specific primer/probe set was normalized to CT values using a glyceraldehyde-3-phosphate dehydrogenase primer/probe set on the same cDNA sample. This value was in turn normalized across the entire sample set by using the average
CT from three bone marrow samples from healthy volunteers, using the 
CT method (15). Data for transcript expression levels were then expressed as fold difference relative to the average of normal bone marrow. In samples where a specific transcript was not detected (CT > 40), the fold difference in expression was defined as 1.0 (equal to that of the normal bone marrow average).
To correlate expression of the genes with clinical variables, a Fisher's exact test was done. For these analyses, patient specimens were considered to express the gene of interest if expression levels in bone marrow were at least 2 SDs above the mean expression of three bone marrow samples from volunteers. Patients were considered "positive" for biomarker gene expression or cytokeratin-positive cells when detected in at least one of the two bone marrow samples analyzed (left or right side). Logistic regression was used to estimate the odds of 1 year recurrence in patients who were positive relative to those who were negative for each of the markers analyzed.
| Results |
|---|
|
|
|---|
Effective immunoenrichment of DTCs using the EpCAM antigen. We used previously documented methods to enrich for putative tumor cells from bone marrow cell populations using the TACSTD1 antigen, EpCAM (11). The estimated amount of contamination using this technique is in the range of 0.74% to 74% (11). As a pilot study in our laboratory showed, this methodology can selectively isolate breast tumor cells spiked into normal bone marrow with >80% recovery (data not shown). To confirm that EpCAM immunoselection enriched for DTCs in vivo, we examined a number of marker genes reported to be expressed by DTCs (16). Expression of TACSTD1 (EpCAM) itself was enriched over 5-fold compared with corresponding whole bone marrow in many, but not all immunoselected samples (Fig. 2 ). Keratin 19 and mammoglobin (SCGB2A2; ref. 17) were also enriched in some immunoselected cell samples (i.e., patient 3422) by as much as 100-fold. The enrichment for EpCAM expression was not a profound as expected, but this result is likely due to a background level of transcript expression in total bone marrow.
|
|
7,000 gene transcripts, is shown in Fig. 3. From this analysis, several distinct classes of gene transcription are apparent. In particular, two subsets of EpCAM-selected bone marrow samples from predominantly posttherapy patients ("D" and "E") form a distinct expression cluster from unselected marrow, normal marrow, and EpCAM-selected marrow from primarily pretherapy patients ("F" and "G"). We propose that those samples in groups D and E represent bone marrow containing DTCs that persist after chemotherapy, and display a discernibly unique and functionally relevant molecular profile. In fact, a number of notable transcripts overrepresented in patient cluster D include those associated with tumor invasion and metastasis (SPARCL1, MMP2, TNC, FMOD, TIMP4, and TWIST1), epithelial cells and breast cancer (EMP1, EMP2, AMPH, ESR1, and BCAR3), and growth factors associated with breast cancer progression (MDK and VEGFC). In contrast, transcripts overrepresented in cluster E include a preponderance of ribosomal proteins and translational initiation factors, cell cycle, and DNA repair genes (PCNA, CCNB2, CDC2, MYC, CHEK1, XRCC1, BCCIP, BRCA1, and RAD51), and breast cancer biomarkers (BCAS2, MUC1, and TACSTD1). TWIST1 gene expression is part of a DTC signature. To maximize the possibility of identifying unique genes associated with DTCs that persist through chemotherapy, we directly compared 12 matched pairs of whole bone marrow and EpCAM-enriched samples from patients who had completed four cycles of chemotherapy. To determine whether the enriched cell expression profile was unique to EpCAM selection or an inconsequential result of cell processing, we also compared these expression profiles to those of cell populations selected with the six other antibodies described above. Finally, as a negative control, we compared these expression profiles to EpCAM-selected cells from bone marrow obtained from healthy volunteers.
Using the Significance Analysis of Microarrays algorithm (14), we identified 627 probe sets whose expression was significantly different between the 12 paired EpCAM-selected and unselected samples, with a predicted false discovery rate of 5%. Although the expression of these genes is significantly different between selected and unselected samples, we are unable to determine whether they are highly expressed in a few cells or present in the majority of DTC. As listed in Supplementary Table S2, 67 unique transcripts were overrepresented in EpCAM-selected samples at a level equal to or greater than that of the EpCAM antigen (TACSTD1) itself. Furthermore, 25 of these transcripts were undetectable in both EpCAM-selected and unselected bone marrow from healthy volunteers. We used the 67 transcript set to cluster 40 remaining samples that were not used for the initial analysis. As shown in Fig. 4 , this gene signature does not simply distinguish EpCAM-selected cells from total bone marrow, but rather segregates patients into two distinct classes. Six patients showed enhanced expression of the DTC signature in their posttherapy bone marrow samples, whereas samples from remaining patients (both pretherapy and posttherapy) and healthy volunteers were relatively devoid of the DTC signature. Interestingly, enhanced expression of the DTC signature is not observed in the corresponding pretreatment specimens of four of the patients (2153, 1092, 6268, 5385). Currently, there is insufficient follow-up information regarding this set of patients and whether these patients with the DTC signature will have a shorter disease-free survival.
|
TWIST1 expression in DTCs is associated with early relapse. The most clinically useful biomarker would be one that would indicate the potential of DTCs present in whole, unfractionated bone marrow to form distant metastasis at a time before any treatment intervention. To address this concept, an additional set of 50 bone marrow specimens obtained before any treatment intervention from 30 patients (Fig. 1) were assayed for expression of several DTC signature genes using qRT-PCR. In this validation set, 15 patients progressed or developed metastatic disease within 1 year and the remaining 15 patients had no evidence of disease at 1 year. Specimens from the two groups were matched for expression of estrogen receptor, progesterone receptor, and Her-2/neu expression. All patients in the validation set were subsequently treated with identical chemotherapy regimens. There was no statistically significant relationship between progression or early metastatic disease development and other tumor markers (estrogen receptor, progesterone receptor, Her-2), grade of tumor, or response of the primary tumor to neoadjuvant chemotherapy (data not shown). Importantly, progression at 1 year was also unrelated to the presence of cytokeratin-positive cells in bone marrow, as assessed by traditional immunocytochemistry (Fig. 5 ).
|
| Discussion |
|---|
|
|
|---|
Several investigators have used a similar approach to molecularly characterize DTCs in patient bone marrow using capture antibodies such as EpCAM (10, 11). However, it is known that DTCs lacking EpCAM expression do exist and, in particular, may be more prominent after chemotherapy exposure (9). Therefore, it is formally not possible to distinguish those cells that have lost EpCAM expression and are not captured, versus those cells that retain EpCAM expression, are captured, but are not detected due to loss of cytokeratin expression. This may explain the lack of correlation between number of cells detected in bone marrow by cytokeratin staining and EpCAM selection. In this study, we attempted to isolate DTCs that may not be captured by EpCAM targeting in a subset of samples by subsequent selection using antibodies directed against six other surface antigens reported to be expressed on metastatic breast tumor cells or breast cancer stem cells. None of the samples analyzed using these antibodies exhibited unique expression profiles, suggesting that we were unable to further select additional DTC subsets based on expression of these antigens.
Identification and molecular characterization of DTCs in bone marrow has been hampered by the extremely low frequency of these cells, even in patients with advanced-stage disease. In this study, even at a prevalence of 10 to 20 DTCs per million nucleated marrow cells, we would predict recovery of only 400 to 800 cells from the analyzed samples. Transcript detection using PCR-based strategies are relatively straightforward with these few cells, although linear transcript amplification and microarray analysis are more problematic (27). In fact, numerous EpCAM-selected samples in this study (included several from normal marrow donors) failed to produce microarray results, suggesting that too few cells were present for analysis in these samples.
EpCAM expression is also not absolutely specific for tumor cells, and is likely expressed on some cellular components of normal bone marrow (28). Therefore, many of the expression signatures identified in this study (including those from EpCAM-selected normal bone marrow) likely originate from non-DTC sources. Nevertheless, many transcripts identified were specific to a subset of EpCAM-selected samples from breast cancer patients and were not detected in normal marrow, suggesting that these markers are specific for DTCs. Interestingly, even if the expression of these signature genes do not directly originate from DTCs themselves, it is possible that the expression signature of other bone marrow elements may be equally informative with regard to the presence and metastatic potential of DTCs that lie within them, due to tumor-environment signaling interactions (29, 30). In the future, it will be interesting to determine how many "DTC signature" transcripts are actually expressed by other bone marrow elements.
Several lines of evidence further suggest that the expression signatures identified in this study truly reflect the presence of DTCs. Woelfle et al. (31) showed that the process of immunoselection itself does not substantially alter cellular expression profiles. This finding was confirmed in our study in that selection of cells with alternate antibodies and of cells from normal marrow does not recapitulate the signature obtained from EpCAM-selected cells from breast cancer patients. In EpCAM-selected cells, several known tumor markers such as cytokeratin 19 (KRT19) and mammaglobin (SCGB2A2) were enriched in many samples, indicating the successful isolation of true DTCs. Finally, in an independent set of samples, the DTC signature did not simply segregate EpCAM-selected versus unselected marrow, but rather defined two distinct classes of patients, presumably those with and without chemotherapy-persistent DTCs.
The identified DTC signature initially consisted of a large number of transcripts whose expression was enriched in EpCAM-selected cells. However, only a small number of these transcripts showed a comparable degree of enrichment to the EpCAM transcript itself, TACSTD, and only a further subset set of these transcripts were undetectable in normal marrow. The biological significance of many of the individual DTC transcripts is uncertain, with one notable exception, TWIST1. The TWIST1 gene is a basic helix-loop-helix transcription factor that directs mesodermal cell fates (32). Expression has been associated with epithelial-mesenchymal transition, a critical component of the metastatic cascade, whereby transformed epithelial cells lose normal cell-cell interaction and adopt a mesenchymal phenotype (25). Expression of TWIST1 has been associated with poor outcome in multiple tumors, including melanomas, breast adenocarcinomas, and gliomas (23, 33, 34). Moreover, increased TWIST1 protein levels are associated with resistance or reduced sensitivity to cytotoxic drugs (35).
For the initial microarray analysis, bone marrow samples obtained after chemotherapy were purposefully chosen to identify gene signatures associated with persistent DTCs that are more likely to have clinical significance with regard to metastasis and overall survival. To determine the predictive strength of gene expression on clinical outcome, treatment-naïve bone marrow samples from an independent subset of patients on the same clinical protocol were chosen for marker validation. Two transcripts from the DTC signature, PIR and GPR161, were not correlated with clinical outcome in this validation set. In addition to the many possible technical explanations for this result, it is possible that chemotherapy exposure selects for DTC subclones or alters DTC expression profiles such that expression of these genes is not relevant in pretreatment samples. Most importantly, however, we were able to confirm that TWIST1 expression even in pretreatment bone marrow did correlate with the occurrence of distant metastases or local progression at 1 year, whereas other traditional measures did not.
In the epithelial-mesenchymal transition regulatory pathway, TWIST1 may play a role similar to Snail genes (32). Members of the Snail family are involved in repression of E-cadherin, cell movement, breast cancer recurrence, and metastases development (36–39). Genes from this family may be involved in the progression of breast adenocarcinoma and, therefore, along with TWIST1, may serve as a marker of metastatic potential. In this study, we found that SNAI2 was expressed in a third of the bone marrow specimens tested. Unlike TWIST1 expression, however, not all SNAI2-positive patients developed metastatic disease or clinical progression within 1 year. Further follow-up of these patients will allow us to better determine the clinical specificity and sensitivity of SNAI2 versus TWIST1 bone marrow expression.
Finally, we observed no correlation between the development of early metastatic disease and the expression of two other transcripts frequently used to detect DTCs in blood, bone marrow, and lymph nodes—cytokeratin 19 and mammaglobin (24). Unlike these two markers, we propose that TWIST1 may provide a more accurate functional indicator of DTC with high metastatic potential. In this sense, TWIST1 and other transcripts identified through the approach outlined in this study may be more robust markers for predicting clinical outcome in certain subpopulations of breast cancer patients compared with any other conventional pathologic or molecular marker currently available.
One limitation of our study is the cohort of patents from whom the specimens were collected. All patients received a uniform chemotherapy regimen and the outcome focused on patients with early recurrent disease. Thus, our findings would benefit by confirmation in a larger cohort of patients undergoing a variety of chemotherapy regimens using multivariate analysis.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Supplementary data for this article are available at Clinical Cancer Research Online (http://clincancerres.aacrjournals.org/).
6 http://bioinformatics.wustl.edu ![]()
Received 1/ 5/07; revised 5/16/07; accepted 6/20/07.
| References |
|---|
|
|
|---|

C(T)) method. Methods 2001;25:402–8.[CrossRef][Medline]This article has been cited by other articles:
![]() |
G. Bonuccelli, M. C. Casimiro, F. Sotgia, C. Wang, M. Liu, S. Katiyar, J. Zhou, E. Dew, F. Capozza, K. M. Daumer, et al. Caveolin-1 (P132L), a Common Breast Cancer Mutation, Confers Mammary Cell Invasiveness and Defines a Novel Stem Cell/Metastasis-Associated Gene Signature Am. J. Pathol., May 1, 2009; 174(5): 1650 - 1662. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Alix-Panabieres, S. Riethdorf, and K. Pantel Circulating Tumor Cells and Bone Marrow Micrometastasis Clin. Cancer Res., August 15, 2008; 14(16): 5013 - 5021. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |