
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
1 Human Genetics Department and 2 Cancer Epigenetics Group, Molecular Pathology Programme, and 3 Bioinformatics Unit and 4 Cytogenetics Unit, Biotechnology Programme, Spanish National Cancer Centre, Madrid, Spain; and 5 Department of Genetics, H. Santa Creu i Sant Pau, Barcelona, Spain
Requests for reprints: Sara Alvarez, Human Genetics Department, Molecular Pathology Programme, Spanish National Cancer Centre, c/o Melchor Fernández Almagro, N°3, 28029 Madrid, Spain. Phone: 34-91-224-69-65; Fax: 34-91-224-69-23; E-mail: salvarez{at}cnio.es.
| ABSTRACT |
|---|
|
|
|---|
Key Words: Familial breast cancer tumors CGH Random Forests
| INTRODUCTION |
|---|
|
|
|---|
Breast carcinomas from patients with a germ line mutation in BRCA1 or BRCA2 show some typical pathologic characteristics. BRCA1 tumors are generally high-grade, steroid receptors and HER-2 negative and p53 protein positive compared with controls. In contrast, BRCA2 tumors do not show significant differences in the expression of these proteins compared with control tumors (9). In addition, we have identified specific differences between BRCAX and BRCA1 tumors. BRCAX cases generally show lower-grade, positive steroid receptors and BCL2, negative p53, low proliferation rate, and absence of P-cadherin expression. However, only lower grade and lower proliferation rate differentiate the BRCAX tumors from the BRCA2 and control group (10).
In addition to the pathologic differences, breast tumors in carriers of mutant BRCA1 or BRCA2 gene are characterized by a large number of chromosomal changes, some of which differ from sporadic tumors depending on their genotype (11, 12). Losses of 5q, 4q, 4p, 2q, and 12q in BRCA1 tumors and losses of 13q and 6q and gains of 17q22-24 and 20q13 in BRCA2 tumors were significantly more common than in controls (11). Recently, some of these changes, such as loss of 5q and 12q, have been confirmed (12). Then, based on the genomic changes observed using comparative genomic hybridization (CGH), the BRCA1 profile has been used to build a classifier that distinguished between sporadic and BRCA1 tumors (12).
In the present work, we have used a more refined genome-wide approach as high-resolution CGH (HR-CGH) that more precisely identify the somatic genetic changes associated to BRCA1, BRCA2, and non-BRCA1/BRCA2 germ line mutation carriers. Based on the distinctive profiles of tumor progression among BRCA1/BRCA2 mutation carriers, we have build a classifier that allow for the first time to differentiate BRCA1 and BRCA2 tumors and to define the differences and similarities of the BRCAX tumors compared with the BRCA1/BRCA2 classes.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Hybridizations of normal breast tissue paraffin-embedded DNA versus reference female DNA were used as negative controls previous to the samples analysis. Because the tumor and reference DNA were not sex matched, the X and Y chromosomes were omitted from the analysis.
Statistical Analysis of CGH Data. A nonparametric Mann-Whitney U test was used to identify differences in the number of chromosomal gains and losses among the three patient groups. Differences in the frequency of involvement of individual chromosomal regions among the three familial breast cancer classes and among the BRCA1 and BRCA2 group with recurrent versus nonrecurrent mutations were tested with Fisher's exact test. The indicated Ps were not corrected for multiple testing and were calculated using the Stat POMELO tool available at http://pomelo.bioinfo.cnio.es (18).
Building a Predictor Data Preprocessing. Because a high number of variables better defined the tumor progression profile of each group, we selected 63 G-banded cytogenetic regions to refine the detected HR-CGH profile (Fig. 1). We have chosen as the most common minimal regions of involvement 50 regions including imbalances in at least 30% of the 28 BRCA1/BRCA2 cases used to build the predictor and with at least 3 cases defining the cytogenetic thresholds. To include the rest of the genome not fitting the previously defined criteria, we grouped the unselected areas on 13 chromosomal regions.
|
Analyses. We have used random forests to obtain a simple and good predictor of BRCA1 and BRCA2. Other approaches attempted without success are detailed in supporting information. Random forests are essentially an ensemble of classification trees that can be used directly with several variables much larger than the number of samples and that often show very good predictive performance (20, 21).
Random forests return a prediction as the unweighed majority of predictions from a very large (e.g., 5,000) collection of classification trees (22); the important features about these trees are that each one is grown using a bootstrap sample of the data set and that, at each node, only a random subset of the original variables is examined. As part of a random forests run, we obtained measures of the importance of the variables. A large importance measure indicates that random permutation of that variable causes samples to be misclassified more often (hence, that variable is important). Although random forests provide a very good classification and return a ranking of the importance of variables, a predictor built from 66 predictor variables is cumbersome to use and very difficult to understand. Moreover, elimination of variables that do not contribute to the classification could actually result in an improved predictive performance. Our approach to variable selection is very similar to that of Svetnik et al.6; essentially, we iteratively eliminate the variable with a lower importance measure until the out-of-bag estimate of prediction error becomes larger than the out-of-bag estimate of prediction error of either the previous model or the original model. However, these out-of-bag estimates are biased down because of the variable selection process (2325). We have thus estimated the prediction error of our simplification procedure using a leave-one-out cross-validation of the complete variable selection process; in other words, excluding each observation in turn, we run the complete variable selection procedure and then predict the left-out observation. This, therefore, yields a honest, almost unbiased estimate of prediction error, because the error rate is computed from observations that were never used for the simplification and building of the forest that is predicting an observation. For the analyses, we have used the random forests package for R (by A. Liaw and M. Wiener) that uses the Fortran random forests code by L. Breiman and A. Cutler. The R code for the variable selection is available from authors on request. The predictor will be available at http://bioinfo.cnio.es/data.
Methylation. A methylation-specific PCR assay was used to distinguish unmethylated alleles from methylated alleles of BRCA1 based on sequence changes produced by treating DNA with bisulfite, which converts unmethylated (but not methylated) cytosines to uracil followed by a PCR assay involving primers designed for either methylated or unmethylated DNA (26). The methylated product is 75 bp long and the unmethylated one is 86 bp. DNA from normal lymphocytes was used as a negative control, and in vitro methylated DNA was used as a positive control (Fig. 3E).
|
| RESULTS |
|---|
|
|
|---|
|
The frequency and distribution of gains and losses of each group are shown in Fig. 1. In all groups, the most common changes, observed in >50% of the cases, were gain at 1q and losses at 8p21-23 and 11q22-25. Nevertheless, intergroup comparison revealed several chromosomal regions with significant differences in the frequency of involvement (Table 1). These data suggest a different pattern of genomic progression for BRCA1 and BRCA2 cases.
Building a Predictor for Classifying BRCA1 and BRCA2 Tumors. The CGH profiles of a series of 15 BRCA1 (cases 1-15; supplemental research data; Table 1) and 13 BRCA2 (cases 25-37; supplemental research data; Table 1) breast carcinomas were used to develop using random forests (see Building a Predictor) a molecular classifier, which assigns any given tumor to either the BRCA1 or the BRCA2 group.
As part of a random forests run, we obtained measures of the importance of the 63 cytogenetic and the 3 pathologic included variables. After we eliminated the variables of lower importance, the selected variables used to estimate the probability of that a given tumor to be BRCA1 or BRCA2 were the estrogen receptor status and the following six chromosomal regions: 2p11-21, 8p11-12, 12q11-21, 15q22-26, 18p, and 18q. Five of these somatic genetic imbalances also showed statistically significant differences using a Fisher's exact test (Table 1). Using the leave-one-out cross-validation of the complete variable selection process (see Building a Predictor), a score >0.5, indicating a probability of belonging to the BRCA1 class >50%, was obtained in 12 of 15 (80%) BRCA1 tumors and 5 of 13 (38%) of the BRCA2 tumors (Fig. 3A and B). To further validate the predictor, we run a new set of tumors. Nine BRCA1 (cases 16-24; supplemental research data; Table 1) and six BRCA2 (cases 38-43; Supplemental Research Data; Table 1) breast carcinomas were analyzed. The classification scores for this validation set are depicted in Fig. 3C. All the cases, except for one case of each group, were properly classified. The probability of being a BRCA1 tumor among the 43 BRCA1/BRCA2 tumors analyzed was estimated, with an accuracy of 76.7%, a sensitivity of 83.33%, and a specificity of 68.42%.
Figure 4 depicts the a posteriori probability of these regions as they are used in the predictor. These histograms represent the probability of that a tumor is BRCA1 or BRCA2 given the particular aberrations observed in the tumor and provide inside of the specific pattern of tumor progression. Therefore, the BRCA1 profile yielding the best classification performance is as follows: no change or gain at 2p11-21; no change or loss at 8p11-12 and 12q11-21; losses at 15q22-26, 18p, and 18q; and a negative estrogen receptor status. The BRCA2 profile is as follows: no change or loss at 2p11-21; no change or gain at 8p11-12, 15q22-26, 18p, and 18q; gain at 12q11-21; and a positive estrogen receptor status.
|
|
To determine if the high frequency of the BRCA1 pattern of tumor progression among the BRCAX and BRCA2 cases could be due to the BRCA1 gene silencing by epigenetic factors on these two groups, we analyzed the BRCA1 promoter region for aberrant methylation. Testing (in a blinded fashion) of all specimens of BRCAX group from our study indicated that 44% of the BRCAX tumors (15 of 34 BRCAX cases) had hypermethylation of the BRCA1 promoter region. When we consider the predictor, we observed that all the cases with this epigenetic event were assigned to the BRCA1 class with a probability of >50%. Interestingly, 84.6% (11 of 13) of the cases assigned to the BRCA1 class with a probability of >80% had an aberrant methylation of the BRCA1 promoter (Fig. 3D).
These results leave out whether the BRCA1 promoter methylation events could also be occurring as germ line events. We analyzed the methylation status of the BRCA1 promoter on eight DNA obtained from five normal breast tissues and three peripheral blood leukocytes from five BRCAX cases. All these BRCAX patients showed BRCA1 promoter hypermethylation on the tumor sample (two of these cases, cases 68 and 73, are included on this study). All the DNA analyzed samples were found unmethylated. These data suggest that methylation is not a heritable trait on these families but a frequent event during tumor progression.
We also tested all the BRCA2 tumors and we found that 31.6% (6 of 19) of them were methylated. Among the methylated cases, only case 34 was misclassified as BRCA1. This case has also been classified by immunohistochemical and cDNA expression studies as a BRCA1 tumor (data not shown).
Statistical analysis comparing the family history, the grade, and the steroid receptor status of methylated versus unmethylated BRCA2 and BRCAX tumors did not show significant differences (Fisher's exact test, P > 0.05).
| DISCUSSION |
|---|
|
|
|---|
Although a similar frequency of involvement at the chromosomal regions reported previously to be significantly different between BRCA1/BRCA2 tumors and sporadic breast carcinomas were observed (supplemental research data; Table 2; refs. 11, 12), we did not find the low frequency of 16q loss and the high presence of 20q gain reported previously on the BRCA2 tumors (11). These discrepancies may be due to the differences among the studied populations, with 80% of the cases of the previously studied nordic population carrying a 999del5 BRCA2 mutation. In addition, our intergroup comparisons revealed significant differences, as gains of the 15q11-21 region, which occur in 66.6% of the BRCA2 tumors with the 3036delAAAC mutation and in none of the non-recurrent tumors (Table 2). These facts suggest that different mutations may slightly modify the profile of somatic changes along the tumor progression.
A Classifier Based on Distinct Patterns of Tumor Progression among the BRCA Mutation Carriers Recently, the BRCA1 profile has been used to build a classifier to prescreen high-risk patients. This predictor distinguished with a high accuracy (84%) among BRCA1 and sporadic tumors. The regions used in this classifier are located at 3p, 3q, and 5q, but these regions did not show significantly different frequencies of imbalances when BRCA2 and BRCA1 tumors are compared (12).
In our study, several chromosomal regions seemed to be preferentially involved in the BRCA1 cases and yet others in the BRCA2 cases. By using random forests, we have developed for the first time a simple and good molecular classifier based on the genomic pattern of somatic changes accumulated during the tumor development that can be used to distinguish between BRCA1 and BRCA2 mutation carriers with an accuracy of 76.7%. Previously, this supervised classification method has been used for building, with similar accuracy to other methods, good prediction models (30). We obtained a high sensitivity (83.3%) identifying the BRCA1 cases, but further studies to refine the profiles, such as CGH array and a detailed analysis of the status of the BRCA1 gene among the BRCA2 tumors, are needed to improve the specificity (68.4%). The variables used to define the distinctive patterns of tumor progression in our classifier are six somatic genetic imbalances (2p11-21, 8p11-12, 12q11-21, 15q22-26, 18p, and 18q) and the estrogen receptor status. Some of these regions as 12q and 18q have also been found significantly more frequent lost in the BRCA1 cases than in the sporadic tumors (11, 31). Interestingly, we have shown previously using a familial breast cancer tissue microarray that the expression of the BCL2 gene located at 18q21.3 is more frequently lost in the BRCA1 cases (89.5%) than in the BRCA2 (57.1%) and BRCAX cases (44%; ref. 10). This is in good agreement with the higher frequency of 18q loss observed by HR-CGH in the BRCA1 (60%) compared with the BRCA2 (15.4%) or the BRCAX cases (56%).
We attempted several approaches to build a classifier to distinguish the three classes (see supporting information). However, we could not define a somatic genetic profile that distinguished between the BRCA1/BRCA2 mutation carriers and the BRCAX cases. Therefore, we assumed that the pattern of tumor progression in our BRCAX cases is not quite different to the BRCA1 and BRCA2 mutation carriers, and we analyzed the BRCAX cases with the classifier. Seventy-six percent of the cases were classified as BRCA1, with a probability of >50%, confirming the somatic genetic similarities that we found by previous statistical analysis (Table 1; Fig. 3D).
Effect of DNA Methylation on the Somatic Genetic Profile of the BRCA2 and BRCAX Cases Because BRCA1 promoter hypermethylation was shown to silence the BRCA1 gene in
13% of sporadic breast carcinomas (26, 32), we studied this phenomenon as a mechanism of BRCA1 silencing in the BRCAX and the BRCA2 cases. We found this epigenetic event in 44% of the BRCAX (15 of 34) and in 31% of the BRCA2 cases (6 of 19). Among the BRCAX cases, 84.6% of the cases (11 of 13) assigned with a probability of >80% to the BRCA1 class were BRCA1 promoter hypermethylated (Fig. 3D). This epigenetic phenomenon was not observed at the germ line level.
BRCA1 does indeed seem to be implicated in sporadic breast and ovarian tumors. Aberrant methylation of BRCA1 seems to be a crucial player in the development of these tumors. Already, some genomic changes as MYC amplification have been found in a significantly higher proportion on sporadic tumors with BRCA1 dysfunction (33). In addition, it has been suggested that BRCA1 hypermethylation could influence the phenotype of the tumor on a similar way to a genetic mutation (34). Our study suggest for the first time that this epigenetic event could also influence the tumor progression genotype on a similar way to a genetic mutation in most of the BRCAX cases. Further clinical studies will be needed to know if the selection of these patients is clinically relevant.
We have not found statistically significant differences in the family history or in the steroid receptor status between methylated and unmethylated tumors. However, in the analysis of a tissue array of BRCAX cases, a statistically significant higher percentage of p53 and Ki-67 positivity has been observed in the BRCA1 methylated tumors, indicating a higher proliferation rate and suggesting a more aggressive behavior of these cases.7
The identification of a method to successfully subdivide the histopathologically heterogeneous group of BRCAX families into recognizable groups could be of considerable value to further genetic analysis. Previously, global gene expression analysis have discover novel classes among BRCAX tumors (35). Our findings illustrate that also at the genomic and epigenetic levels BRCAX families can be grouped into homogeneous subsets, thereby potentially increasing the power of genetic linkage analysis. Moreover, the complete sequencing of the BRCA genes is very difficult due to the large number of exons of these genes; this tool could allow us to select cases among the BRCAX patients for further genetic studies as large deletions or rearrangement analysis. At the therapeutic level, these data could point to the inclusion of demethylating agents into the treatment protocols for familial breast cancer.
Among the six BRCA2 cases with BRCA1 promoter hypermethylation, five cases were correctly assigned to the BRCA2 class and only case 34 was assigned to the BRCA1 class. This case also clusters with the BRCA1 tumors in ongoing expression studies from our laboratory (Fig. 3B). These data suggest that hypermethylation of the BRCA1 gene in the majority of the BRCA2 tumors could be a later event in the genetic progression, when the specific genetic pathway is yet established.
In summary, we have build a simple molecular classifier that distinguish with an accuracy of 76.7% between BRCA1 and BRCA2 tumors. This BRCA1 pattern of tumor progression is frequently found among the BRCAX tumors and, in more than half of these cases, is associated with BRCA1 promoter hypermethylation, suggesting that this event can modify the profile of tumor development in most of the BRCAX tumors. Further studies to better establish the genomic integrity of the BRCA1 gene are guaranteed. Finally, the presence of this epigenetic event among the BRCA2 cases is also frequent but not associated in the majority of the cases with a BRCA1 profile.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Supplementary data for this are available at Clinical Cancer Research Online (http://cancerres.aacrjournals.org/).
6 V. Svetnik et al. Variable selection in random forest with application to quantitative structure-activity relationship, in preparation 2004. ![]()
7 Honrado et al., unpublished data. ![]()
Received 6/14/04; revised 11/ 2/04; accepted 11/11/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Melchor, E. Honrado, J. Huang, S. Alvarez, T. L. Naylor, M. J. Garcia, A. Osorio, D. Blesa, M. R. Stratton, B. L. Weber, et al. Estrogen Receptor Status Could Modulate the Genomic Pattern in Familial and Sporadic Breast Cancer Clin. Cancer Res., December 15, 2007; 13(24): 7305 - 7313. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Fan, P. S. Yan, C. Hartman-Frey, L. Chen, H. Paik, S. L. Oyer, J. D. Salisbury, A. S.L. Cheng, L. Li, P. H. Abbosh, et al. Diverse Gene Expression and DNA Methylation Profiles Correlate with Differential Adaptation of Breast Cancer Cells to the Antiestrogens Tamoxifen and Fulvestrant Cancer Res., December 15, 2006; 66(24): 11954 - 11966. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Fackler, K. Malone, Z. Zhang, E. Schilling, E. Garrett-Mayer, T. Swift-Scanlan, J. Lange, R. Nayar, N. E. Davidson, S. A. Khan, et al. Quantitative multiplex methylation-specific PCR analysis doubles detection of tumor cells in breast ductal fluid. Clin. Cancer Res., June 1, 2006; 12(11): 3306 - 3310. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Melchor, S. Alvarez, E. Honrado, J. Palacios, A. Barroso, O. Diez, A. Osorio, and J. Benitez The Accumulation of Specific Amplifications Characterizes Two Different Genomic Pathways of Evolution of Familial Breast Tumors Clin. Cancer Res., December 15, 2005; 11(24): 8577 - 8584. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Cell Growth & Differentiation |