
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
Authors' Affiliations: 1 Department of Neurosurgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan and 2 Osaka Medical Center for Cancer and Cardiovascular Diseases, Osaka, Japan
Requests for reprints: Kikuya Kato, Osaka Medical Center for Cancer and Cardiovascular Diseases, 1-3-2, Nakamichi, Higashinari-ku, Osaka 537-8511, Japan. E-mail: katou-ki{at}mc.pref.osaka.jp.
| Abstract |
|---|
|
|
|---|
Experimental Design: The expression of 3,456 genes in 32 patients, 12 and 20 of whom had prognostically distinct anaplastic oligodendroglioma and glioblastoma, respectively, was measured by PCR array. Next to unsupervised methods, we did supervised analysis using a weighted voting algorithm to construct a diagnostic system discriminating anaplastic oligodendroglioma from glioblastoma. The diagnostic accuracy of this system was evaluated by leave-one-out cross-validation. The clinical utility was tested on a microarray-based data set of 50 malignant gliomas from a previous study.
Results: Unsupervised analysis showed divergent global gene expression patterns between the two tumor classes. A supervised binary classification model showed 100% (95% confidence interval, 89.4-100%) diagnostic accuracy by leave-one-out cross-validation using 168 diagnostic genes. Applied to a gene expression data set from a previous study, our model correlated better with outcome than histologic diagnosis, and also displayed 96.6% (28 of 29) consistency with the molecular classification scheme used for these histologically controversial gliomas in the original article. Furthermore, we observed that histologically diagnosed glioblastoma samples that shared anaplastic oligodendroglioma molecular characteristics tended to be associated with longer survival.
Conclusions: Our molecular diagnostic system showed reproducible clinical utility and prognostic ability superior to traditional histopathologic diagnosis for malignant glioma.
Currently, the most widely used classification scheme for human glioma is that of the WHO (1). For classic glioma cases with typical morphologic features, histologic diagnoses are generally consistent and accurately predict the corresponding clinical course. However, many gliomas do not fit neatly into any of the WHO categories due to their atypical histologic features. Additionally, because the WHO classification schemes are based largely on visual criteria, they are inevitably subject to considerable interobserver variation (2). As a result, the more atypical a glioma, the more likely that the clinical outcome will fail to match the predicted biological behavior. Generally, there is a distinct prognostic difference between patients with glioblastoma and anaplastic oligodendroglioma (1). However, the clinical task is more complex than merely separating malignant gliomas into these two groups with distinct malignant properties, and the findings to date indicate the essential limitations of current glioma classification schemes (3).
Ideally, tumor diagnosis should be objective, and to the extent to which it is possible, it should correctly reflect the biological behavior and corresponding outcome. Recently, genetic analyses showed that allelic loss of chromosomes 1p and 19q was significantly associated with prolonged survival in anaplastic oligodendroglioma (4). This finding highlights the importance of classifying gliomas into molecularly distinct groups, a prospect that should allow for markedly increased predictive power in the future.
Thus far, we have done gene expression profiling of a total of 1,200 human cancers using adaptor-tagged competitive PCR (ATAC-PCR), a PCR array system based on a high throughput reverse transcription-PCR technique (5–7), and have constructed a cancer gene expression database that is open to the public (8). In this study, we did gene expression profiling on malignant gliomas to construct a prognostically useful diagnostic system for discriminating anaplastic oligodendroglioma from glioblastoma. The clinical utility of this system was then tested on the microarray-based public data set, which includes outcome information.
| Materials and Methods |
|---|
|
|
|---|
Random expressed sequence tag sequencing, ATAC-PCR assay, and data processing. The expression of 3,456 genes was measured by PCR array via the ATAC-PCR method. The selection of objective genes for measurement was based on an expressed sequence tag sequencing survey of the genes expressed in 12 glioma tissues as previously described (9). We obtained 3,012 unique sequences from the EST collection and prepared 3,456 primers for ATAC-PCR, including additional 444 genes selected from a literature survey. Use of tissue-specific genes avoided wasteful measurement of genes not detected in glioma tissues, providing the advantage of reducing noise in the statistical analysis. The ATAC-PCR experimental procedure was done as previously described (7). The raw value describing gene expression levels were divided by the median expression value of each sample. This standardization step corrects for variation in mRNA level from sample to sample. Values <0.05 and >20 were converted to 0.05 and 20, respectively, and subsequently the entire data matrix was converted to a logarithmic scale. The detailed protocols for the ATAC-PCR experimental procedure are available on our Web site. The complete list of genes and expression data collected and analyzed in this study are available as Supplementary Data. The data will be deposited in Center for Information Biology Gene Expression Database in the DNA Data Bank of Japan.
Methylation-specific PCR, loss of heterozygosity analysis of 1p and 19q, and mutation analysis. DNA methylation patterns in the CpG island of the MGMT gene was determined by the method of methylation-specific PCR as previously described (10, 11). Genotypes for multiple loci for loss of heterozygosity analysis were determined by PCR using fluorescent primers tagged with FAM (Hokkaido system science) for microsatellite markers on chromosome 1p36 and 19q13 as previously described (12, 13). P53 mutation status was analyzed by sequencing between exons 4 and 10, including the DNA-binding domain as previously described (14). EGFRvIII was detected by reverse transcription-PCR as previously described (15).
Statistical analysis. Hierarchical cluster analysis and principal component analysis were done using Genmath2.0 software. For the hierarchical cluster analysis, the Ward clustering method was adopted, with Euclid distance used as a similarity coefficient.
The statistical significance of the anaplastic oligodendroglioma/glioblastoma comparison was evaluated for each gene by both P value and q value analysis. The P value of a test measures the minimum false-positive rate that is incurred when calling that test significant. Likewise, the q value of a test measures the minimum false-discovery rate that is incurred when calling that test significant. The false-discovery rate is the expected proportion of false positives among the tests found to be significant (16). The P values were calculated using t statistics. The q values were calculated using the software "Qvalue," supplied online.3
To construct a molecular diagnostic system to discriminate anaplastic oligodendroglioma from glioblastoma, we used a weighted voting algorithm coupled with gene selection using signal-to-noise ratio ranking. The weighted voting algorithm is a popular supervised learning method with excellent predictive ability for binary classification using gene expression data (17). Briefly, we calculated the signal-to-noise ratio (S), S = (Ugb – Uao) / (
gb +
ao), where U and
represent the mean and SD of expression for each class glioblastoma and anaplastic oligodendroglioma, respectively. The magnitude of the gene vote (V) reflects the deviation of the test sample X value from the average of glioblastoma and anaplastic oligodendroglioma.
![]() |
The functional group analysis estimated the chance of association to a functional group using binomial distribution, described as follows.
![]() |
![]() |
The functional group was based on the Biological Process Ontology Guidelines from the Gene Ontology database. The chromosomal position of each gene was based on the RefSeq database.
We obtained the gene expression data of 50 malignant gliomas from a previously published microarray study from the Massachusetts General Hospital (MGH). The processing of the original raw data was done as follows. From the original raw data set, we judged the 9,285 genes that had been "present" in two or more samples to be eligible for further analysis (18). The expression value of each gene was divided by its median expression level among the samples. Values below 0.01 and above 100 were converted to 0.01 and 100, respectively, and subsequently the entire data matrix was converted to a logarithmic scale. Cumulative overall survival rates were calculated by the Kaplan-Meier method, and the differences in the survival curves were estimated by the log-rank test, using SPSS software.
We applied the classifier genes in the MGH study to our data set. Classification by K-nearest neighbor was done with K = 3 and Euclidean distance as the similarity measure. The diagnostic accuracy was evaluated by leave-one-out cross-validation.
| Results |
|---|
|
|
|---|
|
|
First, we did an unsupervised analysis to obtain a general view of global gene expression signatures. Hierarchical cluster analysis of the 3,269 genes in all 32 gliomas showed that there were distinct patterns of expression for both anaplastic oligodendroglioma and glioblastoma (Fig. 1D). Principal component analysis also showed a clear separation between anaplastic oligodendroglioma and glioblastoma in reduced three-dimensional space (Fig. 1B). These results raise the possibility of classifying anaplastic oligodendroglioma and glioblastoma based on the expression levels of the diagnostic genes that they differentially express.
Then, we evaluated the statistical significance of the anaplastic oligodendroglioma/glioblastoma comparison for each gene by performing both P value and q value analyses. Of the 3,269 genes, 416 genes differed significantly between anaplastic oligodendroglioma and glioblastoma with a cutoff of P value = 0.001.The estimated number of false-positive genes was <2, with a q value of 0.00354 (Fig. 1C).
Next, we did a supervised analysis to construct a diagnostic system for classifying anaplastic oligodendroglioma and glioblastoma for practical use. We adopted a weighted voting algorithm with gene selection by signal-to-noise ratio ranking (17). This diagnostic system was confirmed using leave-one-out cross-validation, showing stable 100% (95% confidence interval, 89.4-100%) diagnostic accuracy in differentiating the test anaplastic oligodendroglioma and glioblastoma cases, using at least the 168 top-ranked genes (Supplementary Figure). Accordingly, we established the top 168 genes by signal-to-noise ratio ranking for all 32 cases as the classifier set. These genes were a subset of the 300 genes whose expression levels were significantly different between anaplastic oligodendroglioma and glioblastoma, with a rigid cutoff value of P = 0.0003, q = 0.0011.
Of the 168 selected classifier genes, the 87 genes showed higher expression in anaplastic oligodendroglioma samples (anaplastic oligodendroglioma classifier genes), including a group of genes involved in general neuronal function or neuronal differentiation (Table 2 ). On the other hand, the 81 genes with higher expression in glioblastoma samples (glioblastoma classifier genes) included a group of genes for which high expression was previously reported to be correlated with the malignant character of glioblastoma (Table 3 ). Functional group analysis revealed that the biological process in which classifier genes involved were distinctively different between two tumor classes. Significantly more glioblastoma classifier genes were associated with cell motility and cell adhesion, whereas prominently more anaplastic oligodendroglioma classifier genes were associated with neuron-related functions such as neuron fate commitment, nervous system development, and synaptic activity (Table 4 ). Analysis of chromosomal positions of classifier genes revealed that there were fewer anaplastic oligodendroglioma classifier genes on 1p and 19q than glioblastoma classifier genes, reflecting the loss of 1p and 19q in most anaplastic oligodendroglioma cases (Tables 2 and 3).
|
|
|
|
| Discussion |
|---|
|
|
|---|
Interestingly, the selected classifier genes for each class possessed a distinctive feature in functional aspects. Among the classifier genes whose expression levels were higher in anaplastic oligodendroglioma samples, many genes are implicated in general neuronal function or in neural development. In particular, functional group analysis revealed that the genes associated with neuron fate commitment were most enriched in the anaplastic oligodendroglioma classifier genes. Recent study revealed a close correlation between the stage in neural development and the prognostic classes among high-grade gliomas. Similar to our finding, they showed that strong expression of markers of committed neuronal lineage was the characteristic feature of a good prognosis group (21). These results stressed the prognostic significance of the expression of neuronal lineage markers but also might offer clues for understanding the genesis of anaplastic oligodendroglioma. The neuronal feature in classic anaplastic oligodendrogliomas may reflect the cell they originated from. The previous studies showed that most of the genes showing distinctive expression in oligodendroglial tumors with allelic loss of chromosome 1p had neuron-related functions (22, 23). Of the genes they indicated as diagnostic markers for oligodendroglial tumors with 1p loss, SNCB, INA, L1CAM, and RIMS2 were also included in our classifier gene set.
In addition to such a neural character, some anaplastic oligodendroglioma classifier genes have been reported to be expressed in cells of the oligodendroglial lineage. Olig-1 and Olig-2 are both broadly expressed throughout oligodendrocyte development (24). Although previous studies revealed that Olig genes are not necessarily specific markers for oligodendroglial tumors (25, 26), our finding that expression of Olig-1 and Olig-2 was significantly lower in glioblastoma samples than in anaplastic oligodendroglioma samples indicates its utility in discriminating anaplastic oligodendroglioma from glioblastoma. Another pair of transcription factors, Sox-4 and Sox-8, which were also reported to be expressed in cells of the oligodendrocyte lineage, functioned as classifier genes for anaplastic oligodendroglioma and glioblastoma (27, 28). These results suggest that anaplastic oligodendroglioma shares some of the characteristic gene expression patterns of oligodendrocyte progenitors, and that these oligodendrocyte lineage genes, when used as a set, might act as a powerful diagnostic marker for anaplastic oligodendroglioma.
Conversely, the classifier genes up-regulated in glioblastoma samples included a group of genes that are involved in the tumor invasion process, one of the hallmark malignant features of glioblastoma. The functional group analysis statistically endorsed the result, demonstrating that the genes associated with cell motility and adhesion, which are the main biological processes of tumor invasion, were significantly enriched among glioblastoma classifier genes. The previous microarray study investigated the genes involved in glioma cell motility (29). Of genes up-regulated with increased glioma cell motility, Tenascin-C, CD44, Fn14, PAI-1, Annexin1, and Moesin were also included in our gene set. Invasion of glioma cells into adjacent brain tissue is dependent on their interaction with the extracellular matrix. Extracellular matrix components and cell surface receptors such as Tenascin-C, CD44, and Fn14 play a major role in regulating cell migration, and have been suggested as the principle mediators of the glioma invasion process (30–32). Urokinase plasminogen activator receptor and plasminogen activator inhibitor-1, a member of the urokinase plasminogen activator system, also play an important role in the tumor invasion process and are associated with either aggressive tumor characteristics or a poor prognosis in various malignancies, including gliomas (33–35). In addition to these genes, Galectin-1 was also reported to be involved in the invasion process, and its expression displays a positive correlation with shorter survival among astrocytic tumors (36). Insulin-like growth factor binding protein 2 (IGFBP2) was frequently overexpressed in glioblastoma. IGFBP2 stimulates glioma cell invasion (37, 38).
Recently, several studies have shown the prognostic value of YKL-40 expression for poor outcome (21, 39, 40). It has been suggested to stimulate Ras and Akt pathways (41). Recent evidence indicates that the variation of glioma phenotype results from different kinds of alteration in signaling pathways. Phillips et al. (21) mentioned that YKL-40 expression and Akt activation were markers for poor prognosis among high-grade gliomas. As glioblastoma showed strikingly more YKL-40 expression than anaplastic oligodendroglioma by immunohistochemistry (42), YKL-40 stimulating signaling pathways may cause a highly invasive phenotype.
We note that our diagnostic model could not only discriminate between anaplastic oligodendroglioma and glioblastoma, but also disclose their distinctive molecular features. The characteristic function associated with clinical malignancy of glioblastoma is the invasion process, whereas classic anaplastic oligodendrogliomas had some extent of neuronal property. Using such representative functional features, it may be possible to refine the molecular diagnostic scheme.
It is interesting that a lower PS correlated with longer survival among the histologically diagnosed glioblastoma cases. From a pathologic point of view, Donahue et al. (43) reported that glioblastomas with some oligodendroglial characteristics showed a tendency toward better prognoses than pure glioblastomas. Our results showed the same phenomenon from a molecular point of view. Thus, glioblastomas whose gene expression signatures contained anaplastic oligodendroglioma components would show improved survival. These findings suggest the existence of molecular subclasses within glioblastomas.
Regarding the known molecular prognostic factors, we noted that the loss of 1p and 19q coexisted with MGMT promoter methylation in our anaplastic oligodendroglioma cases, as reported previously (44).
From a technical point of view, it is noteworthy that the results of our PCR array–based molecular diagnosis and those of microarray-based diagnosis were highly consistent with each other, especially for gliomas of histologically indeterminate diagnosis. Recently, the reproducibility of such large-scale gene expression analyses has come into question, particularly as pertains to the use of new methods such as the PCR array–based method we have adopted (45). Common concerns regarding these approaches arise from the statistical problems of handling such vast data sets as well as from the amount of unavoidable noise stemming from both intersample variation and from the techniques themselves. This study showed a clinically useful and reproducible ATAC-PCR–based molecular diagnostic system for malignant gliomas, which was shown to be comparable in effectiveness to systems based on microarrays.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Supplementary data for this article are available at Clinical Cancer Research Online (http://clincancerres.aacrjournals.org/).
3 http://faculty.washington.edu/~jstorey/qvalue/ ![]()
Received 11/27/06; revised 6/22/07; accepted 9/26/07.
| References |
|---|
|
|
|---|
combination chemotherapy in advanced hepatocellular carcinoma. Clin Cancer Res 2004;10:6029–38.This article has been cited by other articles:
![]() |
F. Ducray, E. Criniere, A. Idbaih, K. Mokhtari, Y. Marie, S. Paris, S. Navarro, F. Laigle-Donadey, C. Dehais, J. Thillet, et al. {alpha}-Internexin expression identifies 1p19q codeleted gliomas Neurology, January 13, 2009; 72(2): 156 - 161. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |