
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Molecular Oncology, Markers, Clinical Correlates |
1 Laboratoire de Cancérologie Expérimentale, Commissariat á LEnergie Atomique, Direction des Sciences du Vivant, Département du Radiobiologie et Radiopathologie, Fontenay-aux-Roses Cedex France; 2 Département de Pathologie, Institut Gustave Roussy, Villejuif Cedex, France; Departments of 3 Head and Neck Surgery and 4 Pathology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas
| ABSTRACT |
|---|
|
|
|---|
Experimental Design: Gene expression analysis using microarray platform was performed on 6 pathologically normal thyroid samples and 12 primary follicular and papillary thyroid neoplasms. Microarrays containing probes for 5,760 human full-length cDNAs were used for hybridization with total RNA from normal and tumor thyroid samples labeled with Cy3-dUTP and Cy5-dUTP, respectively. Scanned array images were recorded, and data analysis was performed. Selected sets of differentially expressed genes were analyzed using quantitative real-time reverse transcription-PCR for verification.
Results: We identified 155 genes that differentiate histologically normal thyroid tissues from benign and malignant thyroid neoplasms. Of these 75 genes were differentiated between follicular neoplasms (adenoma and carcinoma) and the follicular variant of papillary carcinoma. Purely follicular neoplasms (adenomas and carcinomas) shared many genetic profiles, and only 43 genes were distinctly different between these tumors. Hierarchical cluster analysis also differentiated conventional papillary carcinoma from its follicular variant and follicular tumors. The differentially expressed genes were composed of members of cell differentiation, adhesion, immune response, and proliferation associated pathways. Quantitative real-time reverse transcription-PCR analysis of selected genes corroborated the microarray expression results.
Conclusions: Our study show the following: (1) differences in gene expression between tumor and nontumor bearing normal thyroid tissue can be identified, (2) a set of genes differentiate follicular neoplasm from follicular variant of papillary carcinoma, (3) follicular adenoma and carcinoma share many of the differentiated genes, and (4) gene expression differences identify conventional papillary carcinoma from the follicular variant.
| INTRODUCTION |
|---|
|
|
|---|
Parallel analysis of gene expression by microarray techniques offers a large-scale platform for screening tumors for novel markers for potential clinical applications. Using this approach, the biological and the molecular characteristics of several neoplastic entities, including a few thyroid neoplasms, have been defined (15, 16, 17) . To characterize the genetic events underlying the morphologic and biological heterogeneity of differentiated thyroid neoplasms, we performed gene expression analysis of histologically normal thyroid tissues from tumor- and nontumor-bearing resections and benign and malignant follicular and papillary neoplasms.
| MATERIALS AND METHODS |
|---|
|
|
|---|
cDNA Collection and Probes.
Our platform consisted of 5,760 full-length cDNA clones from the Soares human infant brain 1NIB library (18)
kindly provided by Genethon (Evry, France). Plasmids containing clones were grown in Escherichia coli by standard microbiologic methods. Each insert was (PCR) amplified (30 cycles) with vector primers derived by sampling 1 µL of cell culture. PCR products were purified by EtOH precipitation, washed in 70% EtOH, dried, dissolved in 1 mmol/L EDTA/10 mmol/L Tris-HCl (pH 8; TE)/DMSO at 50:50 concentration and stored at 80°C. The quality, size, and concentration of PCR products were determined after agarose gel electrophoresis using Genetools software (Syngene; Merck Eurolab, Fontenay-sous-Bois, France) and were automatically annotated in the final differential expression data file.
Arraying.
PCR products were arrayed on poly-L-lysine-coated slides (Menzel Glaser: CML, Nemours, France) using the Microgrid II pro arrayer (Biorobotics Ltd., Cambridgeshire, United Kingdom). Slides were packed and stored in a dark, dry place at room temperature until use.
Array Hybridizations.
Before hybridization, the slides were hydrated over boiled water, dried for 3 seconds at 80°C, and exposed to UV irradiation (300 mJ, 254 nm) for DNA cross-linking. Slides were immersed in a freshly prepared blocking solution consisting of succinic anhydride (0.02 mmol/L) dissolved in 150 mL of 1-methyl-2-pyrrolidinone and 17 mL of sodium borate (0.2 mol/L; pH 8) and placed in an orbital shaker for 20 minutes. Slides were washed in H2O and were immersed in 100% EtOH at 20°C before a final 5-minute centrifugation at 500 rpm at room temperature. Slides were then prehybridized at 42°C in a 3x SSC, 0.1% SDS, 0.1% bovine albumin for 30 minutes at 42°C, washed in H2O, and immersed and in isopropanol and absolute EtOH, successively. The slides were then centrifuged at 500 rpm for 5 minutes at room temperature.
RNA, cDNA, and Labeling.
Total RNA was prepared using RNAplus according to the manufacturers protocol (Q-Biogene, Illkirsch, France). For each competitive hybridization, total RNA from a clinical sample and the normal thyroid reference from Clontech (Palo Alto, CA) were labeled with Cy3-dUTP and Cy5-dUTP (Amersham Pharmacia Biotech, Saclay, France) by reverse transcription. For each reverse transcription reaction, 20 µg of RNA was mixed with random hexamer (pdN6, Amersham Pharmacia Biotech; 2.5 µg/mL) in a total volume of 15.5 µL, heated at 65°C for 10 minutes, and placed on ice for at least 5 minutes. Unlabeled nucleotide pool (final concentration 500 µmol/L each dATP, dCTP, dGTP, and 200 µmol/L dTTP), either Cy3 or Cy5 conjugated dUTP (final concentration 66 µmol/L; NEN, Saclay, France), 1x first-strand Superscript II buffer, 10 mmol/L DTT, and reverse transcription (400 U; Superscript II; Life Technologies, Inc., Cergy-Pontoise, France) were added to a final volume of 30 µL. After incubation at 42°C for 2 hours in a dark room, RNA was hydrolyzed by adding EDTA (45 mmol/L) and NaOH (180 mmol/L) and incubated at 65°C for 5 minutes. The mixture was neutralized by Tris-HCl (450 mmol/L; pH 7.5) and by adjusting the volume to 500 µL with Tris-EDTA. Labeled cDNA was purified by centrifugation in a Microcon YM-30 (Amicon; Millipore, Bedford, MA) and eluted twice with 40 µL of TE.
Hybridization.
The Cy3 and Cy5-labeled cDNAs were mixed with 20 µg human Cot-1 DNA, 20 µg yeast tRNA (Life Technologies, Inc.) and 10 µg poly(A; Sigma-Aldrich, Saint-Quentin Fallavier, France), then precipitated with 0.5 volumes of 7.5 mol/L sodium acetate (pH 5.2) in EtOH. The pellet was dissolved in 60 µL of hybridization solution (50% formamide, 2.5x Denhardts solution, 0.5% SDS, and 6x SSPE. The probe was heated for 2 minutes at 100°C, incubated at 37°C for 20 to 30 minutes, and placed between slide and coverslip. The arrays were incubated overnight at 42°C in a custom humidified slide chamber. The slides were then washed in a first bath of 0.1x SSC with 0.1% SDS for 5 minutes and then twice in 0.1x SSC for 5 minutes at room temperature. The arrays were dried by centrifugation at 600 rpm for 10 minutes. As recommended, each hybridization was performed twice with reverse labeling (dye-swap; refs. 19, 20, 21
).
Image Analysis and Array Quantification.
Arrays were scanned using a two-color laser confocal scanner (GMS 418, Genetic Microsystems, Woburn, MA). Independent images were acquired for Cy3 (532 nm) and Cy5 (635 nm). Each spot was defined by automatic positioning of a grid using image-analysis spot-tracking software (patent US 10/173,672 June, 19, 2002; CA 2,389,901 June, 20, 2002). Each signal intensity was integrated >15 to 20 µm square pixels and recorded in 16-bit format. For each spot and fluorochrome, the background and the signal pixels were segmented with a specific Expectation-Maximization algorithm, and finally the net signal intensity of a spot was obtained by subtracting calculated local background intensity from the signal intensity. The dye-swap hybridization allows us to estimate the reproducibility between the two independent measures obtained for either the reference thyroid or the clinical sample (21)
. A reliability factor was calculated to compare the two measurements of a given spot and to score the reproducibility of signal intensities obtained for Cy3C and Cy5C. The reliability factor (i) factor of the ith spot is equal to the ratio of the two relative intensities measured for the ith spot (or inverse ratio if >1).
For additional intermicroarray comparisons and for calculating the expression ratio "clinical sample/thyroid reference," the series of paired Cy3C and Cy5C measures (each being already scored with its reliability factor) were corrected on the line of slope 1, and normalization was performed according to a set of constantly expressed genes (22) . A cutoff of 0.6 for reliability factor was used, considering that the two measures are reproducible.
The discrimination between two categories of samples, for example, thyroid adenomas and normal thyroid samples, is performed in two steps. The first step involves finding a set of genes that are differentially expressed within each category. The second step involves clusterizing samples according to this set of genes. The expression data are organized in a matrix N.P, where the N lines represent the genes and the P columns denote the samples. Each line of the matrix (vector line) is centered according to the following formula:
![]() |
![]() |
Gene Clustering.
To identify these genes that differentiate between two categories, we constructed two templates that represent the two theoretical vector lines. The first vector line has a value of 1 in all of the samples belonging to the first category and a value of 1 in samples belonging to the second category. The second vector line is the inverse of the first one with values of 1 in samples belonging to the first category and 1 for those belonging to the second category.
After centering and normalization of the data, the two templates and the experimental vector lines were clustered using locally developed software (ClusterIt, Paris, France)
All of the initiator vectors located at a weighted Euclidian distance inferior to a threshold s were grouped in the same cluster. If a vector also belonged to another cluster, there are two possible solutions. First, if this common vector is the initiator vector of one of the two clusters of interest then for each cluster, the mean of the vectors exclusively belonging to it is calculated. If the weighted Euclidian distance between the two means is inferior to the threshold S, the two clusters are grouped. If the distance is not inferior to the threshold, the two clusters are maintained as different clusters, and the common vectors are included in the nearest cluster (except if it is the initiator cluster). Secondly, if the common vector corresponds to a noninitiator vector, the vector is directly included in the closest cluster.
The threshold s is steadily decreased (s/100), and with each reduction, the same procedure is applied on each new cluster derived from the previous reduction. The initial threshold s is chosen such that
![]() |
![]() |
![]() |
![]() |
![]() |
The final step of the cluster analysis corresponds to s = 0, then each cluster is composed of a unique vector line. By following step by step the evolution of the cluster composition, it is possible to determine the lineage of each cluster and to define a hierarchical tree as a function of the cluster composition. In that context, attention was given to the two clusters, including the two template vector lines. To define at which s value (i.e., at what step), the composition of the cluster is homogeneous (see below), we developed two parameters Cd and RSC that permit the evaluation of cluster stability of a given:
![]() |
![]() |
![]() |
![]() |
![]() |
The definition of Cd and RSC factors are based on the following hypothesis: using DNA microarray, the error in the measure follows a normal distribution. Thus, as soon as a class of genes is homogeneous, a normal distribution of the vectors around the mean of all vectors within the class is expected. At the first step the clustering, defined with a threshold
![]() |
Specimen Clustering.
Gene clusters, from the template vector line and the experimental vector lines, represent the set of genes that discriminate between two categories. For tumor classification, a new matrix excluding the previous template vector lines and adding two sample vector columns representing each of the two experimental categories were developed. The lines for the new columns were coded 1 for the first cluster of genes and 1 for the second cluster of genes. These two additional columns represent an experimental sample template for each category, whereas other columns correspond to the gene expression of the selected genes in each experimental sample. The vector lines of the transposed matrix organized around the vector template allowed for the construction of the hierarchical sample tree. For calculating the mean value of gene expression within each category, a matrix N'.Q is defined for a given category, where the N' lines represent the genes and the Q columns the different samples within the chosen category. For each gene, the weighted mean of expression was calculated according to the following equation:
![]() |
The mean reproducibility factor is calculated according to the following equation:
![]() |
![]() |
![]() |
We also investigated whether the follicular variant of papillary carcinoma can be distinguished from conventional papillary carcinoma by constructing 2p vectors (p being the number of tumors) with the coordinates randomly equal to 1 or 1. The se vectors were organized in amatrix N.P, where the N lines represent the genes and the P columns the tumors. The five papillary tumors were randomly organized in the matrix, where each gene represents a vector. Within the homogeneous clusters of genes, we looked for clusters presenting two distinct levels of gene expression within these groups of tumors. We identified all of the clusters that permit the discrimination of one tumor against four or two tumors against three. We then grouped clusters that identify the same tumor, and, finally, we retained the group of clusters occurring at the highest frequency. With a such procedure, we constructed a reduced matrix N'. P and transposed the matrix to classify the five papillary tumors.
Quantitative Real-Time Reverse Transcription-PCR.
For several genes, mRNA expression was analyzed by real-time reverse transcription-PCR (ABI PRISM 7700 Sequence Detector apparatus, Applied Biosystems (Courtaboeuf, France). First-strand cDNA synthesis was described previously, reactions were performed using Syber green incorporation (SYBR Green PCR Core Reagent, Perkin-Elmer), and quantification was performed using the dedicated software (Gene Amp software, Perkin-Elmer). Quantification of each gene expression was calibrated using a reference standard curve obtained by serial dilutions of PCR product prepared from a mixture of cDNAs from normal and tumorous thyroid samples handled separately but concomitantly with clinical samples. Gene expression was normalized as a function of the expression of the 18S rRNA, as described previously (23
, 24)
.
| RESULTS |
|---|
|
|
|---|
|
|
The gene differences between two tumors within a category were used to develop a strategy for the expression within a category independent of the expression ratio. In that strategy, a given gene can be used to discriminate between two categories without applying a strict cutoff if the expression ratio of this gene is as follows: (1) the same for all samples within a category, and (2) different between the two categories. As shown in Fig. 2, A and B
, and Fig. 3, A and B
, 23, 52, and 80 genes define adenoma from papillary carcinoma, follicular carcinoma from follicular variant of papillary carcinoma, and adenomas from follicular carcinomas, respectively. The differentially expressed genes classifying follicular adenomas from follicular carcinomas are listed in Table 2
. Restricting gene clustering to the five papillary carcinomas, a set of genes (Table 3)
differentiated the follicular variant from conventional papillary carcinomas (Fig. 3B)
.
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
The data also show that adenomas can be differentiated from follicular carcinomas by a set of 43 genes of known functions. Of these, 10 were reported previously (17) to be highly expressed in follicular carcinomas (PEG10, PLCB4, FBN1, ID4, DUSP1, NID, BSG, CBS, HXB, RPP1, and TRAP240) and 3 genes (SORD, X123, MCG14797) in follicular adenomas (25 , 26) . Granulin, a putative growth factor, was noted to be significantly overexpressed in carcinomas compared with adenomas. Previous studies of glioblastoma and gastric and ovarian cancers have also shown an up-regulation of this gene (27, 28, 29) . Of the down-regulated genes, DUSP 1, which has known proliferation and differentiation functions, has been shown previously to be down-regulated in follicular carcinomas (17) and in advanced epithelial ovarian cancer (30) . We also noted the PKIA gene, a potent inhibitor of protein kinase B, to be down-regulated in follicular carcinomas compared with adenomas, supporting previous molecular studies of sporadic thyroid cancer (31) . As in earlier studies, the VEGF gene was characteristically noted in follicular carcinomas (32 , 33) .
Our study identified a set of 23 and 52 genes that differentiated papillary carcinoma from both follicular adenomas and carcinomas, respectively (Fig. 2, A and B)
. These genes were composed of members of the protein metabolism/catabolism (15%), intracellular trafficking (12%), ion/metal binding and transport proteins (10%), and transcription related (12%) gene families. The Cyr61 gene was underexpressed in both papillary and follicular carcinomas (Fig. 2A)
compared with adenomas, and it was shown previously to be down-regulated in papillary carcinomas (16)
. Also MCM7, a DNA replication gene, was overexpressed in papillary carcinomas compared with adenomas. Although, the overall gene expressions of papillary and follicular carcinoma (Fig. 1)
showed common features, we identified individual genes that differentiate between papillary and follicular carcinomas. Among these are the DUSP6 and DUSP5, members of the dual-specificity phosphatase gene family. DUSP5 has been reported previously to be up-regulated in premalignant lesions and underexpressed in invasive pancreatic carcinoma (34)
. The MFGE8 gene that plays a key role in activities such as cell motility, activation, contact, and the maintenance of the membrane is also highly expressed in papillary carcinomas compared with adenomas. MFGE8 was shown previously to be highly overexpressed in human breast tumors (35)
.
Gene cluster analysis additionally segregated the conventional form from the follicular variant of papillary carcinomas (Fig. 3B)
. Fourteen of these genes had transcriptional functions and included 2 of the Id4 and HOXC6, down-regulated and 12 (TRIM28, CUTL1, MSL3L1, SNW1, E64F1, CGBP, JUNB, ASCL1, IFGS3G, HCNGP, HCNP, and SSX1) overexpressed. Only a few of these genes have been reported previously to be associated with thyroid gland development. The Id4 expression, a dominant-negative transcriptional regulator, is known to be mainly expressed in thyroid (26)
and to induce cell proliferation and inhibit differentiation induced by thyroid hormone (36)
. The NR0B2 gene, a negative regulator of receptor-dependent signaling pathways, interacts with thyroid hormone and retinoid and thyroid hormone receptors to repress nuclear hormone receptor-mediated transactivation (37)
.
Interestingly, the human ACSL1/HASH1 gene, a transcription factor that is overexpressed in the follicular variant of papillary carcinoma, has also been reported to be highly expressed in neuroendocrine tumors, including medullary thyroid and small cell lung cancers (38)
. The finding suggests that the follicular variant, in contrast to the conventional form, preferentially manifest neuroendocrine features. The finding of under- and overexpression of the NTRK2 and NTRK3 genes, respectively (Table 4)
, and reports of medullary thyroid carcinoma support this hypothesis (39)
. Similarly, the 2NF151/MIZ2I, CDK2API, and P12DOC-I genes, which are involved in cell proliferation (40
, 41) , were found to be overexpressed in follicular variants than in conventional papillary carcinomas. The PEG10 gene was distinctly overexpressed in the follicular variant compared with the conventional papillary carcinoma. This gene has been reported to be highly expressed in hepatocellular carcinomas (42)
and was found to differentiate follicular thyroid tumors from normal thyroid (17)
. Cyclin D1 (CCND1), a regulatory subunit of CDK4 or CDK6, has been found to be overexpressed in papillary thyroid carcinomas (43)
and predicts lymph node metastases in papillary thyroid carcinoma (44)
.
In addition, the BGLAP, SPARC, and MSN genes, which bind to the Osteopontin (OPN) gene, were found to be distinctly overexpressed in the follicular variant of papillary carcinoma. These genes play a prominent role in bone and calcium metabolism and the formation of psammoma bodies in meningiomas (45) and ovarian serous papillary (46) and thyroid carcinomas (47) . In addition, SPARC and MSN are also associated with tumor invasiveness and metastasis (48 , 49) . Tumor necrosis factor, a proinflammatory cytokine, TNFRSF25, a receptor expressed preferentially in tissues enriched with lymphocytes, NOTCH2, furin (PACE-1), GRB2, LNK, and HLA-DRB1 and HLA-DQA1, MHC-class II, were also overexpressed in follicular variants compared with papillary carcinomas. The expression of MHC class II was reported previously to be altered in papillary carcinomas, including the follicular variant (50) .
Our study shows that gene expression array analysis correlated generally with classification of thyroid tumors. We also identified a set of genes that are differentially expressed between follicular benign and malignant neoplasms and between these tumors and papillary carcinoma. Although interesting, the final verification of the true biological nature must await additional follow-up on these patients.
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Requests for reprints: Adel K. El-Naggar, The University of Texas M.D. Anderson Cancer Center, Department of Pathology, 1515 Holcombe Boulevard, Houston, TX 77030. Phone: 713-792-3109; Fax: 713-792-5532; E-mail: anaggar{at}mdanderson.org
Received 1/15/04; revised 5/11/04; accepted 6/22/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. Durand, C. Ferraro-Peyret, S. Selmi-Ruby, C. Paulin, M. El Atifi, F. Berger, N. Berger-Dutrieux, M. Decaussin, J.-L. Peix, C. Bournaud, et al. Evaluation of Gene Expression Profiles in Thyroid Nodule Biopsy Material to Diagnose Thyroid Cancer J. Clin. Endocrinol. Metab., April 1, 2008; 93(4): 1195 - 1202. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. T. Netea-Maier, S. W. Hunsucker, B. M. Hoevenaars, S. M. Helmke, P. J. Slootweg, A. R. Hermus, B. R. Haugen, and M. W. Duncan Discovery and Validation of Protein Abundance Differences between Follicular Thyroid Neoplasms Cancer Res., March 1, 2008; 68(5): 1572 - 1580. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Klopper, A. Berenz, W. R. Hays, V. Sharma, U. Pugazhenthi, J. Janssen, M. Singh, R. P. Bissonnette, and B. R. Haugen In vivo and Microarray Analysis of Rexinoid-Responsive Anaplastic Thyroid Carcinoma Clin. Cancer Res., January 15, 2008; 14(2): 589 - 596. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Riesco-Eizaguirre and P. Santisteban New insights in thyroid follicular cell biology and its impact in thyroid cancer therapy Endocr. Relat. Cancer, December 1, 2007; 14(4): 957 - 977. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fujarewicz, M. Jarzab, M. Eszlinger, K. Krohn, R. Paschke, M. Oczko-Wojciechowska, M. Wiench, A. Kukulska, B. Jarzab, and A. Swierniak A multi-gene approach to differentiate papillary thyroid carcinoma from benign lesions: gene selection using support vector machines with bootstrapping Endocr. Relat. Cancer, September 1, 2007; 14(3): 809 - 826. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Eszlinger, K. Krohn, A. Kukulska, B. Jarzab, and R. Paschke Perspectives and Limitations of Microarray-Based Gene Expression Profiling of Thyroid Tumors Endocr. Rev., May 1, 2007; 28(3): 322 - 338. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. L. Griffith, A. Melck, S. J.M. Jones, and S. M. Wiseman Meta-Analysis and Meta-Review of Thyroid Cancer Gene Expression Profiling Studies Identifies Important Diagnostic Biomarkers J. Clin. Oncol., November 1, 2006; 24(31): 5043 - 5051. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Lubitz, S. K. Ugras, J. J. Kazam, B. Zhu, T. Scognamiglio, Y.-T. Chen, and T. J. Fahey III Microarray Analysis of Thyroid Nodule Fine-Needle Aspirates Accurately Classifies Benign and Malignant Lesions J. Mol. Diagn., September 1, 2006; 8(4): 490 - 498. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Eszlinger, M. Wiench, B. Jarzab, K. Krohn, M. Beck, J. Lauter, E. Gubala, K. Fujarewicz, A. Swierniak, and R. Paschke Meta- and Reanalysis of Gene Expression Profiles of Hot and Cold Thyroid Nodules and Papillary Thyroid Carcinoma for Gene Groups J. Clin. Endocrinol. Metab., May 1, 2006; 91(5): 1934 - 1942. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting E |