
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
Authors' Affiliations: Departments of 1 Internal Medicine I, 2 Visceral and Transplantation Surgery, 3 Internal Medicine III, and 4 Neuroinformatics, University of Ulm, Ulm, 5 Division of Functional Genome Analysis, Deutsches Krebsforschungszentrum, Heidelberg, 6 Department of General, Visceral, and Vascular Surgery, University of the Saarland, Homburg/Saar, Germany, and 7 Department of Pathology, University of Verona, Verona, Italy
Requests for reprints: T.M. Gress, Abteilung Innere Medizin I, Universität Ulm, Robert-Koch-Str. 8, 89081 Ulm, Germany. Phone: 49-731-500-24385/24311; Fax: 49-731-500-24302; E-mail: thomas.gress{at}medizin.uni-ulm.de.
| Abstract |
|---|
|
|
|---|
80%. The purpose of the current study was therefore to develop a novel diagnostic approach based on expression profiling of biopsy material using a specialized diagnostic cDNA array. Experimental Design: Previous gene expression profiling studies were reevaluated to design a 558-feature diagnostic array. Minimal amounts of residual material from pancreatic cytology samples as well as surgically resected tumor and control tissue specimens were analyzed using the diagnostic array and a newly developed statistical classification system.
Results and Conclusions: Our diagnostic approach resulted in 95% accurate differentiation between ductal adenocarcinomas and nonmalignant tumors of the pancreas. The diagnostic array, in conjunction with conventional diagnostic procedures, is thus suitable to significantly improve the reliability of pancreatic cancer diagnostics and can be expected to become a valuable new tool in the routine workup of suspect masses in the pancreas.
|
|
|
Analysis of single molecular markers is therefore not sufficient to provide for accurate diagnosis of suspect pancreatic masses. DNA arrays with their potential to survey the expression levels of many genes simultaneously represent ideal tools to circumvent this problem. Several expression profiling analyses using different technological platforms (1419) have shown the existence of distinct gene expression signatures characteristic of pancreatic cancer. However, the use of large-scale ("whole-genome") arrays is extremely costly and generates vast amounts of data which are difficult to analyze in a routine diagnostic setting. Both drawbacks can be circumvented by designing dedicated arrays with limited numbers of genes specifically selected for diagnostic purposes. The aim of this study was therefore to develop specialized cDNA arrays specifically designed for the differential diagnosis of pancreatic tumors based on expression profiling of fine needle aspiration biopsies. Because >90% of all malignant pancreatic tumors represent ductal adenocarcinomas (20), we focused on this tumor type for the present study.
| Materials and Methods |
|---|
|
|
|---|
Surgically resected pancreatic adenocarcinoma and chronic pancreatitis tissues were provided by the surgery departments at the Universities of Ulm and Homburg/Saar. Normal pancreas samples were obtained from healthy areas at the borders of chronic pancreatitis resectates. Informed consent was obtained from all patients prior to using tissue or biopsy samples. The study was approved by the local ethics committees at the Universities of Ulm (Germany), Homburg/Saar (Germany), and Verona (Italy).
RNA isolation and linear amplification. Snap-frozen surgical samples were ground on dry ice with a mortar and pestle, suspended in RLT buffer and total RNA were isolated using the RNEasy Mini Kit (Qiagen, Hilden, Germany). Fine needle biopsy in the routine diagnostic workup of pancreatic tumors was done with transabdominal ultrasound or endoscopic ultrasound guidance. FNAB sample material was recovered by flushing the needle and syringe with RLT buffer after material for cytologic analysis had been removed. Total RNA was then isolated using the RNEasy Mini Kit (Qiagen). In both cases, the total RNA was finally dissolved in water and quality-checked on a BioAnalyzer Lab-on-a-Chip system (Agilent, Waldbronn, Germany). In order to obtain sufficient material for hybridization, the complete FNAB RNA samples were subjected to one round of T7 RNA polymerase-based linear amplification using the MessageAmp Kit (Ambion, Huntingdon, Great Britain). To avoid data bias, all surgical samples were treated likewise by linearly amplifying 0.5 µg of total RNA prior to hybridization.
Array production and hybridization. cDNA fragments were PCR-amplified using vector primers and contact-printed in duplicate on nylon membranes (Nytran N+, Schleicher and Schuell, Germany). For radioactive labeling, the complete amplified RNA samples were labeled with 33P-dATP using the StripEZ-RT Kit (Ambion) and hybridized overnight to nylon membrane arrays in ULTRArray hybridization buffer (Ambion) at 50°C. Radioactive signals were detected using a STORM phosphorimaging system (Amersham Biosciences, Feiburg, Germany) and quantified with the ArrayVision software (InterFocus, Haverhill, Great Britain). Signal intensities were normalized to the mean signal intensity of all features on an individual array.
Construction of the classifier. All equations used are listed in Panel 1. Details of the analysis and complete data sets are available as part of the Supplementary Data.8 The nylon array data set was filtered to include only featfures (genes) for which normalized intensities exceeded a value of 0.8 in at least 10 samples to remove uniformly low (and thus uninformative) signals. Control spots were excluded as well. The remaining 169 features were used in the construction of a linear classifier based on the analysis of the 42-sample training set (Table 1).
During the first step of the analysis, a principal component analysis (21) was done on the training data set. The first 30 principal components of the training data set, which represented 99.9% of the total variation within the data, were then used for a linear discriminant analysis to search for combinations of principal components facilitating complete separation of the tumor from the control tissue samples in the training set. All possible combinations of up to 7 out of the 30 principal components were tested for their performance in the linear separation of the diagnostic classes. Evaluation of the feature set combinations was done by measuring the area under the receiver operating characteristic curve (2224). All combinations producing an area under the receiver operating characteristic curve of
0.95 were subjected to a stochastic search to add additional discriminative principal components until perfect separation of the diagnostic classes was achieved. Out of all combinations producing perfect linear separation, we selected the set that resulted in the greatest margin between tumor and control samples when plotting the samples according to their relative distances to the separating hyperplane. The resulting linear classifier was then evaluated using the independent 20-sample test set (Table 2).
| Results and Discussion |
|---|
|
|
|---|
In the present study, 16 FNAB samples of pancreatic adenocarcinoma and benign pancreatic tumors for which clinical patient follow up with a definitive diagnosis was available were analyzed both by conventional cytology and diagnostic array hybridization. Cytologic analysis of the 16 FNAB samples correctly identified a malignant process in 9 out of 10 adenocarcinoma cases (90%) and a benign process in 3 out of 6 chronic pancreatitis and pseudocyst cases (50%). The remaining adenocarcinoma case as well as the three benign cases were nondiagnostic due to the absence of evaluable intact cells. The resulting overall diagnostic accuracy of 75% is well in agreement with the numbers reported in the literature (14).
Residual material from the same biopsy samples which were used for cytologic analyses were subjected to expression profiling analyses using the diagnostic arrays. In order to ensure an adequate representation of different tumor stages in the data sets used for the development and evaluation of the classification procedure, we analyzed an additional 27 samples of histopathologically well-defined surgically resected ductal adenocarcinomas as well as 19 surgically resected control samples of chronic pancreatitis or normal pancreas. The samples were arbitrarily divided into a 42-sample training set (Table 1) and a 20-sample test set (Table 2), such that both sets contained equal proportions of malignant and benign samples as well as FNAB's. The training set was subsequently used to develop the classification system for the distinction between malignant and benign samples (see below), which was independently evaluated using the test sample set. The complete process is schematically outlined in Fig. 2.
|
For the construction of the classifier, we opted to perform linear discriminant analysis using the first 30 principal components of the training data set, which represented 99.9% of the total variation within the data. Because linear discriminant analysis assumes a relatively simple model of sample distribution, it is far less prone to overadaption to a specific data set than nonlinear methods, again increasing the robustness of the classification procedure. We identified a total of 429,917 different combinations of principal components producing perfect linear separation of tumor and control samples in the training data set. Out of these, a set of 23 principal components which provided the maximum margin between tumor and control samples was selected to define the linear classifier (Fig. 3A; see also Supplementary Information).
|
We were thus able to show that expression profiling analyses of surgically resected tumor specimens and FNAB samples using specialized diagnostic arrays with limited numbers of highly selected genes produce reliable, reproducible, and informative results. Supplementing conventional cytologic analyses of aspiration biopsies with diagnostic array profiling thus promises to significantly improve the accuracy of preoperative diagnostics of suspect masses in the pancreas. The large number of different combinations of principal components yielding perfect linear separation of the diagnostic classes in the training set, as well as the convincing performance of the classifier on the independent test set, serve to show both the expedience of the diagnostic gene collection as well as the validity of the analytic approach. Our results confirm conclusions drawn from earlier expression profiling studies using large-scale arrays, which have shown that the number of informative genes for the classification of different types and subtypes of cancer is usually <100 (2830), suggesting that dedicated diagnostic arrays should perform as well as whole-genome arrays in defined diagnostic settings.
Due to the use of residual material from biopsy needles for the analysis of the FNAB samples, the amount of starting material available for expression profiling analysis was extremely limited, so that we initially produced the arrays in the nylon membrane format to take advantage of the superior sensitivity of radioactive labeling and detection. Parallel hybridizations of a subset of samples to diagnostic arrays produced in the glass microarray format (see Supplementary Material), however, showed that the concept and design of the diagnostic array can readily be transferred to the glass microarray/fluorescent labeling platform as well, which may be better suited for routine clinical settings.
In the present study, we have focused on the distinction between pancreatic ductal adenocarcinoma and nonmalignant diseases of the pancreas, because pancreatic ductal adenocarcinoma is by far the most frequent malignant tumor arising in the pancreas and thus poses the clinically most relevant diagnostic problem (20). We are currently in the process of analyzing additional tumor entities, such as acinar and neuroendocrine tumors, using both the diagnostic array as well as large-scale arrays, in order to develop a multiclass classification system for the comprehensive diagnosis of different malignancies in the pancreas. In addition, we expect further development of the array in combination with careful analysis of clinical patient data to result in the recognition of distinct prognostic gene expression signatures predicting important clinical variables such as stage of disease, response to therapy, or prognosis, thus setting the stage for therapeutic regimens custom tailored to the individual patient.
| Appendix. Panel 1 (Equations used in the report) |
|---|
|
|
|---|
The sample mean
i is estimated by:
![]() |
The scatter matrices Si and Sw are:
![]() |
The projection vector w for Fisher's linear discriminant is given by:
![]() |
The area under the receiver operating characteristic curve is estimated by:
![]() |
j (j = 1, ..., n2) are the ranks of these cases obtained by ranking all n = n1 + n2 values of wTx.
The margin m for  = 1 is calculated by:
![]() |
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: M. Buchholz and H.A. Kestler contributed equally to this study and should both be considered first authors.
8 Supplementary information is accessible at http://www.informatik.uni-ulm.de/ni/mitarbeiter/HKestler/DiagArray/default.html. ![]()
Received 6/14/05; revised 7/28/05; accepted 8/10/05.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. Ghaneh, E. Costello, and J. P Neoptolemos Biology and management of pancreatic cancer Gut, August 1, 2007; 56(8): 1134 - 1152. [Full Text] [PDF] |
||||
![]() |
R. Chen, T. A. Brentnall, S. Pan, K. Cooke, K. W. Moyes, Z. Lane, D. A. Crispin, D. R. Goodlett, R. Aebersold, and M. P. Bronner Quantitative Proteomics Analysis Reveals That Proteins Differentially Expressed in Chronic Pancreatitis Are Also Frequently Involved in Pancreatic Cancer Mol. Cell. Proteomics, August 1, 2007; 6(8): 1331 - 1342. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |