
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
Authors' Affiliations: 1 Departments of Obstetrics and Gynecology and Reproductive Science, Yale University School of Medicine, New Haven, Connecticut; 2 Division of Public Health Science, Fred Hutchinson Cancer Research Center, Seattle, Washington; 3 The Nevada Cancer Institute, Las Vegas, Nevada; 4 Department of Statistics, The George Washington University, Washington, District of Columbia; 5 Millipore Corporation, Temecula, California; and 6 Departments of Epidemiology and Public Health and Obstetrics and Gynecology, Yale Cancer Center, Yale University School of Medicine, New Haven, Connecticut
Requests for reprints: Gil Mor, Department of Obstetrics, Gynecology and Reproductive Sciences, Reproductive Immunology Unit, Yale University School of Medicine, 333 Cedar Street FMB 301, New Haven, CT 06520. Phone: 203-785-6294; Fax: 203-785-4883; E-mail: Gil.Mor{at}yale.edu.
| Abstract |
|---|
|
|
|---|
Experimental Design: We analyzed 362 healthy controls and 156 newly diagnosed ovarian cancer patients. Concentrations of leptin, prolactin, osteopontin, insulin-like growth factor II, macrophage inhibitory factor, and CA-125 were determined using a multiplex, bead-based, immunoassay system. All six markers were evaluated in a training set (181 samples from the control group and 113 samples from OC patients) and a test set (181 sample control group and 43 ovarian cancer).
Results: Multiplex and ELISA exhibited the same pattern of expression for all the biomarkers. None of the biomarkers by themselves were good enough to differentiate healthy versus cancer cells. However, the combination of the six markers provided a better differentiation than CA-125. Four models with <2% classification error in training sets all had significant improvement (sensitivity 84%-98% at specificity 95%) over CA-125 (sensitivity 72% at specificity 95%) in the test set. The chosen model correctly classified 221 out of 224 specimens in the test set, with a classification accuracy of 98.7%.
Conclusions: We describe the first blood biomarker test with a sensitivity of 95.3% and a specificity of 99.4% for the detection of ovarian cancer. Six markers provided a significant improvement over CA-125 alone for ovarian cancer detection. Validation was performed with a blinded cohort. This novel multiplex platform has the potential for efficient screening in patients who are at high risk for ovarian cancer.
The high mortality rate of ovarian cancer is due to the lack of a screening strategy to detect early-stage disease. Ovarian cancer presents with very few, if any, specific symptoms. Twenty percent of patients are diagnosed at stage I and II when the disease is still confined to the ovary. In patients diagnosed with advanced disease, the 5-year survival rate ranges from 20% to 25%, depending on the stage and grade of tumor differentiation (2, 3). Of these patients, 80% to 90% will initially respond to chemotherapy, but less than 10% to 15% will remain in permanent remission (2, 4, 5). Therefore, an adequate screening test for early detection of ovarian cancer should greatly improve patient survival.
Currently, in some institutions, the screening strategy for ovarian cancer is annual pelvic examinations. Transvaginal ultrasound and serial measurements of the biomarker CA-125, have been included for the high-risk population, but with little success. CA-125 exhibits a sensitivity of less than 60% in early stages of the disease (6, 7). Even the addition of ultrasound screening to CA-125 measurement improves the positive predictive value (PPV) to only about 20% (8–10).
Recently, we described the characterization of a panel of biomarkers that can recognize ovarian cancer, including stage I and II (11). The test consisted of four serum proteins, which when analyzed together gave a sensitivity of 95% and a specificity of 94%. Although the reported sensitivity and specificity for this assay is better than any other presently available test, due to the low prevalence of ovarian cancer in the general population (40/100,000), it could be argued that it is not specific enough as a clinical test for ovarian cancer detection. In order to outweigh the benefit of early detection with the complication of surgery for false-positive screening results, an ovarian cancer screening strategy must achieve a minimum of 99.6% specificity (6).
The objective of this study is to evaluate whether the inclusion of additional biomarkers and the use of an alternative platform may improve the sensitivity and specificity of this test and will enable its use as a screening tool for the detection of ovarian cancer. We describe the development and characterization of a serum biomarker test with a sensitivity of 95.3% and a specificity of 99.4%.
| Materials and Methods |
|---|
|
|
|---|
|
Patients were enrolled in the "Discovery to Cure" early detection program in the Department of Obstetrics and Gynecology at Yale University under HIC protocol No. 10425/26784. Information about diagnosis, staging histology, and grade was provided by the Gynecologic Oncology Clinic at the Yale New Haven Hospital. No data allowing identification of the patients were provided.
Qualified personnel obtained informed consent from each individual participating in this study.
Sample collection
Ten mL of peripheral blood was drawn from subjects using standardized phlebotomy procedures (11). Within 2 to 4 hours of collection, samples were processed using guidelines set by the National Cancer Institute Inter-Group Specimen Banking Committee and stored at –80°C in the Tissue/Sera Bank of the Discovery to Cure program.
Multiplex analysis
Details of the assay are described in the Supplementary Material. In short, the Beadlyte 6-plex Ovarian Cancer Panel Kit includes two panels: one for leptin, prolactin, macrophage inhibitory factor (MIF), osteopontin, and CA-125 (Beadlyte 5-plex Ovarian Cancer Panel) and a separate panel for IGF-II (Beadlyte Anti-Human IGF-II Bead Set).
Multiplex study design
The characterization and validation of the multiplex assay was done using 535 individual serum samples divided in two study phases. In phase 1 (characterization), the training set consisted of 294 serum samples: 181 samples from healthy individuals (control group) and 113 samples from patients with newly diagnosed ovarian cancer (stage I-IV; case group).
In stage 2 (validation/test), 224 samples were evaluated in a blinded manner (the technical personnel running the test and the statistician had no prior information related to the samples). This test set consisted of 181 samples obtained from healthy individuals (43 samples from newly diagnosed ovarian cancer patients [Stage I-IV]). A smaller number of cases in the validation/test set was due to inclusion of only those samples that had never been used in the discovery process leading to the identification of the above six markers or combination rules, to avoid optimism in reporting performance. An additional 17 samples from patients with benign disease were evaluated with the final model.
Serum analysis using ELISA analysis. ELISA was used as previously described (11).
Statistical analysis of data
Three independent statisticians from two different institutions analyzed the data (ZF and GL from the Fred Hutchinson Cancer Research Center, Seattle, Washington and YL from George Washington University, Washington DC).
Training data analysis. Each individual marker was evaluated by a receiving operational characteristic (ROC) curve, the area under the ROC curve (AUC), and sensitivity at 95% specificity (ROC [0.05]). Confidence intervals (CIs) were calculated by 1,000 bootstrap samples. Four models were used to combine markers. Model I used logistic regression on raw values of each marker. Model II was similar to model I, except it allowed to drop some markers from the model. Model III created an indicator for each marker at the cutoff point corresponding to 95% specificity (1 if above cutoff, 0 otherwise, invert marker sign if smaller value indicates cancer). A logistic regression model was then used to find a linear combination of the six indicators to minimize the total classification error. Model IV was similar to model III, except the cutoff point was chosen to minimize the total classification error. Models III and IV aimed to produce a simple intuitive classification rule for clinicians with possible reduction in classification accuracy compared with models I and II. Ten-fold cross validations (CV) were used for estimating classification accuracies. To reduce the impact of randomness on 10-fold CV, 1000 random sample divisions were performed.
Validation set data analysis. Sensitivity, specificity, and total classification accuracy were calculated using the validation set data for each marker and for each of four models obtained from training data (Table 2 ).
|
| Results |
|---|
|
|
|---|
Characterization of six biomarkers in a multiplex platform: Training set. To increase the sensitivity and specificity of the multiplex assay, we added two additional markers to the multiplex platform: MIF, which we found to be highly expressed in the sera of ovarian cancer patients (11) and CA-125. First, we evaluated the serum concentrations of six markers with a training set consisting of 113 newly diagnosed ovarian cancer patients and 181 age-matched healthy individuals as the control group. MIF levels were significantly higher in the sera from ovarian cancer patients compared with healthy individuals (1302.45 pg/mL ± (232) vs 85.56 ± 25.2 pg/mL, P < 0.001) confirming our previous results using ELISA (12). Similarly, serum from patients with ovarian cancer showed elevated levels of prolactin, OPN, and CA-125 as well as low levels of leptin and IGF-II compared with the healthy group (Supplementary Table S1). The distributions of each marker in ovarian cancer (all ovarian cancers and early-stage ovarian cancers) and healthy control groups were compared with boxplots (Fig. 1 ). (The distribution summary for all six markers by case control/status is included in Table 1 of the Supplementary Material.).
|
|
Design of the final model for the six biomarkers in a multiplex platform. We then evaluated the performance of model I for both sets of samples, the training and test set. No differences were observed when we applied model I to the test set or the combination of training and test (Fig. 3B and Supplementary Table 2), therefore, we fit a "final model" using observations from the combined training and test sets:
|
The concentration for each biomarker in the healthy and cancer groups are summarized in Supplementary Table S3.
The performance of the final model was evaluated using 10-fold CV. From 362 healthy samples, two were misclassified, and from 156 ovarian cancer samples, five were misclassified (Fig. 3A). Evaluation of the results according to disease stages showed that from 36 stage I and II samples, 4 were misclassified (91.6%), whereas 119 of 120 stage III and IV samples were correctly classified (100%). The misclassified stage I and II samples were granulosa cell carcinoma, undifferentiated carcinoma, and a papillary serous carcinoma. From 17 samples obtained from patients diagnosed with a benign mass, two samples were misclassified as cancer, and the remaining 15 were correctly classified as normal (88.2%).
With this unique combination of six markers, we were also able to accurately recognize all of the different histologic subtypes of primary ovarian cancers: epithelial (papillary serous, clear cell, endometroid, and mixed), germ cell (yolk sac and dysgerminomas), and sex cord stromal (granulosa cell) tumors. The misclassified cancer samples were a stage I granulosa cell tumor, a stage II undifferentiated carcinoma, and a stage II papillary serous carcinoma.
ROC curves were used to examine the performance characteristic of the six markers in the training set, the test set, and these sets together, as well as comparing the results obtained from ELISA. As shown in Fig. 3B, the multiplex test using the six markers provided a significant improvement to the sensitivity and specificity compared with the 4-plex ELISA platform. As indicated above, no significant differences were found between the ROC curves for the training set, the test set, and the combination. Therefore, the statistical results for the final model are sensitivity 95.3% and specificity 99.4%. The PPV = TP/(TP+FP) for the test sample is 99.3% and the negative predictive value = TN/[FN+TN] is 99.2%. The final model provides more optimistic results compared with the test group, because it includes the training set; however, there are not major differences in terms of specificity and sensitivity for either model. Nevertheless, this final model will be validated in a multicenter validation study.
| Discussion |
|---|
|
|
|---|
Ovarian cancer does not show specific symptoms that allow its identification during early stages. Diagnosis is frequently made only after progression to later stages, at which point the dissemination of the cancer limits effective treatment. Therefore, the development of sensitive and specific methods for early detection has been a priority as a means for improving the treatment of this disease.
The potential implication of early detection of ovarian cancer on patient outcome is shown by the differential survival rates of women diagnosed at different stages of disease progression. The 10-year survival rate is nearly 90% when the disease is localized to the ovaries at the time of diagnosis and drops to 20% when the disease has spread to distant sites at the time of diagnosis (13).
Serum markers are useful tools for the diagnosis and follow-up of patients with different forms of cancer (14). Numerous proteins have been identified as specific markers for ovarian cancer either in blood or urine (13, 15, 16). However, no single protein has provided adequate sensitivity and specificity. As shown in this study, each marker by itself poorly differentiates between cancer and healthy individuals, whereas the combination of these markers was able to significantly increase both the specificity and sensitivity of the assay. Lokshin's group reported a multiplex assay consisting of 24 cytokines in combination with CA-125 that when used in different combinations could differentiate early-stage ovarian cancer (17).
Recently, Anderson suggested a road map for the creation of a viable diagnostic biomarker, which is composed of the following steps: discovery, verification/validation, and clinical implementation (14, 18). Our entire approach in the development of this test has followed the suggested "road map." For the discovery phase, we used protein arrays, which is a powerful technology for discovery but not for diagnostics. In addition, we limited the output to proteins associated with the control of cell growth. With this approach, we were able to avoid covering the whole proteome. The identified candidates were measured in a coherent technology platform, ELISA, which provided sufficient uniformity across samples and laboratories. In the validation stage, the samples were evaluated in a blind manner, the statistical analysis was confirmed by different approaches, and the results were confirmed by validation within the research community. For the third stage, the potential clinical implementation for a diagnostic test, we have included two additional markers, including CA-125 and implemented a novel platform that could simplify the test and decrease potential variability. The use of multiplex provides numerous advantages as a platform for the diagnostic implementation (17). The ability to measure multiple markers in a small sample volume makes the multiplex platform suitable for large validation studies (19). The use of six biomarkers is adequate enough to provide the sensitivity and specificity required but not sizeable enough to impede and complicate evaluation.
The biological significance of the described biomarkers provides for a better understanding of the test. The proteins identified in this study, with the exception of CA-125 and MIF, are all related to the normal physiology of the ovaries. These proteins are produced either by the surrounding supportive cells or as a response to signals originating from the ovary. IGF-II is the primary IGF in the human ovary, acting as mediator of gonadotropin action (20, 21). Prolactin on the other hand, has been shown to participate in the regulation of steroidogenesis in ovarian follicles, particularly the inhibition of progesterone secretion in the early stages of the follicular growth and its enhancement in the luteal phase (22, 23). There is strong evidence that links leptin and the hypothalamic gonadotropin-releasing hormone secretion, which affects the hypothalamic-pituitary-ovarian axis (24). The levels of expression of these proteins are maintained by a delicate balance between each of the cellular components of the ovary.
We hypothesize that the presence of abnormal cells may alter this intercellular communication and disrupt the axis, resulting in the abnormal expression levels observed in cancer samples. The proteins identified in this study may not represent factors produced by the tumor but instead represent the organ/body's response to the presence of neoplastic cells. We propose that significant levels of tumor products could only be detected in peripheral blood at later stages of tumor development; however, the body is able to recognize and respond to early stages of the tumorigenicity, and this is reflected in peripheral blood proteins. Based on this premise, the protein panel identified in this study is able to detect early stages of the disease.
An important aspect of this study is the use of different cohorts for the training and the test sets. As shown in our study, the use of the same samples for the training and test evaluation generate results that are not representative of the test's real function. If we use the same samples used for the training, we observed a high specificity and sensitivity for each of the markers; however, when we evaluate in a different cohort, the sensitivity and specificity of each marker decrease, although they remain higher than the standard CA-125.
Although the sensitivity of the test for stage I and II is 91.6%, it is significantly higher than the only available test CA-125, which has a sensitivity less than 60% in early disease (6, 7). Consequently, the use of this test will enhance the potential of treating ovarian cancer in its early stages and therefore, increases the successful treatment of the disease. In terms of the PPV for the general population, the test is above the suggested 0.10 necessary used as a screening test (6, 7).
| Conclusion |
|---|
|
|
|---|
The sensitivity of the test fulfills the last requisite of the proposed road map: a viable application for screening with a limited risk for false-positives results. The implementation of this test for early detection of ovarian cancer may prove beneficial for the management of this deadly disease.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Supplementary data for this article are available at Clinical Cancer Research Online (http://clincancerres.aacrjournals.org/).
Received 6/26/07; revised 9/28/07; accepted 10/18/07.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. L. Clarke-Pearson Screening for Ovarian Cancer N. Engl. J. Med., July 9, 2009; 361(2): 170 - 177. [Full Text] [PDF] |
||||
![]() |
J. Xie Activation of Hedgehog Signaling in Human Cancer: Basic Mechanisms and Clinical Implications Am. Assoc. Cancer Res. Educ. Book, April 18, 2009; 2009(1): 25 - 41. [Full Text] [PDF] |
||||
![]() |
C. Kuk, V. Kulasingam, C. G. Gunawardana, C. R. Smith, I. Batruch, and E. P. Diamandis Mining the Ovarian Cancer Ascites Proteome for Potential Ovarian Cancer Biomarkers Mol. Cell. Proteomics, April 1, 2009; 8(4): 661 - 669. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Dieplinger, D. P. Ankerst, A. Burges, M. Lenhard, A. Lingenhel, L. Fineder, H. Buchner, and P. Stieber Afamin and Apolipoprotein A-IV: Novel Protein Markers for Ovarian Cancer Cancer Epidemiol. Biomarkers Prev., April 1, 2009; 18(4): 1127 - 1133. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Garcia-Velasco and E. Somigliana Management of endometriomas in women requiring IVF: to touch or not to touch Hum. Reprod., March 1, 2009; 24(3): 496 - 501. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. VRZALOVA, M. PRAZAKOVA, Z. NOVOTNY, O. TOPOLCAN, M. CASOVA, and L. HOLUBEC JR. Test of Ovarian Cancer Multiplex xMAP Technology Panel Anticancer Res, February 1, 2009; 29(2): 573 - 576. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Mor, P. E. Schwartz, and H. Yu Reply to the Letters to the Editor from Greene et al. Clin. Cancer Res., November 15, 2008; 14(22): 7577 - 7579. [Full Text] [PDF] |
||||
![]() |
R. J. Coates, K. Kolor, S. L. Stewart, and L. C. Richardson Diagnostic Markers for Ovarian Cancer Screening: Not Ready for Routine Clinical Use Clin. Cancer Res., November 15, 2008; 14(22): 7575 - 7576. [Full Text] [PDF] |
||||
![]() |
M. H. Greene, Z. Feng, and M. H. Gail The Importance of Test Positive Predictive Value in Ovarian Cancer Screening Clin. Cancer Res., November 15, 2008; 14(22): 7574 - 7574. [Full Text] [PDF] |
||||
![]() |
G. P. Bertenshaw, P. Yip, P. Seshaiah, J. Zhao, T.-H. Chen, W. S. Wiggins, J. P. Mapes, and B. C. Mansfield Multianalyte Profiling of Serum Antigens and Autoimmune and Infectious Disease Molecules to Identify Biomarkers Dysregulated in Epithelial Ovarian Cancer Cancer Epidemiol. Biomarkers Prev., October 1, 2008; 17(10): 2872 - 2881. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |