
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Imaging, Diagnosis, Prognosis |
Authors' Affiliations: 1 Fred Hutchinson Cancer Research Center, Seattle, Washington and 2 Department of Pathology and Laboratory Medicine, Mount Sinai Hospital; Department of Clinical Biochemistry, University Health Network and Toronto Medical Laboratories; and Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
Requests for reprints: Eleftherios P. Diamandis, Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, 600 University Avenue Toronto, Ontario, Canada M5G 1X5. Phone: 416-586-8443; Fax: 416-586-8628; E-mail: ediamandis{at}mtsinai.on.ca.
| Abstract |
|---|
|
|
|---|
Experimental Design: We here validate the performance of hK11 on an independent data set and further characterize its behavior in multiple types of controls. We also investigate its behavior when combined with CA 125 to form a composite marker. hK11 had not previously been evaluated on these serum samples. CA 125, hK11, and the composite marker were evaluated for their performance in identifying ovarian cancer and for temporal stability.
Results: hK11 significantly distinguished ovarian cancer cases from healthy controls and is less sensitive to benign ovarian disease than is CA 125.
Conclusion: We conclude that hK11 is a valuable new biomarker for ovarian cancer and its temporal stability implies that it may do even better when used in a longitudinal screening program for early detection.
Currently, one of the most promising early detection strategies for ovarian cancer uses the serum biomarker CA 125 in conjunction with imaging as a trigger for surgical intervention. CA 125 can identify 85% of clinically advanced ovarian carcinomas (3–5), and several studies have shown that elevations in CA 125 may occur 18 months or more before clinical diagnosis (6, 7). Moreover, the ability of CA 125 to detect cancer early in a screening program is supported by the observation that individual women have temporally stable levels of the marker (8). This suggests that specially tailored screening algorithms could lead to disease detection based on very small serial elevations in CA 125 levels (9–12).
Although CA 125 may be among the best available single diagnostic ovarian cancer biomarkers, its sensitivity and specificity are imperfect. In particular, it is elevated above reference levels in only 50% of clinically detectable early-stage disease (3–5) and it is frequently elevated in patients with benign ovarian tumors and other gynecologic diseases (13–15). Adding one or several markers to CA 125 for use as a composite marker (CM) could improve diagnostic performance if sensitivity were improved with no loss in specificity.
Several research groups recently reported candidate biomarkers for ovarian cancer diagnosis and early detection. For example, carcinoembryonic antigen, placental alkaline phosphatase, various other carbohydrate antigens (e.g., CA15-3 and CA19-9), OVX1, matrix metalloproteinases, prostasin, HE4 protein, mesothelin, members of the interleukin family, and inhibin have all been proposed as candidate ovarian cancer biomarkers (reviewed in ref. 16). In addition, the family of kallikreins, a group of serine proteases encoded by 15 genes that are localized in tandem on human chromosome 19q13.4, has also been shown to be candidate markers for ovarian and other cancers (17–20), including breast, testicular, and prostate cancer (reviewed in refs. 17, 20). Specifically for ovarian cancer, we have previously reported that serum levels of human kallikreins 5, 6, 8, 10, 11, and 14 are elevated in many patients with ovarian cancer (21–26). Human kallikrein 11 (hK11), measured by a newly developed ELISA (19), was found to be elevated in the serum of
70% of ovarian cancer patients (at 95% specificity) and has favorable prognostic value in ovarian cancer (26, 27).
Here, we report the performance of hK11 on its own in a set of sera collected from cohorts not previously evaluated for hK11, which constitutes a validation set for this marker. We characterize hK11 on its ability to distinguish ovarian cancer from healthy controls, from women with benign ovarian disease, and from women undergoing surgery who have histologically normal ovaries. We include the latter surgical normal control group to evaluate the potential for biases that may arise when different sample collection methods are used for cases undergoing surgery and for healthy controls; differences between surgical controls and normal controls may indicate biases in the ascertainment of the case and control samples or perhaps biomarker sensitivity to nonspecific conditions. As a first step toward evaluating the performance of hK11 as an early detection marker, we also evaluated the temporal stability of the marker over a 1-year period among healthy women. This temporal stability assesses whether the sensitivity and specificity of the marker will improve if used in a longitudinal screening program (11, 12, 28).
In addition, we evaluated the marker CA 125 in each subject to compare hK11 and CA 125 and also to investigate whether the two markers could complement each other when combined in a CM.
| Materials and Methods |
|---|
|
|
|---|
Specimens from women with ovarian cancer, women with benign ovarian disease, and surgical normal controls were collected in surgery, following anesthesia, but before surgical intervention (i.e., removal of the ovaries). A pathologist examined fixed, paraffin-embedded specimens to confirm the histology for these tumors. The serum from healthy women came from a National Cancer Institute–funded ovarian cancer screening research trial and so represented healthy asymptomatic women (29, 30). Specimen collection and processing protocols were identical for all women regardless of case or control status; participants donated up to 50 mL of blood, which was processed into sera, plasma, and WBC and epithelial cell pellets.
We characterized all research participants at the time of specimen collection with respect to race, age, menopausal status, and use of hormone replacement therapy (HRT). Women were considered postmenopausal if they reported no menstruation for 6 months, used HRT, or were >50 years old and did not report menstrual history. All women using HRT had been taking it for
1 year. Stage and histology were recorded for all cancer cases. Table 1
summarizes these variables for the study population.
|
As a first step toward evaluating the relevance of our marker when used in a screening study, we also did measurements on specimens collected serially (1 year apart) from a subsample (n = 20) of the healthy controls. Temporal stability was measured by the correlation between the two time points. The performance of markers with high temporal stability may improve when measured over time to monitor deviations from baseline.
ELISA assay for hK11
The ELISA assay for quantifying hK11 in serum has been previously published and validated (22). In short, this assay has a detection limit of 0.1 µg/L, and the dynamic range extends to 50 µg/L. The assay has no cross-reactivity from other tissue kallikreins and varies within the measurement range by <10%. All samples were analyzed undiluted in duplicate. More recently, we improved the detection limit to 0.02 µg/L without changing any other assay characteristics.3 We evaluated CA 125 using the CA 125II sandwich RIA kit from Fujirebio Diagnostics according to the manufacturer's directions. The intraassay and interassay coefficients of variation were <10%. All samples were blinded to the technologists running the assays, and the code was broken to the statisticians after the database was constructed.
Statistical analysis
Quantifying the diagnostic ability of a marker. Receiver operating characteristic (ROC) curve methods were used to quantify marker performance for hK11 and CA 125. ROC curves associate the sensitivity of a diagnostic test to the entire range of the possible false-positive rate. The false-positive rate is equal to one minus test specificity. The area under the ROC curve (AUC) indicates the average sensitivity of a marker over the entire ROC curve. We also computed the sensitivity of each marker at 95% specificity, a value more relevant to diagnosis and early detection than the overall average sensitivity measured by the AUC. Establishing statistical significance of a single marker is done by the Wilcoxon rank-sum test, which evaluates the significance of the entire ROC curve.
Standardizing markers. To aid interpretation of our data when comparing two markers, we first transformed all markers with the natural log so their behavior among healthy women more accurately reflected a normal distribution. We then standardized the markers so they had a mean of 0 and unit SD in the sample of healthy controls (28). Standardization of the markers, which leaves the ROC curves and temporal stability unchanged, facilitates the comparison of two different markers because their units of measurement are now similar (the number of SDs above the average normal subject) as illustrated below.
Combining markers. We evaluated a CM of hK11 with CA 125 as a linear combination, or weighting, of the standardized CA 125 and hK11, where logistic regression is used to estimate the weights. Logistic regression has several theoretical properties that make it convenient for applied biomarker research (31), including its capacity to estimate the optimal marker combination.
We estimated our CM by predicting ovarian cancer cases from among all noncancer controls, including healthy, benign, and surgical controls. We limited our attention to a linear combination (i.e., a logistic regression linear link) to facilitate ease of interpretation, although other more complex rules are possible.
Evaluating temporal stability. We measured the temporal stability in healthy subjects by computing the Pearson correlation from two time points in the 20 healthy women for whom yearly specimens were available. Markers with high Pearson correlation yielded improved performance in a longitudinal algorithm (8, 11). A high correlation, in particular one exceeding 0.5, implies that monitoring markers for their deviation from historical levels using the parametric empirical Bayes screening rule will yield earlier detection than a simpler diagnostic rule that ignores screening history (11, 32), although we cannot conclude that one marker is better than another, or a marker panel is better than a simple marker, based solely on its temporal stability.
| Results |
|---|
|
|
|---|
|
4.822 times more than hK11 in a typical case (5.464/1.133 = 4.822). Among healthy subjects alone, we found no statistically significant differences in CA 125 and hK11 marker concentration with respect to age, HRT use, or race. Nearly all cases and controls were postmenopausal, but data for CA 125 were consistent with it being elevated among women who are premenopausal and consistent with previous studies for these markers.
Results for CA 125. Table 3 shows that CA 125 is a significant predictor of ovarian cancer from among healthy women and that it did as expected (P < 0.0005; AUC = 0.936; sensitivity of 88.2% at 95% specificity). Moreover, CA 125 significantly differentiates between ovarian cancer cases and surgical normal controls (P < 0.0005; AUC = 0.902; sensitivity of 76%) and to a similar degree as it does between cases and healthy controls. There is no statistically significant difference between surgical normal and healthy normal controls (P = 0.840, bottom row of Table 3), implying that the marker is not affected by surgical collection. However, as expected, CA 125 is significantly different between healthy normal controls and women with benign ovarian disease (P = 0.029), and data also suggest differences between benign ovarian disease and surgical normal controls (P = 0.060). This shows the well-known sensitivity of CA 125 to nonmalignant ovarian tumors (13–15).
|
Marker combinations. We investigated the ability of hK11 and CA 125 to form a CM with a better ROC curve than either individual marker. The initial logistic regression controlled for menopausal status and indicator of serous or nonserous tumor. Because neither the interaction terms nor the main effects were significant, they were dropped from the regression. Our resulting CM weights standardized CA 125 with a coefficient of 1.126 (P < 0.001) and standardized hK11 with a coefficient of 0.761 (P = 0.094). Because the markers have been standardized, these weights imply that CA 125 conveys 60% [1.126 / (1.126 + 0.761) x 100] of the total information in the panel compared with 40% of hK11. The CM is thus defined by CM = 1.126 x (standard CA 125) + 0.761 x (standard hK11). In the marker panel, the coefficient of standard CA 125 is highly significant, but the coefficient for standard hK11 is only modestly significant perhaps due to sample size limitations. Thus, we could not be certain that the two markers combine to form an effective diagnostic panel. Addressing whether they combine together to diagnose cancer early will require the availability of preclinically collected specimens.
The joint behavior of standardized hK11 and CA 125 among cases and all control groups is displayed in Fig. 1 . The horizontal axis represents standardized hK11, and the vertical axis represents standardized CA 125. Note that the healthy controls, denoted with "x," have mean 0 and variance 1 for each marker. The cases, denoted by "o" in the figure, tend to have higher standardized values for each marker than do the various controls, which means a classification rule that uses only one or the other marker has some ability to separate the cases from controls. The role of a CM is to separate cases from controls using two dimensions instead of one. The diagonal line shown in Fig. 1 represents the 95% specificity classification rule estimated from a linear combination of the markers; points above the line are classified as cases and those below are classified as controls. Lines with more (or less) specificity can be represented by lines parallel to that given but higher (or lower) than that shown.
|
Longitudinal marker behavior. The performance of markers with temporal stability can be improved by accounting for marker history in a longitudinal screening program (8, 33), and several longitudinal algorithms have been proposed (9, 11, 12). In particular, the parametric empirical Bayes algorithm, intended for application of novel markers, makes use of the simple Pearson correlation of a marker measured at two different time points to generate a screening rule (11). Because the CM summarizes both markers into a single numerical score, the parametric empirical Bayes rule can also be applied to the CM.
Figure 2 plots the values of standardized CA 125, hK11, and the CM for the participants who provided repeat measurements. Both CA 125 and hK11 show high temporal stability, with correlations equaling 0.72 and 0.85, respectively. The high temporal stability of CA 125 has been well established (8), but the temporal stability of hK11 is reported here for the first time. The heterogeneity of hK11and CA 125 imply that the baseline levels of these markers can be highly informative and, if used for early detection in a longitudinal algorithm, that controls for baseline could achieve levels of performance that exceed the performance of the cross-sectional studies presented here. For example, McIntosh et al. (11, 12) show that, when used in a longitudinal study, a correlation of 0.85 for hK11 could mean detecting elevations that are 38% smaller than seen in this cross-sectional study while maintaining the same specificity. One cannot conclude how much earlier cancer would be detected using longitudinal samples without preclinical samples from women who eventually went on to develop cancer; this calculation only establishes the possibility of further gain by controlling baseline, whereas small correlations may rule out such gains.
|
| Discussion |
|---|
|
|
|---|
Here, we have independently validated hK11 as a serologic diagnostic biomarker for ovarian cancer. Our results are independent because we evaluated the concentrations of the marker in cases and in controls not previously characterized by hK11. In addition, we evaluated the temporal stability of hK11 and have shown (because the marker is highly heterogeneous in the population) that the marker could do even better if used in a longitudinal algorithm for early disease detection.
We also combined CA 125 and hK11 to evaluate the potential increase in diagnostic efficacy of the resulting CM. In the future, a multiparametric marker panel may be developed by combining other kallikreins that have shown diagnostic value for ovarian cancer (as mentioned above) as well as other newly discovered biomarkers. Future investigation of the combination of CA 125 and hK11 needs to be confirmed in studies specifically powered to evaluate such combinations (36).
The high correlation of hK11, comparable with that of CA 125 (Fig. 2), implies that its AUC and sensitivity reported in Table 3 could be improved if longitudinal algorithms are used to monitor for abnormal marker levels. The potential improvement in the detectable limit of a biomarker when using the parametric empirical Bayes rule can be quantified by computing [square root (1 – correlation)], which gives the relative size of a revised reference range for markers when controlling for screening history. Estimating this for each of the three markers while controlling for screening history would allow us to detect deviations of approximately half the magnitude with the same specificity and sensitivity. In particular for CA 125, hK11, and the CM, we could detect deviations of 44%, 47%, or 59% the size of those detected without controlling for history, respectively.
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Received 9/ 6/06; revised 3/ 5/07; accepted 5/30/07.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. Kuk, V. Kulasingam, C. G. Gunawardana, C. R. Smith, I. Batruch, and E. P. Diamandis Mining the Ovarian Cancer Ascites Proteome for Potential Ovarian Cancer Biomarkers Mol. Cell. Proteomics, April 1, 2009; 8(4): 661 - 669. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |