
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Clinical Trials |
-Difluoromethylornithine
University of Kansas Medical Center, Kansas City, Kansas 66160 [C. J. F., B. F. K., D. A. B., M. S. M., C. H. J. C., J. A. F.]; Hutchinson Hospital Corporation, Hutchinson, Kansas 67502 [C. M. Z.]; University of Kansas, Lawrence, Kansas 66045 [A. L. S.]; University of Florida Health Science Center, Jacksonville, Florida 32209 [S. M.]; University of Alabama, Birmingham, Alabama 35294 [W. E. G.]; Ontario Cancer Institute, Princess Margaret Hospital, Toronto, M5G 2M9 Canada [N. F. B.]; Midwest Research Institute, Kansas City, Missouri 64110 [D. W. A.]; and National Cancer Institute, Bethesda, Maryland 20892 [K. A. J.]
| ABSTRACT |
|---|
|
|
|---|
-difluoromethylornithine (DFMO) was conducted in a group of women at high risk for development of breast cancer. DFMO is an irreversible inhibitor of ornithine decarboxylase, the limiting enzyme of polyamine synthesis that is often up-regulated in breast cancer. Experimental Design: Study entrants were required to have random periareolar fine-needle aspiration cytology prior to entry that exhibited hyperplasia or hyperplasia with atypia, as well as a mammogram and clinical breast exam judged as not suspicious for breast cancer and no clinical hearing loss. Subjects were randomized to 6 months of oral DFMO (0.5 g/m2/day) or placebo, followed by repeat fine-needle aspiration and biomarker assessment. The main study end point was an improvement in cytologic pattern.
Results: Of 119 subjects entered, 96% completed the study and were evaluable for the main study end point. A modest reduction (28%) in average total urine polyamines was obtained in the DFMO group, but there was no reduction in the spermidine:spermine ratio. There was no difference in cytologic improvement between DFMO and placebo. Likewise, there was no difference between DFMO and placebo for the secondary end points of breast molecular marker changes (immunocytochemical expression of proliferating cell nuclear antigen, p53, and epidermal growth factor receptor), mammographic breast density, serum insulin-like growth factor I: insulin-like growth factor binding protein 3 ratio, adverse events, quality of life indices, or subsequent cancer development.
Conclusions: DFMO at a dose level of 0.5 g/m2/day administered for 6 months does not modulate breast risk biomarkers tested in this study.
| INTRODUCTION |
|---|
|
|
|---|
DFMO is an irreversible inhibitor of ODC, the rate-limiting enzyme of polyamine synthesis (2) . It is attractive as a potential chemopreventive agent because it has the potential to disrupt biological processes important for both ER-negative and ER-positive tumor formation (3, 4, 5) , reduces the incidence of carcinogen-induced and spontaneous mammary cancers in animal models (6, 7, 8, 9) , and has minimal side effects at low doses (10, 11, 12, 13, 14, 15) .
Polyamines (putrescine, spermidine, and spermine) enhance proliferation and other changes associated with cell growth and transformation by increasing transcriptional activation at gene promotors including those for proteases, cell adhesion, and cytoskeletal proteins via hormone-dependent and -independent transcription factors (16, 17, 18, 19, 20, 21)
. This is accomplished through facilitation of coactivator binding to key receptors such as ER-
and nuclear factor
B and their corresponding response elements in DNA (5
, 21, 22, 23, 24, 25)
.
After exposure to estradiol or peptide growth factors such as IGF-I or epidermal growth factor, intracellular polyamines are increased via activation of ODC as well as enhanced intracellular polyamine transport from extracellular sources (26, 27, 28, 29, 30, 31) . Active transport of polyamines is tightly controlled in normal epithelial cells, but negative feedback signals are blunted in transformed and malignant tissues, resulting in increased intracellular transport despite elevated intracellular polyamine levels (6 , 32 , 33) . Increased ODC expression and elevated polyamine levels are observed in breast cancer relative to normal tissue, with ODC expression detected by immunohistochemistry in 1090% of malignant epithelial cells (34, 35, 36) . ODC expression and polyamine levels appear to be highest in poorly differentiated rapidly proliferating tumors (37 , 38) , but no consistent correlations have been reported with the presence or absence of hormone receptors (38, 39, 40) . ODC expression has also been detected in epithelial cells in benign breast disease (41) .
Inhibition of ODC activity by DFMO results in major reductions in putrescine and spermidine but only small reductions in spermine (6)
. Furthermore, spermidine and putrescine can be replenished via spermine pools through the enzyme spermidine/spermine N1-acetyltransferase (see Fig. 1
; Ref. 6
). Cell growth is diminished only with depletion of spermidine levels (6
, 11)
, so that the spermidine:spermine ratio is often used as a marker of DFMO biochemical activity (10)
in chemoprevention trials.
|
3 g/m2/day and occasional in doses of 13 g/m2/day but rare in doses of
0.5 g/m2/day (10
, 11
, 13, 14, 15
, 42, 43, 44, 45)
. Despite minimal side effects, low-dose DFMO (0.5 g/m2/day) is associated with a reduced colonic spermidine:spermine ratio (13)
. Therefore, a dose of 0.5 g/m2 was selected by the NCI Chemoprevention Branch as the optimal daily dose for Phase II prevention trials in breast as well as several other organ sites (46)
. In 1997, we initiated a Phase II prevention trial in women at increased risk for breast cancer. The purpose of the study was to assess the ability of DFMO at a dose of 0.5 g/m2/day for 6 months to modulate several breast cancer risk biomarkers relative to placebo. We selected random periareolar FNA as our method for breast tissue sampling to obtain cells for cytology and proliferation assessments based on our prospective study (47) , which demonstrated a high prevalence of hyperplasia (49%) and hyperplasia with atypia (21%) in women at increased risk for breast cancer undergoing this type of sampling. Furthermore, random FNA evidence of hyperplasia with atypia was strongly correlated with short-term risk of breast cancer independent of risk based on epidemiological factors (47) . Importantly, this sampling method had been demonstrated to have good patient acceptance with 85% of 480 women returning for a second aspiration within 618 months (47) .
Several risk biomarkers were selected as response indicators, including cytologic pattern, proliferation index, mammographic breast density area, and serum IGF-I level and the ratio of IGF-I level to its binding protein IGFBP-3 (48, 49, 50, 51, 52, 53) . Modulation of cytologic pattern was selected as the primary end point. Breast tissue polyamines were not assessed as part of this trial due to concerns that there would be insufficient specimen for assay. Urine polyamines were monitored, however, as a measure of the biochemical activity of DFMO in the individual subject.
| PATIENTS AND METHODS |
|---|
|
|
|---|
1 affected first-degree relative or multiple affected second-degree relatives), prior treated breast cancer or breast biopsy showing atypical hyperplasia or lobular carcinoma in situ, or mammogram exhibiting
50% density relative to the entire breast area. Women exhibiting one or more of these characteristics underwent a baseline periareolar random FNA (see below). Women younger than 30 years could only be aspirated if they were within 10 years of the age of onset of breast cancer in a close relative; women older than 55 years could only participate if they had prior evidence of generalized proliferative breast disease. Women were allowed to be on oral contraceptives or HRT, provided they had not changed their dose of these medications for at least 6 months prior to baseline aspiration.
To enroll in the study, women were required to have a baseline random periareolar FNA (C. J. F.) within 6 months of randomization demonstrating either hyperplasia with atypia or hyperplasia without atypia but with expression (
2+ intensity of
10% of epithelial cells) of p53 and/or EGFR. Women must have had reasonable renal and liver function (creatinine < 1.5 mg/dl, albumin > 3.0 g/dl, bilirubin < 1.5 mg/dl, aspartate aminotransferase < 100 units/liter, alkaline phosphatase < 200 units/liter). Separate consents were signed for the screening FNA and blood sampling and for participation in the chemoprevention trial. Mammography must have been performed within 6 months of randomization, with an interpretation (C. H. J. C.) of not suspicious for breast cancer, i.e., an ACR score of I-III (54)
. Subjects were required to report no clinical hearing loss and to exhibit hearing thresholds on baseline audiometry (J. A. F.) of no greater than 25 dB for frequencies between 0.5 and 2.0 kHz and no greater than 40 dB for frequencies above 2 kHz. Premenopausal women were required to take birth control precautions throughout the study.
Study Design.
A schema of the study design is shown in Fig. 2
. Subjects were stratified by menopause status and presence or absence of atypia in the baseline FNA and were randomized to take 0.5 g/m2 DFMO or placebo p.o. once per day for 6 months. DFMO was formulated as an aqueous solution containing DFMO at 200 mg/ml. Molecular weight of DFMO is 236.65. DFMO or matched placebo was to be taken with fruit juice once daily in the morning with breakfast. Subjects were contacted monthly by the study nurse (D. A. B.) and queried for adverse events using a formatted interview technique. Collection of urine for assessment of polyamines; collection of blood for measurement of DFMO levels as well as complete blood count, renal, and liver function tests; clinical breast exams; QOL surveys; and study medication compliance checks were performed every 2 months (i.e., 0, 2, 4, and 6 months). History and physical exam, mammography, audiometry, QOL surveys, FNA for breast biomarkers, and collection of blood for assessment of serum hormones and growth factors were repeated at the end of the 6-month study period. Optional germ-line genetic testing for BRCA1 and BRCA2 mutations was offered at study exit (55)
.
|
Cytologic Assessment.
Slides for cytology were prepared by filtration and pap staining and classified as nonproliferative, hyperplasia, or hyperplasia with atypia (56)
. Cytology preparations were also given a semiquantitative index score by assigning 14 points to each of six morphological characteristics (cell arrangement, pleomorphism, number of myoepithelial cells, anisonucleosis, nucleoli, and chromatin clumping) according to the method of Masood et al. (57)
. In general, nonproliferative cytology scored between 6 and 10, hyperplasia scored 1114, and hyperplasia with atypia scored 1518. Apocrine metaplasia does not lend itself to this scoring system because some nuclear features of apocrine metaplasia would result in samples scoring in the proliferative range. Because apocrine metaplasia is considered nonproliferative, samples with only apocrine metaplasia were arbitrarily assigned a score of 8, which is in the middle of the nonproliferative range. Morphological category and Masood scores were assigned separately, i.e., Masood scores were not used to determine morphological category assignment. A single cytopathologist (C. M. Z.) assigned cytologic category and semiquantitative Masood scores to all entry and exit specimens within 1 month of aspiration. These values were analyzed for the primary end point. Before unblinding of the study, all slides were comprehensively reviewed by the same cytopathologist (C. M. Z.) without knowledge of treatment assignment or whether the slide was a baseline or 6 month specimen. Slides were likewise reviewed in a blinded fashion by a second cytopathologist (S. M.). Slide sets for six subjects were not available for comprehensive review because either a baseline or repeat aspiration slide had been broken in transit.
Immunocytochemistry.
Slides were prepared for immunocytochemistry by cytospin technique (58)
. Slides for EGFR and PCNA were fixed in 10% neutral buffered formalin; acetone fixation was used for p53. EGFR was detected with F-7 monoclonal antibody (Sigma Pharmaceuticals), PCNA was detected with P10 antibody (Dako), and p53 was detected with Pab240 (Oncogene Science). PCNA was scored as positive only if strong or granular (replicon-associated) nuclear staining was observed because these patterns are better correlated with other proliferation markers as well as disease-free survival in cancer treatment trials (59, 60, 61, 62, 63, 64, 65, 66)
. Immunocytochemistry slides were scored by a weighted immunocytochemistry scoring system (67)
. For immunocytochemistry scoring, technicians selected the most intensely stained area on the slide for assessment, and 100 cells were manually counted. Biesterfeld et al. (66)
have demonstrated that selecting the most intensely stained area for ER, progesterone receptor, MIB-1, and PCNA in breast cancer had a greater prognostic/predictive significance than using a mean value for all viewing fields. Each cell assessed was scored as exhibiting 04+ stain intensity. The weighted score was calculated from the proportion of cells out of 100 exhibiting each stain intensity. Thus, if 20% of cells counted exhibited 0 staining, 50% of cells counted exhibited 1+ staining, and 30% of cells counted exhibited 2+ staining, the weighted intensity score would be 1.1. Two technicians scored each slide independently. In cases of disagreement, scores were averaged. In general, specimens that exhibited
2+ intensity staining of
10% of epithelial cells were considered to show evidence of antigen expression.
Blood Hormones, IGF-I, and IGFBP-3.
Serum for assay of estradiol, estrone, FSH, progesterone, IGF-I, and IGFBP-3 was obtained at the time of baseline aspiration or at study entry. All hormone, IGF-I, and IGFBP-3 specimens were obtained during day 114 of the menstrual cycle for premenopausal women. FSH specimens were obtained between day 3 and 5 of the menstrual cycle for premenopausal women. Serum was transferred to plastic tubes and stored at -80°C until analysis by radioimmunoassay for hormones and by two-site immunoradiometric assay for IGF-I and IGFBP-3 at Midwest Research Institute.
Urine Polyamine Levels.
First morning urine was collected at 0, 2, 4, and 6 months (end of study) and frozen at -80°C until analysis (Dr. Lawrence Demers, Pennsylvania State University Medical Center, Hershey, PA) of polyamine levels by reverse-phase high-performance liquid chromatography with fluorescence detection after derivitization with dansyl chloride (68, 69, 70)
.
Plasma DFMO Levels.
Plasma was collected at 2, 4, and 6 months and frozen at -80°C until analysis by reverse-phase liquid chromatography using UV detection (71)
at Midwest Research Institute.
ACR Scores of Mammograms.
Standard assessment of mammograms was performed by one radiologist (C. H. J. C.), who categorized each mammogram using the 5-point scoring system recommended by the ACR (65)
. Under this system, scores of I and II are assigned to nonsuspicious mammograms, a score of III is assigned to mammograms with suggested short interval follow-up, a score of IV denotes a suspicious abnormality that should be biopsied, and a score of V indicates a lesion highly suggestive of malignancy.
Mammographic Density.
For calibration purposes, all mammograms were performed on a rhodium target unit with a stepwedge in place. Films were digitized on a Lumiscan 85 film digitizer and then assessed (N. F. B.) for the percentage of area of increased density relative to the entire breast area in a blinded fashion as described previously (72)
.
Audiometry.
Standard audiometry evaluation was performed prestudy and poststudy and as clinically indicated after a complaint of hearing loss. Hearing thresholds at both air and bone conduction were determined for 0.5, 1, 2, 3, and 4 kHz; air only was determined at 6 and 8 kHz.
QOL.
Patient-rated QOL was assessed at 0, 2, 4, and 6 months with Health Survey SF-36 (73
, 74)
, which is used frequently in clinical trials and contains two summary scales reflecting QOL related to physical health and mental health (75)
. The two scales are aggregates of factor score coefficients on the eight subscales of Health Survey SF-36 (i.e., physical functioning, role function-physical, bodily pain, social functioning, emotional well-being, role function-emotional, vitality, and general health perceptions) based on data representative of the general population of the United States. They are standardized to have a mean of 50 and a SD of 10 in the general population, with higher scores representing more positive QOL. General population norms are available, and psychometric adequacy has been demonstrated in a variety of medical populations (75)
.
Statistical Methods.
The primary end point of the study was a comparison between placebo and DFMO of improvement in cytology assessed by both cytology category and a semiquantitative cytology index score, using the original readings of the primary cytopathologist (C. M. Z.). We assumed a possible 30% improvement in cytology pattern in the placebo group between baseline and reaspiration 6 months later. Given this assumption, accrual of 100 evaluable subjects would allow us to detect a 60% (doubling) improvement in cytology category for the DFMO group with 80% power and a type I error rate of 0.05. This same number of subjects gives us 80% power to detect a 5-point reduction in mean cytology index in the DFMO group compared with no reduction in the placebo group, assuming a SD of 10 points with a type I error rate of 0.05. We targeted 120 eligible subjects to ensure 100 evaluable subjects for the main study end point, as specified by the NCI contract. Subjects were stratified by cytology (hyperplasia with versus without atypia) and menopause status (premenopausal versus postmenopausal). This resulted in four groups with the indicated number of subjects in each: (a) atypia/premenopausal (36)
; (b) without atypia/premenopausal (33)
; (c) atypia/postmenopausal (24)
; and (d) without atypia/postmenopausal (26)
. Subjects were randomized within each stratum to placebo or DFMO in a 1:1 ratio, with the preparation of the blinded randomization codes performed by an outside statistical consultant.
Quantitative baseline variables were summarized by means, and categorical baseline variables were summarized by frequencies. Intent-to-treat principles were used throughout, and for analysis of primary end points, no change was imputed for cases of missing data. For secondary end points and exploration of possible correlations, only available data were analyzed, and no adjustments were made for multiple comparisons. The primary end point, cytologic improvement from baseline, was summarized by the mean change in index score and by the proportion of subjects who exhibited a cytologic improvement by treatment group. For purposes of analysis, improvement was defined as a change in morphological category (e.g., atypia to hyperplasia or hyperplasia to nonproliferative) or a decrease in the Masood score of 3 or more points. The criterion of 3 points was based on our previous experience that a trained cytopathologist was unlikely to vary by more than 2 points when blindly rereading the same slides. Secondly, it was considered that a decrease of 3 points would typically be accompanied by a change in morphological category and would therefore be clinically relevant. Fishers exact test was used to compare the proportion of subjects who exhibited a cytologic improvement between the two groups. Change in Masood score was summarized by its mean for each treatment group, and this change was compared via a two-sample t test. The proportion of subjects in each group showing improvement in ACR score was compared by Fishers exact test. Changes in other quantitative measures including PCNA, EGFR, and p53 intensity score and breast density were summarized and compared. Side effects were summarized by the percentage of subjects who had a grade 2 or higher adverse event in each treatment group, by specific events. Fishers exact test was used to compare the proportion of grade 2 or higher toxicities between the two treatment groups. Polyamine levels were summarized by means over four time periods, 0, 2, 4, and 6 months. Mixed linear regression was used to compare these levels over time, assuming an autoregressive correlation structure, between the two treatment groups, adjusting for baseline levels. Similar analyses were performed on QOL measures.
| RESULTS |
|---|
|
|
|---|
Baseline Characteristics.
Baseline characteristics in both the placebo and DFMO groups are listed in Table 1
. Mean age for the placebo and DFMO groups was similar at 46 and 47 years, respectively. Six minority, non-Caucasian women (5%) were enrolled in the study, and there was a slightly higher proportion of these women in the DFMO group. Fifty-eight percent of entrants were premenopausal, and 42% were postmenopausal. Sixty-two percent of postmenopausal subjects received hormone replacement during study, thus 84% of entrants had a high estrogen milieu at baseline, as reflected by a mean entry FSH of 17 mIU/ml (ranging from the limit of detection of 1.5 to 96.9 mIU/ml) and a mean estradiol level of 75 pg/ml (ranging from the limit of detection of 5 to 346 pg/ml) for the entire cohort. The mean area of the breast considered to be mammographically dense was 46% (range, 085%). Fifty percent of subjects had hyperplasia with atypia in their baseline FNA. Median 10-year calculated Gail risk was 3.9% and 4.6% for the placebo and DFMO subjects, respectively. There were no statistically significant differences between the placebo and the DFMO groups for any of the baseline variables shown in Table 1
. Furthermore, there was no difference in baseline variables, with the exception of Masood score, between women with atypia and those without atypia. Nor was there a difference, with the exception of serum FSH levels, between premenopausal and postmenopausal women.
|
Self-reported Study Medication Compliance.
With the exception of the five subjects mentioned above, compliance as indicated by self-reported consumption of study medication was excellent, with mean compliance values of 96% and 97% for the placebo and DFMO groups, respectively. Only one subject, randomized to DFMO, reported consuming <75% of study medication and thus was considered to be noncompliant.
Plasma DFMO Levels.
Because of the potential unreliability of self-reported compliance, plasma levels of DFMO were measured at 2, 4, and 6 months (for the DFMO group only) and used as a secondary measure of compliance. Mean plasma DFMO levels were 6.73 µg/ml (0.028 mM) at 2 months, 6.17 µg/ml at 4 months, and 6.53 µg/ml at 6 months. Values ranged from the limit of detection (0.5 µg/ml) to 18.3 µg/ml. For the three time points studied, there were 21 total instances in 16 subjects when no DFMO was detectable, including 8 subjects with no detectable DFMO at 6 months. However, five of these eight subjects had at least a 25% reduction in polyamines at 6 months. Thus, at most 3 subjects (of 60 subjects in the DFMO group that completed the study) could be considered as noncompliant on the basis of both DFMO level (below limit of detection at 6 months) and total polyamine levels.
Urine Polyamines.
Overall, means for total urine polyamines were lower in the DFMO group than the placebo group when adjusting for baseline level and time (P
0.0001). A 28% reduction in the average total polyamine level in the DFMO group versus an 8% reduction in the placebo group was observed by 2 months on study (Fig. 3)
. However, decreases were observed only in putrescine (29%) and spermidine (24%; P < 0.0001). No significant differences were noted between DFMO and placebo for spermine (P = 0.81) or spermidine:spermine ratio (P = 0.052) when adjusting for time and baseline values. There were no correlations between changes in urine polyamines and changes in biomarkers.
|
|
Change in Breast Proliferation and Other Molecular Markers.
There was no difference between placebo and DFMO groups in change in PCNA weighted intensity score between the prestudy and poststudy FNAs. The mean change in PCNA weighted intensity score was -0.12 (SD = 0.65) for the placebo group and 0.14 (SD = 0.70) for the DFMO group (Table 2)
. However, there was a significant difference (P = 0.017) between the placebo and DFMO groups in the change in the percentage of cells judged to have PCNA expression. The mean change in the percentage of cells expressing PCNA was -10% (SD = 28%) for the placebo group and 7% (SD = 38%) for the DFMO group, favoring the placebo group. Change in the percentage of cells expressing PCNA (but not weighted index score) was correlated with a change in mammographic breast density. No significant differences between placebo and DFMO groups were noted for change in EGFR or p53 expression as assessed by weighted intensity score (Table 2)
.
Mammographic Change.
With a mean baseline mammographic breast density area of 46.1% and 45.6% for placebo and DFMO groups, respectively, the mean change between the prestudy and poststudy mammograms in the area considered to be dense was -2.0% and -2.7% for placebo and DFMO groups, respectively, and was not different (Table 2)
. There was a significant difference in the percentage of subjects who showed improvement in categorical ACR scores, favoring the DFMO group (P = 0.017), using analysis of either all subjects, imputing no change for missing data [2 of 57 (4%) for placebo versus 11 of 62 (18%) for DFMO], or only those for whom prestudy and poststudy mammograms were available [2 of 54 (4%) for placebo versus 11 of 59 (19%) for DFMO]. No correlations were observed between change in individual polyamine levels and/or spermidine:spermine ratio and the change in breast density or change in ACR categorical scores.
Serum IGF-I/IGFBP-3.
For serum levels of IGF-I, IGFBP-3, or ratio of IGF-I to IGFBP-3, there was no difference between placebo and DFMO groups for prestudy values, poststudy values (Fig. 4)
, or change between the prestudy and poststudy values (Table 2)
. There was no correlation between individual changes in serum IGF-I or IGFBP-3 levels or IGF-I:IGFBP-3 ratios and changes in serum estradiol or urinary polyamine levels. Change in serum IGFBP-3 was negatively correlated with change in breast tissue PCNA (r = -0.23; P = 0.033).
|
|
Grade 2 or greater GI effects, reported tinnitus, or measured hearing loss were the same for placebo and DFMO groups (Tables 3
and 4
). GI toxicity (nausea, dyspepsia, flatulence, and diarrhea) occurred in 32% of subjects randomized to placebo and 34% of subjects randomized to DFMO. Similarly, 25% of subjects randomized to placebo reported tinnitus or had a
10-dB hearing loss, as did 27% of subjects randomized to DFMO. Study agent was discontinued when grade 23 tinnitus occurred. This generally resolved in 23 days, and study agent was reinstituted. Only four subjects (6.5%) in the DFMO group and two subjects (3.5%) in the placebo group had a
15-dB shift in the poststudy bone threshold audiogram (Table 4)
. The 10- and 15-dB shifts in threshold are criteria proposed by Shotland et al. (77)
for defining audiotoxicity in the context of a chemoprevention trial. There was no association between auditory toxicity and prestudy urinary polyamine levels, poststudy polyamine levels, change in polyamine levels, or plasma DFMO levels.
|
|
Mixed-model analyses of variance, controlling for baseline values on the dependent variables, were conducted on the two Health Survey SF-36 scales, with study arm and menopausal status as categorical independent variables and assessment point (time) as a within-subjects variable. No significant main effects emerged for study arm. A significant arm x time interaction emerged on the physical health scale (P = 0.018). Follow-up analyses revealed that women receiving DFMO reported lower QOL related to physical health than did women receiving the placebo, but only at the 4-month assessment (P = 0.028). Analysis of the mental health scale yielded a significant arm x menopausal status interaction (P = 0.0078). Mental health-related QOL was lower in the DFMO arm than in the placebo arm, but only for the postmenopausal subjects.
Cancer Development.
Including the one cancer that was detected while the subject was on study, four breast cancers have subsequently been detected in the DFMO group, and five breast cancers have subsequently been detected in the placebo group. Five subjects developing breast cancer had atypia (Masood scores of 16, 17, 15, 15, and 16) in their baseline FNA, and four had hyperplasia without atypia (Masood scores of 11, 10, 11, and 11) in their baseline FNA. Median follow-up time since randomization is 39 months. Subjects with baseline atypia developed cancer within 616 months of aspiration, and those without atypia developed cancer within 2350 months of aspiration (Fig. 6)
. By Cox regression analysis, there was no significant association between subsequent cancer detection and baseline cytology category, polyamine levels, Masood score index, or 10-year Gail risk; however, too few cancers have developed for a meaningful analysis.
|
| DISCUSSION |
|---|
|
|
|---|
The most probable reason for lack of a modulation of cytology and proliferation index was that a dose of DFMO sufficient to cause reduction in the spermidine:spermine ratio was not delivered. We did not measure breast DFMO levels, but the mean plasma DFMO level of 0.028 mM was below the 0.1 mM level consistently associated with reduced spermidine:spermine ratios and suppression of proliferation (3 , 78) . Furthermore, it is below the 1 mM level associated with reduction of spermine and enhanced apoptosis in in vitro studies (3 , 79) . Improvement in cytomorphology in women with benign breast disease is likely to require both a reduction in proliferation and an increase in apoptosis (80 , 81) . Exogenous polyamine availability from diet and colonic bacterial production, as well as high estrogen levels, may also counteract the ability of DFMO to suppress proliferation at low to moderate dose levels (82, 83, 84, 85) . Dietary polyamine restriction and antibiotics are often used to enhance the effects of DFMO in animal studies (83, 84, 85) , but we did not use such strategies. Furthermore, 84% of our subjects were either premenopausal or taking HRT.
Even today, the minimal dose level of DFMO required for effect in breast tissue is unknown. The 0.5 g/m2/day dose of DFMO used was that selected by the NCI Chemoprevention Branch based on its excellent safety profile after 612 months of administration (10 , 13 , 15) and its ability to reduce colonic putrescine and spermidine:spermine ratios (10 , 13) . Moreover, the minimal dose for biological effect may be tissue and disease specific (11) . Subsequent to initiation of our study, Carbone et al. (86) , using oral DFMO at an identical dose, reported no effects on skin spermidine levels, although putrescine was reduced. Likewise, Brenner et al. (87) found no reduction in esophageal polyamines or the proportion of cells expressing Ki-67/MIB-1 in a 6-month study of 0.5 g/m2/day DFMO versus placebo in subjects with Barretts esophagus. Mitchell et al. (88) found significant decreases in the cervical spermidine:spermine ratio after 1 month of DFMO at a dose of 1 g/m2/day but found no significant decreases at the 0.5 g/m2/day level. Because the current study was conducted under a Phase II NCI contract, no preliminary study was performed to document the effects of 0.5 g/m2/day DFMO on breast tissue polyamines or other biomarkers before initiation.
Likewise, the upper nontoxic dose limit for chronic DFMO administration is not firmly established. We did not observe adverse auditory, GI, or other events more frequently for subjects randomized to DFMO than for those randomized to placebo, similar to results reported by Meyskens et al. (10) using the same dose of 0.5 g/m2/day. Given the excellent tolerability of chronic DFMO in doses of 0.61.0 g/m2/day (15) , doses in this range could be explored for effects on breast polyamines as well as risk biomarkers. DFMO has a short half-life of 3.5 h, and twice daily dosing would be another option for more consistent polyamine suppression (89) .
Although inadequate dose is the most likely explanation for the lack of observed effect of DFMO on our main response indicators, another possible reason is that a portion of the study cohort may not have exhibited elevated ODC activity in breast tissue. A precedent is the situation in breast cancer treatment where antihormonal agents generally do not produce responses in the absence of ER-
expression (90)
. Subjects selected on the basis of elevated ODC expression in their hyperplastic epithelium might have been more responsive to DFMO, even at low doses. However, a specific antibody for detection of ODC expression by immunohistochemistry (36)
was not generally available when our study was initiated in 1997, and other means of assessment of ODC activity or polyamine levels were not feasible with the small amounts of tissue available from FNA.
Although the biomarkers we selected as response indicators had previously been shown to be associated with short-term or longer-term risk, other biological processes affected by DFMO and potentially important for breast neoplastic promotion and progression were not specifically assessed. ODC activity is associated with tyrosine phosphorylation of key signaling proteins in the mitogen-activated protein kinase pathway and release of matrix metalloproteinases (30 , 91) . Measurement of phosphorylated activated mitogen-activated protein kinase (pERK1/2) and metalloproteinases such as matrix metalloproteinase 2 might be a more sensitive indicator of the biological activity of low-dose DFMO than proliferation markers (36 , 88 , 91 , 92) .
Finally, given the modest sample size used in this study, physiological, sampling, and assessment variability may have contributed to the negative outcome. These are common problems in trials in which biomarkers are used as the primary outcome measures (67 , 93) and attest to the need to include a blinded placebo control in such studies. What measures could be taken to reduce these sources of variability?
Repeated sampling of the same area of the breast with core biopsies directed at a mammographically dense area and ductal lavage are two possibilities to reduce sampling variability. Historically, prevention trials with repeated core biopsies have not accrued well in the United States (94) , although a small successful trial using core biopsies in high-risk women has recently been reported from the United Kingdom (95) . There is little information on whether sampling or interpretive discordance would be less of a problem using this technique. Ductal lavage offers the theoretical advantage of repeated access to the same site with a less invasive procedure than core biopsy and has been reported as well tolerated (96) . However, ductal lavage is generally performed only for those women producing nipple aspirate fluid, which might eliminate 1540% of potential subjects (96 , 97) . There is little information on the success rate of repeated cannulation of the same duct after an interval 6 months or more. Given that occult hyperplasia with or without atypia is generally multifocal and multicentric (98 , 99) , it is possible that modulation of a field effect as assessed by random tissue sampling may have as much or more implication for risk reduction as elimination of a specific focus of atypia from a single duct.
It is difficult to determine how much of the observed variability in cytologic improvement was due to sampling versus interpretive discordance in our study. Interpretive variability is substantial when assessing either cytologic and histological preparations of breast tissue specimens (76 , 100) . Despite interpretive variability, hyperplasia ± atypia is an established risk factor for breast cancer (48) , and tamoxifen, an approved agent for breast cancer risk reduction (1) , was found to reduce the incidence of hyperplasia ± atypia in the National Surgical Adjuvant Breast and Bowel Project 1 trial (101) . Use of Consensus Panel Criteria (benign, intermediate/atypia, and suspicious) developed to characterize diagnostic FNA biopsies has been suggested as a method of reducing interpretive variability and has the advantage of consolidating borderline hyperplasia/atypical hyperplasia into one category (102) . Use of Consensus Panel Criteria has the disadvantage that both nonproliferative and most hyperplasia specimens without atypia are lumped under the benign grouping, resulting in only two groups and little room for change in response to an effective agent. Nuclear morphometry (103) and semiquantitative index scoring (57) such as that used in this study are also potential measures to reduce morphological interpretive variability, but none of these assessment techniques have been fully evaluated in prospective trials as risk biomarkers. Of interest, however, is that there was little variation in the cytology Masood index score in the placebo group (e.g., mean of -0.46 for the pre-poststudy change). If the Masood index score were to be assessed in a prospective trial as predictive of later cancer development/detection in a high-risk population, then change in cytology index score might be a better method for assessment of morphological change than improvement in cytologic category. Alternatively, improvement in cytologic category could continue to be used as a response end point if the number of evaluable subjects/arm was increased to offset the observed variability in the placebo group.
We observed substantial variation in the change in the proportion of epithelial cells expressing the proliferation marker PCNA in the placebo group. Use of Ki-67/MIB-1 has been associated with less interpretive variability than PCNA by some investigators (59 , 62 , 66) , and expression of Ki-67/MIB-1 within hyperplastic foci has been reported as correlating with breast cancer development in a case-control study (49) . Furthermore, early reduction in Ki-67/MIB-1 is associated with later clinical response to tamoxifen and other antihormonal agents in breast cancer treatment trials (104, 105, 106, 107) . However, the mean proportion of cells staining positive for MIB-1 is only 35% in hyperplasia (49 , 108) , whereas PCNA staining is much more diffuse, with a higher proportion of cells staining positive for benign breast disease (109) . Because we were concerned that the number of epithelial cells in the repeat FNA samples might be too low (e.g., <500) for adequate Ki-67/MIB-1 if DFMO was effective in reducing proliferation and/or normalizing morphological pattern, we chose PCNA for this study. However, for the majority of subjects, adequate numbers of cells would have been available for Ki-67/MIB-1 assessment at both baseline and completion of treatment.
Mammographic density is a reflection of the proportion of stroma and epithelial cells relative to fat (110 , 111) and varies with the proliferative activity of ductal epithelium and stroma (112) . We found little variation in area of density using a computer-based software system (113) for assessing the proportional area considered to be dense when imaging for premenopausal women was performed in the follicular phase of the menstrual cycle, and subjects were imaged on the same equipment at baseline and completion of study (114 , 115) . Given the importance of high breast density (>60%) as a risk factor (51 , 53) , low variation of mean area of density, and reduction of density with tamoxifen relative to placebo in small preliminary studies (116 , 117) , breast density deserves further study as a response end point in Phase II trials.
There was also little variation in serum IGF-I:IGFBP-3 ratios in the placebo group over the course of the study. High serum levels of IGF-I (e.g., >200 mM) and/or IGF-I:IGFBP-3 ratios in the upper tertiles of the normal range have been correlated with breast risk in premenopausal women (50) . IGF-I is a powerful breast epithelial cell mitogen, and in premenopausal women, serum IGF-I:IGFBP-3 may be a surrogate for tissue IGF-I bioactivity (118) . Although tamoxifen and several other potential preventive agents such as fenretinide reduce serum IGF-I:IGFBP-3 (119) , reduction in cancer incidence is not necessarily correlated with drug-induced reductions in IGF-I:IGFBP-3 (120) .
Despite the fact that DFMO at a dose of 0.5 g/m2/day did not modulate the risk biomarkers selected, the clinical model performed well in terms of (a) identification of subjects at high short-term risk for breast cancer, (b) subject accrual and retention, and (c) ability to sample tissue for biomarker assessment at both baseline and study completion. Selection of study subjects on the basis of epidemiological risk factors and random periareolar FNA evidence of hyperplasia ± atypia produced a cohort at very high short-term risk as evidenced by in situ or invasive breast cancer in 9 of the 119 subjects at a median follow-up time of 42 months from baseline aspiration (39 months from randomization). This is consistent with findings in our previously reported study, in which 17 of 337 high-risk women with random FNA evidence of hyperplasia with or without atypia had developed in situ or invasive breast cancer at a median follow-up time of 45 months from initial aspiration. Twenty-five percent of subjects screened by random periareolar FNA ultimately entered the study, with 95% of subjects completing the second FNA procedure, attesting to the excellent subject tolerance of repeated tissue sampling in a relatively short time frame using this technique. Accrual was accomplished in a single institution on schedule and is, to our knowledge, the first randomized placebo-controlled Phase II breast cancer chemoprevention study to be completed in the United States using breast tissue biomarkers as an end point. Finally, random periareolar FNA was able to produce sufficient material for cytology and three additional immunocytochemical tests at baseline and follow-up in the majority of subjects.
In conclusion, DFMO at a dose of 0.5 g/m2/day for 6 months is not associated with significant modulation of cytologic pattern, proliferation, mammographic density, or IGF-I relative to placebo. However, we believe that the random periareolar FNA approach to breast tissue acquisition provides a feasible means of obtaining tissue for initial assessment of potential chemopreventive efficacy in women at high risk of developing breast cancer. Modification of cytology index scores, proliferation indices, and mammographic breast density area would appear to be the most useful and clinically relevant markers to chose as response markers and main study end points. However, a final validation of these end points will only be obtained when a trial is performed with an agent that is in fact effective and a statistically significant improvement in these variables is obtained that is correlated with reduction in cancer incidence.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
1 Supported by contract NO1-CN-65124 from the Chemoprevention Branch, Cancer Prevention Research Program-Cancer Control, Division of Cancer Prevention of the National Cancer Institute, NIH. ![]()
2 To whom requests for reprints should be addressed, at University of Kansas Medical Center, 3901 Rainbow Boulevard, Kansas City, KS 66160. Phone: (913) 588-7791; Fax: (913) 588-3679; E-mail: cfabian{at}kumc.edu ![]()
3 The abbreviations used are: ER, estrogen receptor; DFMO,
-difluoromethylornithine; FNA, fine-needle aspiration; ODC, ornithine decarboxylase; PCNA, proliferating cell nuclear antigen; EGFR, epidermal growth factor receptor; IGF, insulin-like growth factor; IGFBP-3, IGF binding protein 3; ACR, American College of Radiology; HRT, hormone replacement therapy; FSH, follicle-stimulating hormone; QOL, quality of life; NCI, National Cancer Institute; GI, gastrointestinal; dB, decibel. ![]()
Received 9/24/01; revised 5/31/02; accepted 6/ 5/02.
| REFERENCES |
|---|
|
|
|---|
-difluoromethylornithine: a chemopreventive agent. Prev. Med., 18: 646-652, 1989.[CrossRef][Medline]
-difluoromethylornithine. Anticancer Res., 10: 67-72, 1990.[Medline]
-difluoromethylornithine on rectal mucosal levels of polyamines in a randomized, double-blinded trial for colon cancer prevention. J. Natl. Cancer Inst. (Bethesda), 90: 1212-1218, 1998.
-difluoromethylornithine in individuals at risk for colorectal cancer. Cancer Epidemiol. Biomark. Prev., 7: 989-992, 1998.[Abstract]
-difluoromethylornithine as a potential chemopreventive agent: tolerance to daily oral administration in humans. Cancer Epidemiol. Biomark. Prev., 2: 243-247, 1993.[Abstract]
B functions by polyamines and their role in polyamine analog-induced apoptosis of breast cancer cells. Oncogene, 20: 1715-1729, 2001.[CrossRef][Medline]
-difluoromethyl-ornithine on the expression and function of the epidermal growth factor receptor in human breast epithelial cells in culture. Breast Cancer Res. Treat., 68: 139-146, 2001.[CrossRef][Medline]
-difluoromethylornithine: an inhibitor of polyamine biosynthesis. J. Clin. Oncol., 2: 124-130, 1984.[Abstract]
-difluoromethylornithine, an inhibitor of polyamine synthesis, in advanced small cell lung cancer and colon cancer. Cancer Treat. Rep., 70: 843-845, 1986.[Medline]
-difluoromethylornithine (DFMO) for the treatment of metastatic melanoma. Investig. New Drugs, 4: 257-262, 1986.[Medline]