
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Cancer Therapy: Clinical |
Author's Affiliation: Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, Massachusetts
Requests for reprints: Haesook T. Kim, Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 44 Binney Street, M218 Boston, MA 02115. Phone: 617-638-6547; Fax: 617-632-2444; E-mail: kim.haesook{at}jimmy.harvard.edu.
| Abstract |
|---|
|
|
|---|
In addition to estimating the cumulative incidence of an event, it is often of interest to determine whether there is a difference in the cumulative incidence rates among different groups. In standard survival analysis, this is done using the log-rank test to compare curves generated with the KM method (2). In the presence of competing risks, however, this is inappropriate, for the same reason given above. Instead, Gray investigated this issue and proposed a class of tests for comparing the cumulative incidence curves of a particular type of failure among different groups in the presence of competing risks (3).
Finally, when there is a difference in the cumulative incidence curves among different treatment groups, it is also important to determine whether this difference is solely due to the treatment or to the confounding factors, such as age or baseline disease stage. In standard survival analyses, this question is usually addressed by fitting a Cox proportional hazards model (4, 5). In fact, one may attempt to construct a cause-specific standard Cox model for a particular failure treating other competing risks censored. However, the effect of a covariate on an event from either a cause-specific (e.g., relapse) model or cause nonspecific (e.g., relapse and TRM combined) model may be very different from the effect of the covariate of the event (e.g., relapse) in the presence of competing risks (e.g., TRM). Fine and Gray (6) and Klein and Andersen (7) proposed a method for direct regression modeling of the effect of covariates on the cumulative incidence function for competing risks data. As in any other regression analysis, modeling cumulative incidence functions for competing risks can be used to identify potential prognostic factors for a particular failure in the presence of competing risks, or to assess a prognostic factor of interest after adjusting for other potential risk factors in the model.
In this article, we will present the three aforementioned methods in the analysis of competing risks data in more detail. We will compare each method to its counterpart in standard survival analysis, and show why the latter are inadequate in the presence of competing risks. Using hypothetical numeric example and real data, we will demonstrate the use of these three methods, compare the results to the results obtained from standard survival analysis, and discuss the source and magnitude of bias that arises from standard methods.
| Estimating Cumulative Incidence in the Presence of Competing Risks |
|---|
|
|
|---|
Suppose that 10 patients enter a clinical trial at different time points (i.e., staggered entry), are followed until a specified time, and have independent and distinct failure times. Suppose that the event of interest is relapse. Patients who fail from other causes or who are still alive at the specified time point are censored. Patients who are lost to follow up during the study period are considered alive and relapse-free at the last time seen alive and thus censored. In order to calculate the KM estimate of cumulative relapse-free survival probability, the observed failure times for all patients need to be ordered from the smallest to the largest, irrespective of the censoring status, and each failure time is paired with the information of censoring status. For example, let t0 denote the time zero or the time of study entry, t1 denote the smallest observed failure time, and t10 denote the largest observed failure time to the event. Then the KM estimate of relapse-free survival probability, SKMrel(t), is
|
|
The KM estimate of incidence of relapse at a specified time point is then the probability of relapse-free survival just prior to that time, multiplied by the number of relapses at that time, divided by the number of patients at risk (that is, alive, relapse-free, and not lost to follow-up) just prior to that time. Cumulative incidence is then a sum of these conditional probabilities over time. More specifically, the cumulative incidence using the KM method, denoted as CIKMrel, is calculated as follows:
|
|
The KM estimate of cumulative incidence function is simple and useful for a single end point such as relapse. However, if deaths due to other causes exclusive of relapse (i.e., TRM) are also of interest, the KM method does not capture the dependency of competing risks. For example, in allogeneic HSCT for patients with hematologic malignancies, both relapse and TRM are of equal importance to patients as well as physicians. As previously mentioned, efforts to modify the relapse rate through immune effector mechanisms may adversely affect TRM rates (vice versa is also true), and therefore, relapse and TRM are not independent events. If the KM method is used to estimate the cumulative incidence of relapse in the presence of TRM, patients dying of TRM are censored. However, unlike truly censored observations, patients who die of TRM cannot then relapse, and hence, their risk for relapse is 0. Therefore, the survival probability is overestimated and thus cumulative incidence of relapse is also overestimated (as shown in the example below) in the KM method.
| Competing Risks Method |
|---|
|
|
|---|
|
|
Example 1: a hypothetical numeric example. We shall show the computation of cumulative incidence function of relapse below using the KM and CR methods. Suppose that there are 10 patients with the ordered failed or censored times shown below
![]() |
|
|
|
| Comparison of Cumulative Incidence Curves in the Presence of Competing Risks |
|---|
|
|
|---|
2 distribution. Example 2b. Using the same data set presented in example 2a, Fig. 3 [Fig. 1 in Alyea et al. (8)] shows cumulative incidences of relapse and TRM using the CR method after myeloablative and nonmyeloablative transplantation. The 3-year cumulative incidence of TRM (CIT) was 50% after myeloablative transplantation, but was 32% after nonmyeloablative transplantation (P = 0.01). The CIT was calculated in the presence of relapse as a competing risk and this difference was tested using the Gray method. Similarly, the 3-year CIR was 30% after myeloablative transplantation and 46% after nonmyeloablative transplantation (P = 0.052). In comparison with the CR method, the 3-year CIR using the KM method was 50% after myeloablative and 61% after nonmyeloablative transplantation (P = 0.35). These results suggest that even though the cumulative incidence rates of combined events (relapse and TRM) are similar between two types of transplantation (80% versus 78%), the immunologic effects are different.
|
| CR Regression Analysis |
|---|
|
|
|---|
Because the simple relationship between a single end point and a single crude hazard does not hold in the presence of competing risks, Fine and Gray (6) and Klein and Andersen (7) proposed a direct regression modeling of the effect of covariates on the cumulative incidence function for CR data. These models distinguish between patients who are still alive and those who have already failed from competing causes and allow direct inference regarding the effects of covariates on the cumulative incidence function. The Fine and Gray method is based on proportional hazards model, whereas the Klein and Andersen method is based on the pseudovalues from a jackknife statistic from the cumulative incidence curve. When the two methods were compared in a real data example, results from both approaches were in close agreement (7).
Example 2c. Returning to our example of myeloablative versus nonmyeloablative allogeneic HSCT for patients >50 years of age, the cumulative incidence curves in Fig. 3 indicate that myeloablative transplantation is associated with an increased risk for TRM (P = 0.01) and nonmyeloablative transplantation is associated with an increased risk for relapse (P = 0.052). To investigate these differences further, we fitted regression models using the standard Cox and the Fine and Gray approach for relapse and TRM. In the Cox model, relapse and TRM are considered jointly in the outcome; in the Fine and Gray model, they are considered individually. Both models used the type of transplantation (myeloablative versus nonmyeloablative) as a covariate, along with patient age, unrelated versus related donor, bone marrow versus peripheral blood progenitor cells, unfavorable versus favorable prognosis, second versus first transplant, FK506 versus cyclosporine-based acute graft-versus-host disease (GVHD) prophylaxis, and donor-recipient sex mismatch. The results are shown in Table 2 . In both models, the type of transplantation was not a significant factor for outcome after adjusting for pretransplant characteristics. This can be explained by confounding. Of the 40 patients with myeloablative transplantation who died of TRM, 35 received bone marrow stem cells. Of five patients with nonmyeloablative transplantation, four received bone marrow stem cells and died of TRM. Therefore, the difference in the cumulative incidence of TRM (Fig. 3) was confounded in part by the source of progenitor stem cells (i.e., bone marrow versus peripheral blood). This is apparent in the CR regression model for TRM in Table 2. In that model, the hazard ratio (HR) for TRM of bone marrow use was 2.24 (P = 0.057). In contrast, in the Cox model for relapse and TRM combined, the HR for bone marrow as compared with peripheral blood stem cell use was 1.13 (P = 0.71).
|
CR regression analysis is also useful to identify other prognostic factors, other than type of transplantation, for each type of failure. In Table 2, donor-recipient sex mismatched were associated with a decreased risk for relapse (ß = 0.66, HR = 0.52, P = 0.04). Although this is indicated in the Cox model of relapse and TRM combined events in Table 2 (ß = 0.39, HR = 0.68, P = 0.053), it is unknown from this model whether this is due to relapse or TRM prevention. In fact, the hazard of relapse and TRM for sex-mismatched patients compared with sex-matched patients point to the opposite direction: ß = 0.069 with HR = 0.52 for relapse versus ß = 0.179 with HR = 1.20 for TRM, even though the latter is not statistically significant. Again, the Cox model is not appropriate for identifying risk factors for cumulative incidence of a specific event in the presence of competing risks.
| Discussion |
|---|
|
|
|---|
When competing risks are present, there are three ways to analyze the data: (a) analysis of the event of interest ignoring CR, (b) analysis of joint events as a single end point, and (c) analysis of CR. The first approach is incorrect and will lead to erroneous results as shown in the previous sections. The magnitude of the error could be substantial if the incidence of CR is high, or could be minimal if the incidence of CR is low. However, one could not know the effect of competing risks a priori unless CR analysis is done. The second approach is correct, but is too limited to address various important research questions. The combination of the second and third approaches is a comprehensive approach that addresses general as well as specific study questions. In the example of the myeloablative versus nonmyeloablative HSCT study, when the events of relapse and TRM are combined and analyzed as a single event as progression, there is no difference between the two types of transplantation [see Fig. 4 in Alyea et al. (8)]. This is because this type of analysis is limited to answer to only one questionwhether the progression-free survival of nonmyeloablative HSCT is better than myeloablative HSCT. Although this approach is informative, this analysis is not sufficient to answer whether the CIR or CIT between myeloablative and nonmyeloablative transplantations are different. The CR analysis of relapse and TRM is an appropriate method to answer these more advanced questions.
In the analysis of CR data, it is important to present both the results of the event of interest and the results of competing risks. In the example of the myeloablative versus nonmyeloablative HSCT study, presentation of CIR without CIT would be misleading because even though the CIR is higher in the nonmyeloablative transplantation compared with the myeloablative transplantation, the CIT is lower. Similarly, in a study of umbilical cord blood transplantation, comparison of cumulative incidence of acute GVHD between umbilical cord blood and other types of transplantation would be erroneous if the early infection rates are not considered simultaneously.
Fitting a CR regression model is also important. Just as in the standard survival analysis, analysis of competing risks is incomplete without CR regression analysis. CR regression analysis is used to identify risk factors for each competing risk. In example 2c, donor-recipient sex mismatch was associated with a decreased risk of relapse (HR = 0.52, P = 0.04), but not for TRM. Also, the effects of bone marrow stem cells on relapse (ß = 0.78, HR = 0.46) and on TRM (ß = 0.81, HR = 2.24) were contrasting, as shown in the coefficient estimate, ß, and HR (even though these effects are not statistically significant) and this opposite effect of a covariate would not be detected by fitting a standard Cox model. This is because the standard Cox model is not designed to answer what risk factors contribute to relapse in the presence of the competing risk of TRM. Using a cause-specific Cox regression model is incorrect because it ignores competing risks and treats them as censored. Fitting a CR regression model is also important to confirm whether the difference seen in the cumulative incidence curves is true or confounded by other risk factors. This is illustrated in example 2c. The difference in the cumulative incidence of TRM was confounded by the bone marrow stem cell use. Similarly the difference in the cumulative incidence of relapse was confounded by unfavorable prognosis at stem cell infusion.
In summary, the important first step for the analysis of CR data is the recognition that competing risks are present. Following this, the analysis should include a calculation of cumulative incidence of an event of interest in the presence of competing risks, a proper test for cumulative incidence curves of an event, and CR regression analyses. Software packages for the KM method and Cox proportional hazards regression model are available in S-plus, SAS, and SPSS. R and S programs for the Gray test (3) and the Fine and Gray CR regression model (6) can be obtained from the web page of Robert Gray1 or by contacting him at gray.robert{at}jimmy.harvard.edu. Future research should include sample size and/or power calculation in CR data.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 http://biowww.dfci.harvard.edu/~gray/. ![]()
Received 5/18/06; revised 8/ 8/06; accepted 9/26/06.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Shardell, D. O. Scharfstein, D. Vlahov, and N. Galai Sensitivity Analysis Using Elicited Expert Information for Inference With Coarsened Data: Illustration of Censored Discrete Event Times in the AIDS Link to Intravenous Experience (ALIVE) Study Am. J. Epidemiol., December 15, 2008; 168(12): 1460 - 1469. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Dignam and M. N. Kocherginsky Choice and Interpretation of Statistical Tests Used When Competing Risks Are Present J. Clin. Oncol., August 20, 2008; 26(24): 4027 - 4034. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-S. Wang, S.-T. Wang, and P. Chou THREE AUTHORS REPLY Am. J. Epidemiol., March 15, 2008; 167(6): 751 - 752. [Full Text] [PDF] |
||||
![]() |
C. E. Behrendt RE: "HEPATITIS C VIRUS INFECTION AND THE DEVELOPMENT OF TYPE 2 DIABETES IN A COMMUNITY-BASED LONGITUDINAL STUDY" Am. J. Epidemiol., March 15, 2008; 167(6): 751 - 751. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |