Professional Documents
Culture Documents
REVIEW ARTICLE
Survival Analysis
Part 15 of a Series on Evaluation of Scientific Publications
Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639 163
MEDICINE
BOX 1
164 Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639
MEDICINE
BOX 2
KaplanMeier method: an example based on data from 5 children with brain tumors
Patient data are shown in Table 1, in order of observation time. 3 of the 5 patients died.
Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639 165
MEDICINE
KaplanMeier curve for 33 children and adolescents with medulloblastoma and FIGURE 2
metastasis status M1
tumors are censored because the event tumor-related Survival rates and median survival time
death has not occurred. In more complex evaluations, Survival rates can be determined using the
both events can be investigated in parallel (as compet- KaplanMeier curve. Survival rates indicate the
ing risks). However, this will not be examined in this number of patients in whom no event has occurred up
article. HIT 91 investigated the time from primary to a certain point in time. In the example above, the
brain tumor operation to death for any reason. 1-year survival rate is 30% (Box 2). This can be inter-
Alongside data from patients with known survival preted as follows: one year after diagnosis, we can ex-
times, data from censored patients must also be in- pect 30% of patients to be still alive. When stating sur-
cluded in evaluation. Specific evaluation strategies are vival rates, it is important to also state the point in time
needed in order for censored patients data to be suffi- to which it corresponds. When comparing two treat-
ciently reflected in analysis. ment groups, it is advisable to plot KaplanMeier
If survival time data are not evaluated in this way, curves for both treatment groups, as these provide more
the results are generally faulty. The mistakes most com- information than survival rates alone.
monly made when evaluating survival time data are The mean survival time is very much affected by
described in Box 1. censorings. Because of this, median values of survival
When evaluating survival times, it is important to times are always given. The median survival time is the
take into account both the time until an event occurs time at which half the patients have suffered an event.
and censored patients. This article describes methods The median survival time of the five brain tumor
for evaluation and graphical representation of survival patients is ten months. If the KaplanMeier estimator
time data on the basis of the trial HIT 91. Simple intro- for the whole observation period is more than 50%, the
ductions to survival analyses are provided by textbooks median survival time cannot be determined. In such
by Wei (2) and Schumacher and Schulgen (3). Text- cases, fewer than half the patients have suffered an
books by Collett (4) and by Kalbfleisch and Prentice event by the end of the observation period.
(5) may be consulted for further reading.
Log-rank test
KaplanMeier curves In HIT 91, the survival times of patients from the two
Table 1, Box 2 shows the survival times of five children treatment groups were compared according to their
with brain tumors. The probability that a patient has metastasis statuses. KaplanMeier curves can be used
survived up to a certain point in time is calculated using for descriptive comparison of the two treatment groups
the KaplanMeier method (6). The survival times can survival times for patients with metastasis status M1
be shown graphically using a KaplanMeier curve (also (Figure 2). The standard method, the log-rank test, was
called a survival time curve) (Figure 1 in Box 2). Pa- used for statistical comparison of survival times. The
tients survival times are plotted on the x-axis, and the log-rank test examines whether there is a difference be-
probability of survival calculated according to the tween two groups survival times. This involves not
KaplanMeier method is plotted on the y-axis. only a specific point in time, such as the 6-month
Calculation of the probability of survival and graphi- survival rate, but also the whole observation period. To
cal representation using a KaplanMeier curve are put it more simply, we might say that KaplanMeier
explained step by step in Box 2. curves are compared with each other.
166 Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639
MEDICINE
Cox regression cross, it is not the case. Parmar and Machin (9) describe
The simultaneous effects of several variables on sur- how to test the assumption of proportional hazard. The
vival time can also be investigated. The parameters log-rank test is also based on the assumption of
examined in the HIT 91 study include the following: proportional hazard.
Treatment An example of a situation in which this assumption
Sex does not hold is the following: The survival times of
Degree of resection patients who have undergone an operation need to be
Metastasis status. compared to those of patients who received radio-
The effect on survival time of age at operation, a therapy instead of surgery. The risk of death is high im-
continuous variable, should also be examined. Cox mediately after surgery and then drops. In patients who
regression (8) can be used in both cases. Cox regression receive radiotherapy, the risk of death at the beginning
can also be used to obtain an estimator of the effect of treatment is low, but it may rise over time if radio-
size. This estimator takes the form of the hazard ratio. therapy is insufficiently effective. This means that the
two death rates are not proportional to each other.
Underlying assumptions If KaplanMeier curves are used for patients with
Cox regression is based on the assumption that the metastasis status M1 from HIT 91 (Figure 2), we can
hazard ratio remains constant over time (it is therefore see that maintenance chemotherapy performs uni-
also known as proportional hazards regression). This is formly better. This means there is no evidence against
true provided that the risk of an event (the hazard) of the assumption of proportional hazards.
group 2 is proportional to that of group 1 (assumption As with linear regression, there are also several pos-
of proportional hazard). Although the risk of an event sible methods for variable selection in Cox regression
(hazard) may vary over time, the variations over time (see Schneider et al. [10]).
must be the same in both groups. This assumption is not
always justified, but can be approximately assessed Example of Cox regression
using KaplanMeier curves. If the hazard in one of the In HIT 91, three variables demonstrated an effect on
two groups exceeds the hazard in the other permanently overall survival (Table 2):
and to the same extent, the assumption of proportional Treatment (binary)
hazard is valid. Represented graphically, this is the case Metastasis status on diagnosis (categorial)
when the KaplanMeier curves do not cross. If they do Age on diagnosis (continuous).
Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639 167
MEDICINE
TABLE 2
years, see Schneider et al. [10]) is retained when
measures are interpreted.
Results of Cox regression for overall survival in 280 children with
medulloblastoma
Other important issues
n HR 95% CI p Time-dependent variables
Metastasis status on diagnosis All the variables examined so far have been known at
the beginning of survival time. For example, HIT 91
M0* 114
investigated whether metastases which were present at
M1 33 2.11 1.133.94 the time of brain tumor surgery affected survival. To in-
0.001 vestigate a variable that is still unknown at the begin-
M2/3 40 3.06 1.765.33
Unknown 93 1.54 0.942.52 ning of survival time or that changes over time,
time-dependent Cox regression must be used. For
Treatment
example, if we wish to know whether diabetes patients
Maintenance chemotherapy* 127 0.006 cumulative dose of insulin affects the length of time
Sandwich chemotherapy 153 1.76 1.172.67 until a cardiovascular event occurs, we cannot stipulate
Age on diagnosis (years) 280 0.93 0.880.98 0.005
the cumulative dose as a known quantity at the outset.
Patients who survive longer will generally receive a
n: no. of cases; HR: hazard ratio; 95% CI: 95% confidence interval for hazard ratio; higher total dose. However, this high cumulative dose
p: p value of likelihood ratio test; * Control group for categorial variables. is not the cause of longer survival. To allow for this, the
(From: von Hoff K., Hinkes B., Gerber N.U., Deinlein F., Mittler U., Urban C. et al.: Long-term outcome and
clinical prognostic factors in children with medulloblastoma treated in the prospective randomised multi-
cumulative dose must be included in Cox regression as
centre trial HIT91. EJC 2009; 45: 120917; printed with the kind consent of Elsevier Publishers, Oxford) a time-dependent variable. Time-dependent Cox
regression is a highly complex procedure. It is
described at length in Colletts textbook (4).
Patients at risk
The reference group for the variable treatment con- The term patients at risk refers to patients who are
sists of patients receiving maintenance chemotherapy. still alive at a particular point in time. The number of
A hazard ratio of 1.76 can be interpreted as follows: patients at risk, which varies over time, is often inte-
The risk of death of children receiving sandwich grated into the KaplanMeier curve (under the time
chemotherapy is 1.76 times as high as that of children axis). As there are fewer patients at risk on the right-
receiving maintenance chemotherapy. hand edge of the KaplanMeier curve (some have al-
There are four possible metastasis statuses: ready died or been censored), this information allows
M0 us to determine how reliable the KaplanMeier
M1 estimate still is at the right-hand edge. The fewer the
M2/3 patients at risk, the higher the confidence interval of the
Unknown (Patients with unknown status are KaplanMeier estimator.
those in whom it was not clear whether their
status was M0 or M1.) Number of events
The reference group used for comparison consists of In order for results to be reliable, the number of events
patients with metastasis status M0. The risk of death in must be high enough. (N.B.: This does not mean the
each of the three groups M1, M2/3 and unknown is number of patients.) For each variable investigated
compared with that of the control group, M0. So, three using multivariable Cox regression, there must be at
hazard ratios are calculated. The risk of death of least ten events (12). If there is a small number of
children with status M1 is 2.11 times as high as that of events, only a few explanatory variables can be investi-
children with status M0 (hazard ratio = 2.11); in other gated simultaneously. In HIT 91 there were 101 cases
words, their risk is 111% higher. The risk of death of of death. This means that a maximum of ten variables
children with status M2/3 is 3.06 times as high as that can be included in Cox regression.
of a child with status M0. The risk of death of patients
whose metastasis status is unknown is 1.54 times as Sample size planning
high as that of children with status M0. In addition to A sample size calculation can be made for both the log-
the hazard ratio, the confidence interval (11) must also rank test and Cox regression. In addition to the signifi-
be taken into account. The reference value here is 1 cance level and the power to be achieved, we also need
(meaning no effect). an estimated survival rate for each group to be com-
With a continuous variable, the hazard ratio indi- pared or the estimated hazard ratio for a continuous ex-
cates the change in the risk of death if the parameter in planatory variable (3). Sample size calculation also
question rises by one unit, for example if the patient is takes into account the recruitment and follow-up time.
one year older on diagnosis. For every additional year
of patient age on diagnosis, the risk of death falls by 7% Censoring
(hazard ratio 0.93). Note that the unit chosen for the ex- If censored patients are distributed differently in each
planatory variable (in this case age on diagnosis in of two treatment groups that are to be compared, biased
168 Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639
MEDICINE
estimators may result. The degree of completeness of with medulloblastoma treated in the prospective randomised multi-
follow-up in each treatment group should therefore be centre trial HIT91. EJC 2009; 45: 120917.
reported (see Clark et al. [13]). 2. Wei C: Basiswissen Medizinische Statistik. 5th revised edition.
Heidelberg: Springer Medizin Verlag 2010.
Summary 3. Schumacher M, Schulgen G: Methodik klinischer Studien. 3rd edition.
Berlin, Heidelberg, New York: Springer 2008.
As survival time data contain censorings, they must al- nd
4. Collett D: Modelling survival data in medical research. 2 edition.
ways be evaluated using the KaplanMeier method and London: Chapman and Hall 2003.
the log-rank test. Analysis based on frequencies of 5. Kalbfleisch JD, Prentice R: The statistical analysis of failure time
events often produces faulty results. All doctors should nd
data. 2 edition. New York: Wiley, 2002.
understand KaplanMeier curves, the log-rank test and 6. Kaplan EL, Meier P: Nonparametric estimation from incomplete
the results of Cox regression, as they must be able to observations. JASA 1985; 53: 45781.
explain them to patients (e.g. when choosing a treat- 7. Ressing M, Blettner M, Klug SJ: Data analysis of epidemiological
ment option: whether to treat a brain tumor with sand- studiespart 11 of a series on evaluation of scientific publications.
Dtsch Arztebl Int 2010; 107(11): 18792.
wich or maintenance chemotherapy).
Multivariable analyses can be performed using Cox 8. Cox DR: Regression models and life tables (with discussion). Jour-
nal of the Royal Statistical Society (Series B) 1972; 74: 187200.
regression. Results can be interpreted using hazard
9. Parmar MK, Machin D: Survival analysis: a practical approach.
ratios and confidence intervals. Unfortunately, the Cambridge: John Wiley and Sons 1995.
underlying assumptions of Cox regression are not 10. Schneider A, Hommel G, Blettner M: Linear regression analy-
always taken into account (e.g. proportional hazards, sispart 14 of a series on evaluation of scientific publications
time-dependent variables), and many published Dtsch Arztebl Int 2010; 107(44): 77682.
analyses are therefore faulty. Readers of scientific pub- 11. du Prel JB, Hommel G, Rhrig B, Blettner M: Confidence interval or
lications should know these pitfalls and be able to judge p-value?part 4 of a series on evaluation of scientific publications
Dtsch Arztebl Int 2009; 106(19): 3359.
for themselves whether the chosen analytical method is
correct. 12. Peduzzi P, Concato J, Feinstein AR, Holford TR: Importance of
events per independent variable in proportional hazards regression
Conflict of interest statement analysis II. Accuracy and Precision of regression estimates. Journal
The authors declare that no conflict of interest exists according to the of Clinical Epidemiology 1995; 48: 150310.
guidelines of the International Committee of Medical Journal Editors.
13. Clark TG, Altman DG, De Stavola BL: Quantification of the complete-
Manuscript received on 1 June 2010, revised version accepted on 12 October ness of follow-up. Lancet 2002; 359: 130910.
2010.
Translated from the original German by Caroline Devitt, MA. Corresponding author
Prof. Dr. rer. nat. Maria Blettner
REFERENCES Institut fr Medizinische Biometrie (IMBEI)
Johannes Gutenberg-Universitt
1. von Hoff K, Hinkes B, Gerber NU, Deinlein F, Mittler U, Urban C, et Obere Zahlbacher Str. 69
al.: Long-term outcome and clinical prognostic factors in children 55131 Mainz, Germany
Deutsches rzteblatt International | Dtsch Arztebl Int 2011; 108(10): 1639 169