Professional Documents
Culture Documents
Mekelle University
1
Survival/Event History Analysis
2
Session objective
• Longitudinal/panel data
4
Survival Analysis
5
Survival Analysis…
6
Examples of time to event data
• Time to death
• Time to incidence of disease
• Unemployed - time till find job
• Time to birth of first child
• Smokers – time till quit smoking
7
Survival analysis: key concepts
– States
– Events
– Risk period(time)
– Spell
– Censoring
8
States
• States are categories of the outcome variable of interest
• Examples
– alive, dead
• Example
– Death; disease (diagnosis, start of symptoms, relapse)
– Menopause
– Recurrence; response
10
Risk period /“Time”:
• Example
11
Time…
12
Time…
• Need to know when the clock starts
– Time from recruitment into the study
– Time from employment
– Time from diagnosis (prognostic studies)
– Time from infection
– Calendar time
– Age
• Event history analysis is to do with the analysis of the duration
of a non occurrence of an event or the length of time during
the risk period
13
Spell
• Many events are repeatable. For example, unemployment, child
birth, migration, infection
14
Censoring
• Different types
– Right
– Left
– Interval
15
Right censoring
• Happens when the person did not have an event during the time
that they were studied
17
Example: clinical trial
18
Drop-out or LTFU
19
Competing Risks: another type of censoring
• Example of competers:
– Death from lung cancer
20
Left censoring
• Example:
• or, equivalently,
• Examples:
– Breast cancer: 5 year relapse-free survival
Example:
– 3 Deaths
– 2 censored (lost) before 2 years
– 1 survived 2 years
• Question: what is the cumulative incidence (or the
Cumulative Survival) up to 2 years?
25
Person ID
1 (24)
2 (6)
3 (18)
4 (15)
5 (13)
6 (3)
Person ID
1 (24)
2 (6)
3 (18)
4 (15)
5 (13)
6 (3)
0 1 2
Follow-up time (years)
27
Estimating the Survival Function
28
Life table…
29
Life table
Interval in Number Average Number of Lost to Proportion Among Survival
Years At Risk Number Deaths Follow-Up, Dying Those at Probability
During At Risk During Ct During Risk, St
Interval, During Interval, Dt Interval, Proportion
Nt Interval, qt Surviving
Nt* Effective Interval, pt
number of
persons at
risk
0 6 6 0 0 0 1 1
1st yr 6 6-(1/2) = 1 1 1/5.5 = 1-0.18= 1(0.82) =
5.5 0.18 0.82 0.82
2nd yr 4 4-(1/2) = 2 1 2/3.5 = 1-0.57 = (0.82)(0.43
3.5 0.57 0.43 )=0.35
30
Life table…
• Assume that censored observations over the period contribute
one-half the persons at risk in the denominator (censored
observations occur uniformly throughout follow-up interval).
31
Life table…
• An issue with the life table approach shown above is that the
survival probabilities can change depending on how the intervals
are organized, particularly with small samples.
32
Kaplan-Meier (Product Limit) Approach
33
Kaplan-Meier
6 0 1
0
3
6 1 0 1*(6-1)/6=0.83
6 5 0 1 0.83*(5-0)/5=0.83
13 4 1 0 0.83*(4-1)/4=0.62
15 3 1 0.62*(3-0)/3=0.62
18 2 1 0 0.62*(2-1)/2=0.31
1 0 1 0.31*(1-0)/1=0.31
24
0 1 2
Follow-up time (years)
36
Kaplan-Meier Method
Person ID
6 (3)
2 (6)
5 (13)
4 (15)
3 (18)
1 (24)
0 1 2
Follow-up time (years)
37
Step 2: For each time of occurrence of an event, compute the
conditional survival
Person ID
6 (3)
2 (6)
5 (13)
4 (15)
3 (18)
1 (24)
0 1 2
Follow-up time (years)
38
Person ID
6 (3)
2 (6)
5 (13)
4 (15)
3 (18)
1 (24)
0 1 2
Follow-up time (years)
39
Person ID
6 (3)
2 (6)
5 (13)
4 (15)
3 (18)
1 (24)
0 1 2
Follow-up time (years)
3 0.833
13 0.625
Survival 18 0.3125
1.00
0.833
0.80 0.625
0.60
0.20
0 5 10 15 20 25
Month of follow-up
3 6 1 0 0.8 0.2
6 5 1 0.8 0.2
13 4 1 0 0.6 0.4
15 3 1 0.6 0.4
18 2 1 0 0.3 0.7
24 1 0 1 0.3 0.7
44
Plotting the hazard function:
0 5 10 15 20 25
Month of follow-up
45
Some hazard shapes
• Increasing
– Onset of Alzheimer's , natural aging and wear
• Decreasing
– Survival after surgery, early failures due to device or
transplant failures
• U-shaped
– Age specific mortality, populations followed from birth
• Constant
– Time till next email arrives
46
Comparing Survival Curves
48
Comparing survival by group using Kaplan-Meier graphs
1.00
0.75
0.50
0.25
0.00
0 5 10 15
analysis time
51
Design of survival studies…
52
The Cox regression model
53
Cox regression
55
with Cox del
Cox regression….
56
Cox regression….
hazard.
58
Cox regression equation
• For example, if the hazard is 0.2 at time t and the time units
are months, then on average, 0.2 events are expected per
person at risk per month.
• The hazard ratio can be estimated from the log rank test.
For a predictor
64
Interpreting HR from cox regression
• The hazard ratio is the ratio of the hazard for a unit change in
the covariate
• No censoring patterns
– Independent observations
– No multi-collinearity
66
Summary
68
Assignment
• HR calculation
– A. graphically
– B. statistically
69
• Questions and Discussion
70
• Thank you
71