[R] survival analysis and censoring
Geoff Russell
geoffrey.russell at gmail.com
Thu Mar 13 06:14:22 CET 2008
Dear Prof. Therneau,
Many thanks for this,
On 3/13/08, Terry Therneau <therneau at mayo.edu> wrote:
>
> In your particular case I don't think that censoring is an issue, at least not
> for the reason that you discuss. The basic censoring assumption in the Cox
> model is that subjects who are censored have the same future risk as those who
> were a. not censored and b. have the same covariates.
> The real problem with informative censoring are the covaraites that are not
> in the model; ones that I likely don't even know exist. Assume for instance
> that some unknown exposure X, Perth sunlight say, makes people much more likely
> to get both of the outcomes. Assume further that it matters, i.e., the study
> includes a reasonable number of people with and without this exposure. Then
> someone who has an early heart attack actually has a higher risk of colorectal
> cancer than a colleague of the same age/sex/followup who did not have a heart
> attack, the reason being that the HA guy is more likely to be from Perth.
>
> Your simulation went wrong by not actually accounting for time. You created
> an outcome table for CC & HD and added a random time vector to it. If someone
> would have had CC at 2 years and now has HD at 1 year, you can't just change the
> status to make them censored at 2. The gambling analogy would be kicking
> someone out of the casino just before they win -- it does odd things to the
> odds.
I'm still astonished that this is the explanation, but I've spent an
hour playing with
my little R code model and this is exactly the problem. Score 1 for solid
maths and 0 for my intuition.
Many Thanks,
Geoff
>
> Terry Therneau
>
>
>
More information about the R-help
mailing list