[R] Recurrent analysis survival analysis data format question
Bob Green
bgreen at dyson.brisnet.org.au
Tue Jun 10 08:31:30 CEST 2014
Hello,
I'm hoping for advice regarding how to set up a recurrent event
survival analysis data file. My data consists of people released from
custody, with survival time being measured as days before re
imprisonment or end of the study. In the example below, id 5155 is
released 5 times and jailed five times. All events are therefore
true. Daysfree is the difference in days between release and return
to custody. Id 7155 is released 3 times and only re-imprisoned
twice, so the third event value is false.
id <- c(5155, 5155,5155,5155, 7155, 7155,7155)
Release <- c("29/10/10","9/01/11", "25/03/12", "15/10/13", "9/01/10",
"16/12/12","29/10/13")
JailNew <- c("1/12/10","01/12/11", "27/09/12", "24/01/14",
"22/09/12","24/01/12","24/01/14")
DaysFree <- c(24,234,134,74,709,29,64)
Event <- c("true", "true", "true", "true", "true", "true", "false" )
DF1<- data.frame(id, Release, JailNew, DaysFree, Event)
DF1
After speaking to a statistician today I'm not sure if I my method
of formatting the data is correct. Should all time intervals be
included, not just the period from release to event/end of study
period. Currently period imprisoned is not counted. For example,
for id 5155, would I also include 1/12/10 - 9/01/11 etc, which would
be FALSE for event and have a duration of 39 days; and then include
all the other similar intervals as well. The statistican thought
including this additional information more closely resembled the
bladder1 data in the Survival package.
Any assistance is appreciated,
Regards
Bob
More information about the R-help
mailing list