[R] Sharing levels across multiple factor vectors
Peter Dalgaard
pdalgd at gmail.com
Thu Apr 1 10:05:58 CEST 2010
Jeff Brown wrote:
> Sorry for spamming. I swear I had worked on that problem a long time before
> posting.
>
> But I just figured it out: I have to change the values, which are
> represented as integers, not strings. So the following code will do it:
>
> df <- data.frame (
> a = factor( c( "bob", "alice", "bob" ) ),
> b = factor( c( "kenny", "alice", "alice" ) )
> );
> allLevels <- unique( c( levels( df$a ), levels( df$b ) ) )
> for (c in colnames(df)) {
> df[,c] <- match( df[,c], allLevels);
> levels( df[,c] ) <- 1:(length(allLevels))
> };
>
Hmm, I think I'd go for something like
allLevels <- unique(unlist(lapply(df,levels)))
df[] <- lapply(df, factor,
levels=allLevels, labels=seq_along(allLevels))
--
Peter Dalgaard
Center for Statistics, Copenhagen Business School
Phone: (+45)38153501
Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com
More information about the R-help
mailing list