[R] Compare two data frames
Doran, Harold
HDoran at air.org
Thu Apr 22 15:41:49 CEST 2010
I wonder if there is a more efficient way to do this task. Suppose I have two data frames, such as
d1 <- data.frame(x = c(1,2,3), y = c(4,5,6), z = c(7,8,9))
d2 <- d1[, c('y', 'x')]
The first dataframe d1 has more variables than d2 and the variable columns are in a different order.
So, what I want to do is compare the two frames on the variables that are common between the two. First I find the common variables between the two dataframes
common_order <- intersect(colnames(d1), colnames(d2))
Then, I have to put the variables in d2 in the same order as d1 as
d2 <- d2[, common_order]
Then, I keep only the variables in common between d1 and d2 as
d1 <- d1[, common_order]
Then, finally I can do the compare on the common variables now in the same order.
all.equal(d1, d2)
None of this is horribly difficult, but it requires a couple of steps that I am wondering might be eliminated.
Harold
> sessionInfo()
R version 2.10.1 (2009-12-14)
i386-pc-mingw32
locale:
[1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252 LC_MONETARY=English_United States.1252
[4] LC_NUMERIC=C LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] MiscPsycho_1.6 statmod_1.4.6
loaded via a namespace (and not attached):
[1] tools_2.10.1
More information about the R-help
mailing list