[R] remove duplicated row according to NA condition
arun
smartpink111 at yahoo.com
Wed May 28 17:51:53 CEST 2014
Hi,
May be this helps:
data1 <- data[with(data, order(col1, col2,1*is.na(col3))),]
data1[!duplicated(data1[,1:2]),]
A.K.
On Wednesday, May 28, 2014 11:28 AM, jeff6868 <geoffrey_klein at etu.u-bourgogne.fr> wrote:
Hi everybody,
I have a little problem in my R-code which seems be easy to solve, but I
wasn't able to find the solution by myself for the moment.
Here's an example of the form of my data:
data <-
data.frame(col1=c("a","a","b","b"),col2=c(1,1,2,2),col3=c(NA,"ST001","ST002",NA))
I would like to remove duplicated data based on the first two columns
(col1,col2), but in both cases here, I would like to remove the duplicated
row which is equal to NA in col3.
Here's the data.frame I would like to obtain:
data2 <- data.frame(col1=c("a","b"),col2=c(1,2),col3=c("ST001","ST002"))
I've been trying to mix duplicated() with is.na() but it doesn't work yet.
Can someone tell me the best and easiest way to do this?
Thanks a lot!
--
View this message in context: http://r.789695.n4.nabble.com/remove-duplicated-row-according-to-NA-condition-tp4691362.html
Sent from the R help mailing list archive at Nabble.com.
______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list