[R] extract rows in dataframe with duplicated column values
Marc Schwartz
MSchwartz at MedAnalytics.com
Fri Mar 18 04:46:18 CET 2005
Here's one more possibility:
> subset(x, a %in% a[duplicated(a)])
a b
2 2 10
3 2 10
4 3 10
5 3 10
6 3 10
HTH,
Marc Schwartz
On Thu, 2005-03-17 at 22:25 -0500, Liaw, Andy wrote:
> OK, strike one...
>
> Here's my second try:
>
> > cnt <- table(x[,1])
> > v <- as.numeric(names(cnt[cnt > 1]))
> > v
> [1] 2 3
> > x[x[,1] %in% v, ]
> a b
> 2 2 10
> 3 2 10
> 4 3 10
> 5 3 10
> 6 3 10
>
> Andy
>
> > From: Liaw, Andy
> >
> > Does this work for you?
> >
> > > x[table(x[,1]) > 1,]
> > a b
> > 2 2 10
> > 3 2 10
> > 5 3 10
> > 6 3 10
> >
> > Andy
> >
> > > From: Tiago R Magalhaes
> > >
> > > Hi
> > >
> > > I want to extract all the rows in a data frame that have duplicates
> > > for a given column.
> > > I would expect this question to come up pretty often but I have
> > > researched the archives and surprisingly couldn't find anything.
> > > The best I can come up with is:
> > >
> > > x <- data.frame(a=c(1,2,2,3,3,3), b=10)
> > > xdup1 <- duplicated(x[,1])
> > > xdup2 <- duplicated(x[,1][nrow(x):1])[nrow(x):1]
> > > xAllDups <- x[(xdup1+xdup2)!=0,]
> > >
> > > This seems to work, but it's so convoluted that I'm sure there's a
> > > better method.
> > > Thanks for any help and enlightenment
> > > [[alternative HTML version deleted]]
> > >
> > > ______________________________________________
> > > R-help at stat.math.ethz.ch mailing list
> > > https://stat.ethz.ch/mailman/listinfo/r-help
> > > PLEASE do read the posting guide!
> > > http://www.R-project.org/posting-guide.html
> > >
> > >
> > >
> >
> > ______________________________________________
> > R-help at stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide!
> > http://www.R-project.org/posting-guide.html
> >
> >
> > --------------------------------------------------------------
> > ----------------
> > Notice: This e-mail message, together with any attachments,
> > contains information of Merck & Co., Inc. (One Merck Drive,
> > Whitehouse Station, New Jersey, USA 08889), and/or its
> > affiliates (which may be known outside the United States as
> > Merck Frosst, Merck Sharp & Dohme or MSD and in Japan, as
> > Banyu) that may be confidential, proprietary copyrighted
> > and/or legally privileged. It is intended solely for the use
> > of the individual or entity named on this message. If you
> > are not the intended recipient, and have received this
> > message in error, please notify us immediately by reply
> > e-mail and then delete it from your system.
> > --------------------------------------------------------------
> > ----------------
> >
> >
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
More information about the R-help
mailing list