[R] what is the faster way to search for a pattern in a few million entries data frame ?
Fabien Tarrade
fabien.tarrade at gmail.com
Mon Apr 11 00:59:24 CEST 2016
Hi Jim,
I didn't know this one. I will have a look.
Thanks
Cheers
Fabien
> Hi Fabien,
> I was going to send this last night, but I thought it was too simple.
> Runs in about one millisecond.
>
> df<-data.frame(freq=runif(1000),
> strings=apply(matrix(sample(LETTERS,10000,TRUE),ncol=10),
> 1,paste,collapse=""))
> match.ind<-grep("DF",df$strings)
> match.ind
> [1] 2 11 91 133 169 444 547 605 734 943
>
> Jim
--
Dr Fabien Tarrade
Quantitative Analyst/Developer - Data Scientist
Senior data analyst specialised in the modelling, processing and
statistical treatment of data.
PhD in Physics, 10 years of experience as researcher at the forefront of
international scientific research.
Fascinated by finance and data modelling.
Geneva, Switzerland
Email : contact at fabien-tarrade.eu <mailto:contact at fabien-tarrade.eu>
Phone : www.fabien-tarrade.eu <http://www.fabien-tarrade.eu>
Phone : +33 (0)6 14 78 70 90
LinkedIn <http://ch.linkedin.com/in/fabientarrade/> Twitter
<https://twitter.com/fabtar> Google
<https://plus.google.com/+FabienTarradeProfile/posts> Facebook
<https://www.facebook.com/fabien.tarrade.eu> Google
<skype:fabtarhiggs?call> Xing <https://www.xing.com/profile/Fabien_Tarrade>
More information about the R-help
mailing list