[R] Do you use R for data manipulation?

Frank E Harrell Jr f.harrell at vanderbilt.edu
Wed May 6 17:27:46 CEST 2009


Simon Pickett wrote:
> My institute uses SAS religiously, I am the only R "heathen".
> 
> I have resisted learning to use SAS because I dont see the point after 
> years of using R and I like being able to do everything using one 
> program. However, my colleagues maintain that SAS is "better" for 
> programming without really ever giving me a good reason why other than 
> memory issues.
> 
> dont want to hi-jack the thread but would be interested in hearing some 
> other views, especially since my organisation spends (wastes?) alot of 
> money every year on SAS licences...

Put quite simply, your colleagues' opinions are humbug.

Frank

> 
> Simon.
> 
> ----- Original Message ----- From: "Laura Arsanto" <ghina84 at hotmail.it>
> To: <jrkrideau at yahoo.ca>; <r-help at stat.math.ethz.ch>; <fjbuch at gmail.com>
> Cc: <ross.lazarus at gmail.com>; <gregory_warnes at urmc.rochester.edu>; 
> <greg at warnes.net>
> Sent: Wednesday, May 06, 2009 2:53 PM
> Subject: Re: [R] Do you use R for data manipulation?
> 
> 
> 
> 
> I used R for my master thesis (with big effort, anyway) and now I find 
> difficult to use R in my daily work, becasue it has really serious 
> problems with datasets of big dimension, both in the data manipulation 
> step and in the analysis step.
> 
> But I really would love to use it, as I like its transparence, compared 
> to other software.
> 
> Laura
> 
> ***********
> 
>> Date: Wed, 6 May 2009 06:42:45 -0700
>> From: jrkrideau at yahoo.ca
>> To: r-help at stat.math.ethz.ch; fjbuch at gmail.com
>> CC: ross.lazarus at gmail.com; gregory_warnes at urmc.rochester.edu; 
>> greg at warnes.net
>> Subject: Re: [R] Do you use R for data manipulation?
>>
>>
>>
>>
>> --- On Wed, 5/6/09, Farrel Buchinsky <fjbuch at gmail.com> wrote:
>>
>> > Is R an appropriate tool for data
>> > manipulation and data reshaping and data
>> > organizing? I think so but someone who recently joined our
>> > group thinks not.
>>
>> I only do small scale projects and am by no means a programmer. Isn't 
>> Perl something for earings?
>>
>> That said, I find R to be extremely useful at data manipulation and 
>> have used it exclusively in my last three projects.  The different 
>> data structures alone are worth their weight in gold, if for nothing 
>> else than making it harder to make stupid mistakes in coding.
>>
>> > The new recruit believes that python or another language is
>> > a far better tool for developing data manipulation scripts that can be
>> > then used by> several members of our research group. Her assessment is
>> > that R is useful> only when it comes to data analysis and working with
>> > statistical models.
>>
>> Any reason that she thinks this?  How well does she know R?  It is not 
>> exactly a language that one picks up in a week, especially if one is 
>> coming from using a stats package like SAS or SPSS. As an ex-SAS and 
>> SYSTAT user it took me weeks to just get comfortable with the power of 
>> subscripting and the ability to do all kinds of calculations "in-line".
>>
>> > So what do you think:
>> > 1)R is a phenomenally powerful and flexible tool and since you are 
>> going >  > to do analyses in R you might as well use it to read data 
>> in and merge
>> > it and reshape it to whatever you need.
>>
>> Definately. I am not a computer scientist or a statistician. I usually 
>> am working as a single contractor and normally with small datasets as 
>> part of a larger project.  R does what I want, usually very elegantly 
>> (albeit perhaps after a lot of headbanging and calls for help to the 
>> R-list) and it would be stupid for me to use more than one language 
>> when it is not needed.
>>
>> Another plus is that I can  easily leave my data analysis work and a 
>> working copy of R with the client.  He/she may have a problem seeing 
>> what I did but it is clearly readable & replicable by either the 
>> client or another consultant.
>>
>> > OR
>> > 2) Are you crazy? Nobody in their right mind uses R to pipe
>> > the data around their lab and assemble it for analysis.
>>
>> Well I don't work in a lab but why complicate things? If everyone is 
>> using the same tools then you have a good situation.  Others who do 
>> work in labs can address this point more cogently
>>
>> >From a personnel point of view do you expect everyone in the lab to 
>> be >proficient with R and, for example, Perl? What happens when/if you 
>> lose >your Perl expert(s)?  I've had occasions where I waited a week 
>> for data >simply because the division's MS Access "expert" was on 
>> holiday and the >only other "Access" person there only knew how to 
>> enter data and run the >monthly reports.  Anything more complicated 
>> required the "expert".
>>
>>
>>
>>
>>
>>       __________________________________________________________________
>> Make your browsing faster, safer, and easier with the new Internet 
>> Explorer® 8. Optimized for Yahoo! Get it Now for Free! at 
>> http://downloads.yahoo.com/ca/internetexplorer/
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
> 
> _________________________________________________________________
> [[elided Hotmail spam]]
> 
> [[alternative HTML version deleted]]
> 
> 
> 
> 
> -------------------------------------------------------------------------------- 
> 
> 
> 
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide 
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
> 


-- 
Frank E Harrell Jr   Professor and Chair           School of Medicine
                      Department of Biostatistics   Vanderbilt University




More information about the R-help mailing list