[R] quantile regression: out of memory error

Roger Koenker rkoenker at uiuc.edu
Mon Jul 11 19:48:13 CEST 2011


Paul,

Yours is NOT a large problem, but it becomes a large problem when you ask for ALL the distinct
QR solutions by specifying tau = -1.  You probably don't want to see all these solutions, I suspect
that only tau = 1:19/20 or so would suffice.  Try this, and see how it goes.

Roger

url:    www.econ.uiuc.edu/~roger            Roger Koenker
email    rkoenker at uiuc.edu            Department of Economics
vox:     217-333-4558                University of Illinois
fax:       217-244-6678                Urbana, IL 61801

On Jul 11, 2011, at 12:39 PM, Prew, Paul wrote:

> Hello,  I’m wondering if anyone can offer advice on the out-of-memory error I’m getting. I’m using R2.12.2 on Windows XP, Platform: i386-pc-mingw32/i386 (32-bit).
> 
> I am using the quantreg package,  trying to perform a quantile regression on a dataframe that has 11,254 rows and 5 columns.
> 
>> object.size(subsetAudit.dat)
> 450832 bytes
> 
>> str(subsetAudit.dat)
> 'data.frame':   11253 obs. of  5 variables:
> $ Satisfaction     : num  0.64 0.87 0.78 0.75 0.83 0.75 0.74 0.8 0.89 0.78 ...
> $ Return           : num  0.84 0.92 0.91 0.89 0.95 0.81 0.9 0.87 0.95 0.88 ...
> $ Recommend        : num  0.53 0.64 0.58 0.58 0.62 0.6 0.56 0.7 0.64 0.65 ...
> $ Cust.Clean       : num  0.75 0.85 0.72 0.72 0.81 0.79 0.79 0.8 0.78 0.75 ...
> $ CleanScore.Cycle1: num  96.7 83.3 93.3 86.7 96.7 96.7 90 80 81.7 86.7 ...
> 
> rq(subsetAudit.dat$Satisfaction ~ subsetAudit.dat$CleanScore.Cycle1, tau = -1)
> 
> ERROR:  cannot allocate vector of size 2.8 Gb
> 
> I don’t know much about computers – software, hardware, algorithms – but does this mean that the quantreg  package is creating some massive intermediate vector as it performs the rq function?   Quantile regression is something I’m just starting to explore, but believe it involves ordering data prior to the regression, which could be a huge job when using 11,000 records.   Does bigmemory have functionality to help me with this?
> 
> Thank you,
> Paul
> 
> 
> 
> 
> 
> 
> Paul Prew   ▪  Statistician
> 651-795-5942   ▪   fax 651-204-7504
> Ecolab Research Center   ▪  Mail Stop ESC-F4412-A
> 655 Lone Oak Drive   ▪   Eagan, MN 55121-1560
> 
> 
> 
> 
> CONFIDENTIALITY NOTICE: 
> This e-mail communication and any attachments may contain proprietary and privileged information for the use of the designated recipients named above. 
> Any unauthorized review, use, disclosure or distribution is prohibited. 
> If you are not the intended recipient, please contact the sender by reply e-mail and destroy all copies of the original message.
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list