[R] cforest - keep.forest = false option? (fwd)

Torsten Hothorn Torsten.Hothorn at stat.uni-muenchen.de
Wed Jul 20 09:41:24 CEST 2011


> ---------- Forwarded message ----------
> Date: Mon, 18 Jul 2011 10:17:00 -0700 (PDT)
> From: KHOFF <kuphoff at gmail.com>
> To: r-help at r-project.org
> Subject: [R] cforest - keep.forest = false option?
>
> Hi,
>
> I'm very new to R. I am most interested in the variable importance 
> measures
> that result from randomForest, but many of my predictors are 
> highly
> correlated. My first question is:
>
> 1. do highly correlated variables render variable importance 
> measures in
> randomForest invalid?
>

that depends on your idea of "valid". A number of papers 
published over the last years explore this question and
you should read the relevant literature first.

> and 2. I know that cforest is robust to highly correlated 
> variables,
> however, I do not have enough space on my machine to run cforest. 
> I used the
> keep.forest = false option when using randomForest and that solved 
> my space
> issue. Is there a similar option for cforest (besides 
> savesplitstats =
> FALSE, which isn't helping)

no. party was designed as a flexible research tool and is
not optimized wrt speed or memory consumption.

Best,

Torsten

>
> below is my code and error message
>
> Thanks in advance!
>
>> fit <- cforest(formula = y ~ x1 + x2+ x3+ x4+ x5+
> + x6+ x7+ x8+ x9+ x10, data=data, control= 
> cforest_unbiased(savesplitstats =
> FALSE, ntree = 50, mtry = 5)
>
> 1: In mf$data <- data :
>  Reached total allocation of 3955Mb: see help(memory.size)
> 2: In mf$data <- data :
>  Reached total allocation of 3955Mb: see help(memory.size)
>
>
> --
> View this message in context: 
> http://r.789695.n4.nabble.com/cforest-keep-forest-false-option-tp3675921p3675921.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide 
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



More information about the R-help mailing list