[R] Filling Lists or Arrays of variable dimensions
David Winsemius
dwinsemius at comcast.net
Thu Dec 20 22:39:21 CET 2012
On Dec 20, 2012, at 10:01 AM, Jessica Streicher wrote:
> Really must have been unclear at some point, sorry.
Hasn't it become abundantly clear that this would have progress farther had you post a complete example?
--
David.
>
> William, thats interesting, but not really helping the main problem, which is: how to do
>
>> l[[ as.character(grid[1, ]) ]] <- 1
>
> without having initialized the list in the loop before.
>
> Well, or how to initialize it without having to do the loop thing, because the loop stuff can only be done for a specific set of parameter vectors. But those change, and i don't want to have to write another loop construct every time for the new version.
>
> I want to say: hey, i have these vectors here with these values (my parameters), could you build me that nested list structure (tree - whatever) from it? And the function will give me that structure whatever i give it without me needing to intervene in form of changing the code.
>
> -------------- Clarification -----------------
>
> First: i am not computing statistics over the parameters. I'm computing stuff from other data, and the computation is affected by the parameters.
>
> I am computing classifiers for different sets of parameters for those classifiers. So the result of doSomething() isn't a simple value. Its usually a list of 6 lists (doing cross validation), which in turn have the classifier object, some statistics of the classifier (e.g what was missclassified), and the subsets of data used in them.
> That doesn't really fit in a data.frame, hence the use of lists. I want the nested lists because it helps me find stuff in the object browser faster, and because all my other code is already geared towards it. If i had the time i might still go for a flat structure that everyone keeps telling me to use (got a few mails off the list),
> but i really haven't the time.
>
> If theres no good way i'll just keep things as they are now.
>
>
> On 20.12.2012, at 18:37, William Dunlap wrote:
>
>> Arranging data as a list of lists of lists of lists [...] of scalar values generally
>> will lead to slow and hard-to-read R code, mainly because R is designed to
>> work on long vectors of simple data. If you were to start over, consider constructing
>> a data.frame with one column for each attribute. Then tools like aggregate and
>> the plyr functions would be useful.
>>
>> However, your immediate problem may be solved by creating your 'grid' object
>> as a data.frame of character, not factor, columns because as.character works differently
>> on lists of scalar factors and lists of scalar characters. Usually as.<mode>(x), when
>> x is a list of length-1 items, gives the same result as as.<mode>(unlist(x)), but not when
>> x is a list of length-1 factors:
>>
>>> height<-c("high", "low")
>>> width<-c("slim", "wide")
>>> gridF <- expand.grid(height, width, stringsAsFactors=FALSE)
>>> gridT <- expand.grid(height, width, stringsAsFactors=TRUE)
>>> as.character(gridF[1,])
>> [1] "high" "slim"
>>> as.character(gridT[1,])
>> [1] "1" "1"
>>> as.character(unlist(gridT[1,])) # another workaround
>> [1] "high" "slim"
>>
>> Your example was not self-contained so I changed the call to doSomething() to paste(h,w,sep="/"):
>>
>> height<-c("high", "low")
>> width<-c("slim", "wide")
>>
>> l <- list()
>> for(h in height){
>> l[[h]] <- list()
>> for(w in width){
>> l[[h]][[w]] <- paste(h, w, sep="/") # doSomething()
>> }
>> }
>>
>> grid <- expand.grid(height, width, stringsAsFactors=FALSE)
>> as.character(grid[1,])
>> # [1] "high" "slim", not the [1] "1" "1" you get with stringsAsFactors=TRUE
>> l[[ as.character(grid[1, ]) ]]
>> # [1] "high/slim"
>> l[[ as.character(grid[1, ]) ]] <- 1
>> l[[ as.character(grid[1, ]) ]]
>> # [1] 1
>>
>> Bill Dunlap
>> Spotfire, TIBCO Software
>> wdunlap tibco.com
>>
>>> -----Original Message-----
>>> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf
>>> Of Jessica Streicher
>>> Sent: Thursday, December 20, 2012 8:43 AM
>>> To: Chris Campbell
>>> Cc: R help
>>> Subject: Re: [R] Filling Lists or Arrays of variable dimensions
>>>
>>> Aggregate is highly confusing (and i would have appreciated if you used my example
>>> instead, i don't get it to do anything sensible on my stuff).
>>>
>>> And this seems not what i asked for anyway. This may be a named list but not named and
>>> structured as i want it at all.
>>>
>>> happy Christmas too
>>>
>>> On 20.12.2012, at 15:48, Chris Campbell wrote:
>>>
>>>> Dear Jessica
>>>>
>>>> Aggregate is a function that allows you to perform loops across multiple variables.
>>>>
>>>> tempData <- data.frame(height = rnorm(20, 100, 10),
>>>> width = rnorm(20, 50, 5),
>>>> par1 = rnorm(20))
>>>>
>>>> tempData$htfac <- cut(tempData$height, c(0, 100, 200))
>>>> tempData$wdfac <- cut(tempData$width, c(0, 50, 100))
>>>>
>>>> doSomething <- function(x) { mean(x) }
>>>>
>>>> out <- aggregate(tempData["par1"], tempData[c("htfac", "wdfac")], doSomething)
>>>>
>>>> # out is a data frame; this is a named list.
>>>> # use as.list to remove the data.frame class
>>>>
>>>>> as.list(out)
>>>>
>>>> $htfac
>>>> [1] (0,100] (100,200] (0,100] (100,200]
>>>> Levels: (0,100] (100,200]
>>>>
>>>> $wdfac
>>>> [1] (0,50] (0,50] (50,100] (50,100]
>>>> Levels: (0,50] (50,100]
>>>>
>>>> $par1
>>>> [1] -1.0449563 -0.3782483 -0.9319105 0.8837459
>>>>
>>>>
>>>
>>>>
>>>> I believe you are seeing an error similar to this one:
>>>>
>>>>> out[[1:3]] <- 1
>>>> Error in `[[<-`(`*tmp*`, i, value = value) :
>>>> recursive indexing failed at level 2
>>>>
>>>> This is because double square brackets for lists can only set a single list element at
>>> once; grid[1, ] is longer.
>>>
>>>>
>>>> Happy Christmas
>>>>
>>>> Chris
>>>>
>>>>
>>>> Chris Campbell
>>>> Tel. +44 (0) 1249 705 450 | Mobile. +44 (0) 7929 628 349
>>>> mailto:ccampbell at mango-solutions.com | http://www.mango-solutions.com
>>>> Mango Solutions
>>>> 2 Methuen Park
>>>> Chippenham
>>>> Wiltshire
>>>> SN14 OGB
>>>> UK
>>>>
>>>> -----Original Message-----
>>>> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf
>>> Of Jessica Streicher
>>>> Sent: 20 December 2012 12:46
>>>> To: R help
>>>> Subject: [R] Filling Lists or Arrays of variable dimensions
>>>>
>>>> Following problem:
>>>>
>>>> Say you have a bunch of parameters and want to produce results for all combinations
>>> of those:
>>>>
>>>> height<-c("high","low")
>>>> width<-c("slim","wide")
>>>>
>>>> then what i used to do was something like this:
>>>>
>>>> l<-list()
>>>> for(h in height){
>>>> l[[h]]<-list()
>>>> for(w in width){
>>>> l[[h]][[w]] <- doSomething()
>>>> }
>>>> }
>>>>
>>>> Now those parameters aren't always the same. Their number can change and the
>>> number of entries can change, and i'd like to have one code that can handle all
>>> configurations.
>>>>
>>>> Now i thought i could use expand.grid() to get all configurations ,and than iterate over
>>> the rows, but the problem then is that i cannot set the values in the list like above.
>>>>
>>>> grid<-expand.grid(height,width)
>>>> l[[as.character(grid[1,])]] <-1
>>>> Error in `[[<-`(`*tmp*`, as.character(grid[1, ]), value = 1) :
>>>> no such index at level 1
>>>>
>>>> This will only work if the "path" for that is already existent, and i'm not sure how to
>>> build that in this scenario.
>>>>
>>>> I then went on and built an array instead lists of lists, but that doesn't help either
>>> because i can't access the array with what i have in the grids row - or at least i don't
>>> know how.
>>>>
>>>> Any ideas?
>>>>
>>>> I'd prefer to keep the named lists since all other code is built towards this.
>>>> ______________________________________________
>>>> R-help at r-project.org mailing list
>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>
>>>> --
>>>>
>>>> LEGAL NOTICE
>>>>
>>>> This message is intended for the use of the named recipient(s) only and may contain
>>>> confidential and / or privileged information. If you are not the intended recipient,
>>> please
>>>> contact the sender and delete this message. Any unauthorised use of the information
>>>> contained in this message is prohibited.
>>>>
>>>> Mango Business Solutions Limited is registered in England under No. 4560258 with its
>>>> registered office at Suite 3, Middlesex House, Rutherford Close, Stevenage, Herts, SG1
>>> 2EF,
>>>> UK.
>>>>
>>>> PLEASE CONSIDER THE ENVIRONMENT BEFORE PRINTING THIS EMAIL
>>>>
>>>
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius
Alameda, CA, USA
More information about the R-help
mailing list