[R] summarized data set - how to use an "occurs" field

David Winsemius dwinsemius at comcast.net
Sat Jul 16 03:38:54 CEST 2011


On Jul 15, 2011, at 6:10 PM, mloxton wrote:

> I have a data set with 22 fields and several thousand records in which
> one field (count) indicates the number of times that each specific
> combination of the other 21 fields occurred in a bigger and largely
> unavailable data set.
> So each record is unique in its combination of field values and has a
> field that identifies how many multiples of this record actually
> occurred.
>
> Without resorting to writing a program that re-expands the data set to
> several million rows by cloning each row by the number of times the
> "count" field indicated, is there a way in R to use that field to come
> up with summary stats and bargraphs of the distribution of any one of
> the other fields?
>

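xtabs() takes the count column on the left-hand side of the formula and
sums it within each level of the right-hand side, so no expansion is needed:

 > dfrm <- expand.grid(A=1:3, B=1:3)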
 > dfrm$counts <- 1:9
 > xtabs(counts~A, data=dfrm)
A
  1  2  3
12 15 18

 > barplot(xtabs(counts~A, data=dfrm), xlab="Counts by A level")
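
For the "summary stats" part of the question, the same count column can be used as a weight. What follows is only a sketch on the toy data above, not something tested on the real 22-field data set. The names wmean, wmedian and prop are made up for illustration. The weighted median here is taken as the smallest value whose cumulative count reaches half the total, which is an assumption about the definition wanted; it can differ from median() on the expanded rows when the total count is even, and it assumes the field is numeric.

```r
# Same toy data as above: one row per unique combination, plus a count
dfrm <- expand.grid(A = 1:3, B = 1:3)
dfrm$counts <- 1:9

# Weighted mean of B, using the counts as weights
# (same value as the mean of B over the fully expanded rows)
wmean <- weighted.mean(dfrm$B, w = dfrm$counts)

# Total count per level of B, then the cumulative counts
tab <- xtabs(counts ~ B, data = dfrm)
cum <- cumsum(tab)

# Weighted median: smallest B whose cumulative count reaches half the total
wmedian <- as.numeric(names(tab))[which(cum >= sum(tab) / 2)[1]]

# Proportion of all underlying records falling in each level of B
prop <- prop.table(tab)
```

On a small example the results can be checked against a brute-force expansion, e.g. mean(rep(dfrm$B, dfrm$counts)), which is cheap here even though it would not be for several million rows.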

-- 

David Winsemius, MD
West Hartford, CT


