[R] Help Required in using cast (reshape package) function
Punit Anand
punit.anand at lusight.com
Wed Nov 21 17:55:45 CET 2007
Hello everyone,
Since the fields in variables column are unique with respect to ID and
fiscal year; any function like
sum,min,max,mean etc will lead to the desired result
Therefore cast(dataread, ID + Period ~ variable,sum)
Will lead to the desired result in my case;
Thanks,
Punit
-----Original Message-----
From: Punit Anand
Sent: Wednesday, November 21, 2007 11:21 AM
To: 'r-help at r-project.org'
Subject: Help Required in using cast (reshape package) function
Hello everyone,
I am new to R. I have data in the form of excel pivot table format and I
want to cast it into a format which can make it compatible with
computation.
Since I already have the package in pivot format; I avoid melt function
and use the cast directly.
I inspect the
dataread <- read.csv(".....", header=TRUE)
Data in the format
Id Region Country Industry
Period variable value
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "account payable" 10000
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "account receivable" 50000
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "XXXX" 70000
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "YYYY" "NA"
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "ZZZZ"
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "AAAA" 10000
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" "ccccccc" 10000
# I want to arrange data in the format
Id Region Country Industry
Period "account payable" "account receivable" XXXX" "YYYYY"
"ZZZZ" "AAAA" "CCCCC"
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" 10000 50000 70000 "NA" 10000
10000
# casting the data
cast <- cast(dataread, ID + Period ~ variable)
When I do that the data is casted as a pivot with a warning "Aggregation
requires fun.aggregate: length used as default", and the casted data
gives me the count of variables (as suggested by the warning )
Id Region Country Industry
Period "account payable" "account receivable" XXXX" "YYYYY"
"ZZZZ" "AAAA" "CCCCC"
"Aaa11 xx" "Latin America" "Mexico" "food & beverages"
"2002:FY" 1 1 1 NA NA 10000
10000
How do I use the fun.aggregate feature of the cast function to obtain
the desired result?
Moreover, I want to subdivide the casted data into subsets based on ids.
How do I achieve that?
Thanks in advance,
Punit Anand
More information about the R-help
mailing list