[R] simple problem, but not for me
Spencer Graves
spencer.graves at pdf.com
Sat Mar 19 19:56:52 CET 2005
The suggestions by Uwe and Rolf use some of the subtler features of
R. A simpler (to me) if more tedious approach is provided by the
following:
Data <- data.frame(a1=1:4, a2=5:8, a3=9:12)
Data$b1 <- Data$a1/(Data$a1+Data$a2+Data$a3)
Data$b2 <- Data$a2/(Data$a2+Data$a3)
> Data
a1 a2 a3 b1 b2
1 1 5 9 0.06666667 0.3571429
2 2 6 10 0.11111111 0.3750000
3 3 7 11 0.14285714 0.3888889
4 4 8 12 0.16666667 0.4000000
The above can be written in fewer characters using "with":
Data$b1 <- with(Data, a1/(a1+a2+a3))
Data$b2 <- with(Data, a2/(a2+a3))
If you want something that will work with an arbitrary number of
columns, consider the following:
Data <- data.frame(a1=1:4, a2=5:8, a3=9:12)
dat2 <- Data
k <- length(dat2)
for(i in (k-1):1)
dat2[[i]] <- (dat2[[i]]+dat2[[i+1]])
Dat2 <- cbind(Data, Data/dat2)
names(Dat2)[k+(1:k)] <- paste("b", 1:k, sep="")
> Data
a1 a2 a3
1 1 5 9
2 2 6 10
3 3 7 11
4 4 8 12
> dat2
a1 a2 a3
1 15 14 9
2 18 16 10
3 21 18 11
4 24 20 12
> Dat2
a1 a2 a3 b1 b2 b3
1 1 5 9 0.06666667 0.3571429 1
2 2 6 10 0.11111111 0.3750000 1
3 3 7 11 0.14285714 0.3888889 1
4 4 8 12 0.16666667 0.4000000 1
If you want to do this more than once, you can wrap the above code
in a function; see "Writing your own functions" in "An Introduction to
R", available, e.g., vial help.start().
hope this helps.
spencer graves
p.s. The posting guide "http://www.R-project.org/posting-guide.html"
also seems quite valuable. In an discussion on (and off) this list
earlier this week, several people reported they had found solutions to
their own problems in the process of preparing a question to post to
this list. And even if you don't find a solution, the result will more
likely elicit useful replies.
Rolf Turner wrote:
>alexbri at netcabo.pt wrote:
>
>
>
>>Hello, I'm new in R and I want to do one thing that is very easy in
>>excel, however, I cant do it in R.
>>
>>
>
> Well, if you've deadened your brain by using Excel,
> no wonder.
>
>
>
>>Suppose we have the data frame:
>>
>>data<- data.frame(A=c("a1","a2","a3","a4","a5"))
>>
>>
>
> Oh, for Pete's sake! This makes ``data'' (NOT a good name
> for an object!) into a data frame with a single column named
> ``A''. That column will be a factor with 5 entries (an 5
> levels) with these levels being (the character strings)
> "a1","a2","a3","a4", and "a5".
>
> Nothing to do with what you actually want.
>
>
>
>>I need to obtain another column in the same data frame (lets say
>>B=c(b1,b2,b3,b4,b5) in the following way:
>>
>>
>
> This would make B a ***vector*** equal to the concatenation
> of b1, ..., b5.
>
> Perhaps you mean:
>
> B <- cbind(b1,b2,b3,b4,b5)
>
>
>
>>b1=a1/(a1+a2+a3+a4+a5)
>>
>>b2=a2/(a2+a3+a4+a5)
>>
>>b3=a3/(a3+a4+a5)
>>
>>b4=a4/(a4+a5)
>>
>>b5=a5/a5
>>
>>a1..a5 and b1...b5 are always numeric values
>>
>>(this is just an example, what I really want is apply this kind of
>>formula to a much larger data frame)
>>
>>
>>
> You are very confused. Your notation and your use of
> the function c() are all wrong.
>
> If you are going to use R, get the basic syntax straight.
>
> You probably should be using matrices rather than data frames
> given that the entries are all numeric.
>
> Be that as it may, if A is a (numeric) matrix then
>
> B <- A/t(apply(A,1,function(x){rev(cumsum(x))}))
>
> will give what you appear to want.
>
> cheers,
>
> Rolf Turner
> rolf at math.unb.ca
>
>______________________________________________
>R-help at stat.math.ethz.ch mailing list
>https://stat.ethz.ch/mailman/listinfo/r-help
>PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
>
>
More information about the R-help
mailing list