[R] calculate column means when missing data is present
Luke Neraas
lukasneraas.r at gmail.com
Tue Nov 27 01:59:27 CET 2007
# Hello,
# I have a matrix that contains some missing values denoted by NA.
# I would like to calculate the mean for each column and not have the
# NA values included. Here is a sample matrix "X":
c1 <- c(1,4,3,2,NA,2,5,6,4,2)
c2 <- c(1,3,6,NA,NA,2,9,6,1,2)
c3 <- c(2,4,2,NA,4,2,NA,6,3,5)
c4 <- c(2,6,4,NA,4,2,NA,6,1,5)
c5 <- c(2,3,2,NA,4,2,9,6,NA,5)
X <- cbind(c1, c2, c3, c4, c5)
X
# I have tried a variety of methods to calculate the column means, but each time
# I do not get the result I am looking for.
result <- colMeans(X) # returns NA for every column, because every column contains an NA
X <- X[!is.na(X)] # removes the NA values, but X becomes a plain vector, so I
                  # can no longer calculate column means
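# A note on why the subsetting attempt loses the columns: indexing a matrix
# with a logical matrix returns a plain vector, so the dim attribute is dropped.
# A minimal sketch on a small hypothetical matrix "m" (not part of the data above):

```r
# Hypothetical 3 x 2 matrix with one NA in each column
m <- cbind(a = c(1, NA, 3), b = c(4, 5, NA))

# Logical-matrix indexing keeps the non-NA values in column-major order,
# but returns them as an ordinary vector
v <- m[!is.na(m)]

dim(m)   # the matrix has dimensions (3 rows, 2 columns)
dim(v)   # NULL: the result has no rows or columns left
```

# Once the dimensions are gone, column-wise functions such as colMeans()
# have nothing to work on, which is why the NA values need to be skipped
# per column rather than removed from the whole matrix.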
# Below is the result of the column means I am looking for
result
c1 c2 c3 c4 c5
[1,] 3.22 3.75 3.5 3.75 4.125
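# One possible way to get these values, offered as a sketch rather than the
# definitive answer: colMeans() accepts an na.rm argument, and mean() does too,
# so it can also be used through apply(). The block rebuilds the sample matrix
# so that it can be run on its own; the names "result2" and "X" as rebuilt here
# are only for illustration.

```r
# Rebuild the sample matrix so this block is self-contained
c1 <- c(1, 4, 3, 2, NA, 2, 5, 6, 4, 2)
c2 <- c(1, 3, 6, NA, NA, 2, 9, 6, 1, 2)
c3 <- c(2, 4, 2, NA, 4, 2, NA, 6, 3, 5)
c4 <- c(2, 6, 4, NA, 4, 2, NA, 6, 1, 5)
c5 <- c(2, 3, 2, NA, 4, 2, 9, 6, NA, 5)
X <- cbind(c1, c2, c3, c4, c5)

# na.rm = TRUE skips the NA entries within each column before averaging
result <- colMeans(X, na.rm = TRUE)
result

# Same idea with apply(): call mean() on each column (MARGIN = 2)
result2 <- apply(X, 2, mean, na.rm = TRUE)
result2
```

# Each column mean is then taken over only the non-missing entries of that
# column, so columns with different numbers of NA values use different divisors.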
# Any help or ideas would be greatly appreciated.
# Thanks in advance,
Luke Neraas
lukasneraas.r at gmail.com
University of Alaska Fairbanks
School of Fisheries and Ocean Sciences
11120 Glacier Highway
UAF Fisheries Division
Juneau, AK 99801