[R] Replacing missing values
Mark Myatt
mark at myatt.demon.co.uk
Wed May 9 12:20:15 CEST 2001
Jean Vidal <jean.vidal at freesurf.fr> writes:
>I'm discovering R (very impressive), and didn't find in the docs a simple
>method for replacing, in a data frame, missing values (NA) with the
>column's mean (or any other method for reconstructing missing values when
>needed).
>Thanks in advance for your help.
There are a series of na.* Functions in R but they are not well
documented (which is my way of saying "I can't work out how to make them
work"!).
I use indexing to deal with missing values. For example:
var[var == -99] <- NA
To replace a missing value code (e.g. -99) with NA. Replacing with an
imputed value can be done in the same manner but with the imputation
function on the RHS of the assignment. For example:
var[is.na(var)] <- mean(var, na.rm = TRUE)
If var is a vector in a data.frame then you need to specify the
data.frame:
df$var[is.na(df$var)] <- mean(df$var, na.rm = TRUE)
I hope that helps.
Mark
BTW: While I am here, can anyone explain how the na.* functions work?
--
Mark Myatt
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
More information about the R-help
mailing list