[R] equivalent of Matlab robustfit?
Darren Weber
darren.weber.lists at gmail.com
Wed Nov 21 07:26:55 CET 2007
Hi,
I've been using the Matlab robustfit function for linear regressions
where I suspect some data points are outliers. Is there an equivalent
function in R?
Take care, Darren
PS, This is the Matlab help on robustfit:
>> help robustfit
ROBUSTFIT Robust linear regression
B = ROBUSTFIT(X,Y) returns the vector B of regression coefficients,
obtained by performing robust regression to estimate the linear model
Y = Xb. X is an n-by-p matrix of predictor variables, and Y is an
n-by-1 vector of observations. The algorithm uses iteratively
reweighted least squares with the bisquare weighting function. By
default, ROBUSTFIT adds a column of ones to X, corresponding to a
constant term in the first element of B. Do not enter a column of ones
directly into the X matrix.
B = ROBUSTFIT(X,Y,'WFUN',TUNE) uses the weighting function 'WFUN' and
tuning constant TUNE. 'WFUN' can be any of 'andrews', 'bisquare',
'cauchy', 'fair', 'huber', 'logistic', 'talwar', or 'welsch'.
Alternatively 'WFUN' can be a function that takes a residual vector as
input and produces a weight vector as output. The residuals are scaled
by the tuning constant and by an estimate of the error standard
deviation before the weight function is called. 'WFUN' can be
specified using @ (as in @myfun). TUNE is a tuning constant that is
divided into the residual vector before computing the weights, and it
is required if 'WFUN' is specified as a function.
B = ROBUSTFIT(X,Y,'WFUN',TUNE,'CONST') controls whether or not the
model will include a constant term. 'CONST' is 'on' (the default) to
include the constant term, or 'off' to omit it.
[B,STATS] = ROBUSTFIT(...) also returns a STATS structure
containing the following fields:
'ols_s' sigma estimate (rmse) from least squares fit
'robust_s' robust estimate of sigma
'mad_s' MAD estimate of sigma; used for scaling
residuals during the iterative fitting
's' final estimate of sigma, the larger of robust_s
and a weighted average of ols_s and robust_s
'se' standard error of coefficient estimates
't' ratio of b to stats.se
'p' p-values for stats.t
'covb' estimated covariance matrix for coefficient estimates
'coeffcorr' estimated correlation of coefficient estimates
'w' vector of weights for robust fit
'h' vector of leverage values for least squares fit
'dfe' degrees of freedom for error
'R' R factor in QR decomposition of X matrix
The ROBUSTFIT function estimates the variance-covariance matrix of the
coefficient estimates as V=inv(X'*X)*STATS.S^2. The standard errors
and correlations are derived from V.
ROBUSTFIT treats NaNs in X or Y as missing values, and removes them.
Example:
x = (1:10)';
y = 10 - 2*x + randn(10,1); y(10) = 0;
bls = regress(y,[ones(10,1) x])
brob = robustfit(x,y)
scatter(x,y)
hold on
plot(x,brob(1)+brob(2)*x,'r-', x,bls(1)+bls(2)*x,'m:')
See also REGRESS, ROBUSTDEMO.
More information about the R-help
mailing list