[R] testing independence of categorical variables
David Winsemius
dwinsemius at comcast.net
Fri Nov 23 02:47:08 CET 2007
"Shoaaib Mehmood" <shoaaib at gmail.com> wrote in
news:ab02bb240711220316q25e0bbd6rd2de31610c245422 at mail.gmail.com:
> hi,
>
> is there a way of calculating of measuring dependence between two
> categorical variables. i tried using the chi square test to test for
> independence but i got error saying that the lengths of the two
> vectors don't match. Suppose X and Y are two factors. X has 5 levels
> and Y has 7 levels. This is what i tried doing
>
>>temp<-chisq.test(x,y)
>
> but got error "the lengths of the two vectors don't match". any help
> will be appreciated
If you posted the table, it might be more clear why the error was being
thrown. In the example shown you have mixed "x" and "X". They would be
different in R.
chisq.test should not be having a problem with unequal row and column
lengths.
#simulate a 5 x 7 table
> TT<-r2dtable(1,5*c(1,8,5,8,4),5*c(3,3,3,3,4,4,6))
> TT
[[1]]
[,1] [,2] [,3] [,4] [,5] [,6] [,7]
[1,] 0 1 1 0 2 1 0
[2,] 3 3 6 6 2 8 12
[3,] 1 2 3 3 9 2 5
[4,] 8 3 3 3 6 7 10
[5,] 3 6 2 3 1 2 3
#general test for association
> chisq.test(TT[[1]],TT[[2]])
Pearson's Chi-squared test
data: TT[[1]]
X-squared = 33.5942, df = 24, p-value = 0.09214
Warning message:
In chisq.test(TT[[1]], TT[[2]]) :
Chi-squared approximation may be incorrect
--
David Winsemius
More information about the R-help
mailing list