[R] Computing/Interpreting Odds Ratios for 3-way interactions from lmer

Thu Jan 1 23:20:50 CET 2009

Hello, 
  I am a relative novice at both using regression analysis and at using R in general  (and at object oriented programing). A colleague convinced me that binary logistic regression is the most appropriate analysis for the data that I have though, so I've been trying to muddle through. 

I'm currently stumped on how to interpret/compute odds ratios for two and three way interactions when a variable has more than 2 factors.  
The study design is an examination of the use of past tense by 3 subject populations in 3 different sentence types and 2 locations within each sentence. 
I have been treating subject and main/subordinate verbs as random factors and group (age, sli, mlu) and condition (and, that, when) and clause order (first/second) as fixed factors.  In each case it is worth noting that the first factor is the reference group. I would like to be able to clearly interpret the interactions because specific hypotheses hinge on the results. 

Two other points that may be relevant: 1) The original design was balanced, but the current results are  unbalanced because of data loss (e.g. children failing to respond) and this is not randomly distributed across groups.  2) There is some colinearity between the conditions (corr between subord/when = .44)  and between the groups (corr between SLI/MLU =.48).  This is somewhat logical given the targets, but is not easily reduced. 

The syntax I'm using for the analysis is: 
clauseOPCyesI <- lmer(OPCorrect == "past" ~ group*Cond* Clause.Order + (1|SUBJ) + (1|sub.V) + (1|main.V), subset(a), family="binomial")

And the results I obtain are: 

Generalized linear mixed model fit by the Laplace approximation 
Formula: OPCorrect == "past" ~ group * Cond * Clause.Order + (1 | SUBJ) +    (1 | sub.V) + (1 | main.V) 
   Data: subset(a) 
  AIC  BIC logLik deviance
 5673 5817  -2816     5631
Random effects:
 Groups Name        Variance Std.Dev.
 main.V (Intercept) 0.056843 0.23842 
 SUBJ   (Intercept) 1.248441 1.11734 
 sub.V  (Intercept) 0.085521 0.29244 
Number of obs: 6827, groups: main.V, 47; SUBJ, 38; sub.V, 36

Fixed effects:
                                     Estimate Std. Error z value Pr(>|z|)    
(Intercept)                           2.19570    0.35057   6.263 3.77e-10 ***
groupm                                0.67645    0.52638   1.285   0.1988    
groups                               -0.59218    0.47534  -1.246   0.2128    
Condsubord                            0.27588    0.23297   1.184   0.2363    
Condwhen                             -0.17839    0.20278  -0.880   0.3790    
Clause.Ordersecond                   -0.10066    0.19965  -0.504   0.6141    
groupm:Condsubord                     0.38856    0.52295   0.743   0.4575    
groups:Condsubord                    -0.07662    0.29406  -0.261   0.7944    
groupm:Condwhen                      -1.82957    0.31892  -5.737 9.65e-09 ***
groups:Condwhen                      -1.42286    0.26601  -5.349 8.85e-08 ***
groupm:Clause.Ordersecond            -0.29247    0.33645  -0.869   0.3847    
groups:Clause.Ordersecond            -0.53254    0.26494  -2.010   0.0444 *  
Condsubord:Clause.Ordersecond        -0.60703    0.28965  -2.096   0.0361 *  
Condwhen:Clause.Ordersecond           1.64044    0.34265   4.788 1.69e-06 ***
groupm:Condsubord:Clause.Ordersecond -2.95511    0.60007  -4.925 8.45e-07 ***
groups:Condsubord:Clause.Ordersecond -0.04796    0.38893  -0.123   0.9019    
groupm:Condwhen:Clause.Ordersecond   -0.36423    0.48229  -0.755   0.4501    
groups:Condwhen:Clause.Ordersecond    0.05882    0.41775   0.141   0.8880    

While I know to talk about odds ratios I need to raise e to the estimate of the coefficient for main effects, I am less clear about interactions terms.  
So for instance, if I would like to say that the SLI group was X% less likely to produce a correct past tense for in the second clause than in the first clause do I add the estimates (groups+Clause.Ordersecond+groups:ClauseOrdersecond = -1.22538) and then compute the OR 0.293646094?  "The odds of the SLI group producing a past tense form in the second clause was approximately 29% as compared to their productions in the first clause"   Or since the reference group is their age matched peers would I need to say "The odds of the SLI group producing a past tense form in the second clause was approximately 29% as compared to the age-matched groups productions in the first clause of the target sentences".  (Note that the second sentence is much less useful to me than the first).  

Similarly if I would like to talk about the fact that Condwhen is relatively poorer for the MLU/SLI groups than the age matched-coordinate but I get muddled by the fact that there is  improvement in the Cond when, second clause  (as compared to condwhen-first clause-age-matched, right?).  Again, my trouble is with what to use as the reference group and how to separate that out in terms of computing percentages. 

Thanks so much for any assistance you can lend. 

Amanda 

Amanda J. Owen PhD CCC-SLP
Assistant Professor
Dept of Communication Sciences and Disorders
University of Iowa
319-335-6951 (office)
amanda-owen at uiowa.edu