[R] overdispersed zero inflated continuous data
Ben Bolker
bbolker at gmail.com
Thu May 8 23:54:43 CEST 2014
Jenushka <jhazlehu <at> tulane.edu> writes:
>
> I'm a beginning R user.
>
> The data: Volume of nectar in flowers under 4 different treatments,
> nested
> for individual (measures were taken mutliple times from different
> flowers of
> the same individual- never the same flower).
This is really more of a statistical question than an R question,
and might get more informed answers at r-sig-ecology at r-project.org,
but:
>
> Specs: 54% of the data = 0. Variance=3.89. Mean=1.03. Sample size per
> treatment varies, all are >20.
>
> I am trying to determine if treatment had any impact on nectar volume.
>
Since it will be hard to transform the data to normality (since
you have a big pile of responses equal to exactly zero), one possibility
would be a two-stage model: fit a binomial GL(M)M for zero vs non-zero,
then fit a linear (mixed) model to the (probably log-transformed) positive
responses.
Alternatively you could ignore the distribution and use a randomization
(permutation or bootstrap) approach to get reasonable p-values/confidence
intervals, although you'll have to be careful to do the randomization
respecting the grouping by individual.
Ben Bolker
More information about the R-help
mailing list