[R] Modelling categorical variables
Jessica Lavabre
jessicalavabre at gmail.com
Tue Sep 8 11:01:14 CEST 2015
Hi,
I am a beginner with statistics and R and have no clue on how to model my
data. I have collected information on seed traps (ID) that includes the
habitat type (Hab) and different measures of distances. Also I have applied
a modularity analysis, so that the seeds traps are grouped into modules. My
dataset is as follow:
*ID Hab Module DistEdge MeanDist1 MeanDist2 MeanDist3
F48 F A 21.768 24.941 6.033
27.642 F50 F E 35.666** 60.505 149.927 *
* 48.582 F52 F B** 12.243** 103.041
72.908 *
* 102.375 N02 N B ** 58.681**
129.59 127.344 *
* 131.383 N17 N B** 62.829** 72.827 **
76.736 *
* 77.644 N22 N B** 89.207** 78.719 **
75.005 *
* 81.176N33 N A** 23.288** 35.48 **
25.317 *
* 36.931 N40 N B** 36.734** 62.234 **
30.68 *
* 61.885 N47 N E ** 60.443** 66.367 **
150.892 ** 55.097 *
I am looking for a way to analyze if there is any correlation between the
Module classification and the other variables. My difficulties here are:
1 - is there a way to model my data where Module is the response variable
(something like Module~Hab*DistEdge*MeanDist1) ? If so, which model should
I use (I only have a bit of experience with glm) and which distribution?
2 - Is that a problem if I have different types of predictor variable
(factor and numerical)?
Any help would be greatly appreciated,
-- Jessica Lavabre-Micas
[[alternative HTML version deleted]]
More information about the R-help
mailing list