[R] Using pam, agnes or clara as prediction models?

Renald Buter buter at cwts.leidenuniv.nl
Thu Jan 15 09:07:01 CET 2004

On Wed, Jan 14, 2004 at 03:18:10PM -0500, Liaw, Andy wrote:
> If pam produces the cluster medoids, you should be able to use the
> 1-nearest-neighbor classifier for prediction of future data, using the
> medoids as the `training' data.  1-NN is available in the `class' package,
> part of the `VR' bundle.

Thanks very much for your quick answer! I've tried your suggestion in
the following way:

 # separate the ruspini data into train and test set
 > train<-ruspini[1:50,]
 > test<-ruspini[51:75,]
 > pamx<-pam(train,4)
 > knnx<-knn(pamx$medoids,test,factor(c("a","b","c","d")),k=3)
 > knnx
 [1] d d b b d c b c c d c a a d c c a a c a a d c d a
 Levels: a b c d

But the result of applying the test set to the knn should only contain 2
clusters, since the upper half of the ruspini data contains only 2

Could you tell me what I am missing here?



More information about the R-help mailing list