[R] machine learning and horse racing

Moshe Olshansky m_olshansky at yahoo.com
Tue Sep 18 03:59:12 CEST 2007


Hi Stephen,

How many variables do you have?  How many of them are
categorical?
How many observations do you have?
Since I am not a racing expert, in how many races a
typical horse participates? How many years does it
usually span?

In the past I had a good experience with Random
Forest. There exists a RandomForest package in R. If
you run out of memory and do not mind to spend some
time you can try the original Fortran code (after
trying the R package without saving the forest).

Regards,

Moshe.

--- stephenc at ics.mq.edu.au wrote:

> Hi
> 
>  I am trying to use various techniques (eg svm,
> logistic regression,
> neural networks) to classify and predict the outcome
> of horse races.
> 
>  Most of my predictive features are categorical  -
> horse, jockey, trainer 
> - and I keep on running out of memory owing to the
> size of the vector.
> 
>  Does anyone know how to solve the problem?
> 
>  I have classified the outcomes as win/lose or
> place/lose with a view to
> train on x years of results and then testing on the
> subsequent years
> results. Is there some alternate way of looking at
> the problem?
> 
>  Does anyone have pointers to published work in this
> area?
> 
>  Thanks.
> 
>  Stephen
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained,
> reproducible code.
>



More information about the R-help mailing list