[R] randomForest Tutorial

4 views
Skip to first unread message

Dirk Nemitz

unread,
Jul 22, 2008, 7:58:32 AM7/22/08
to r-h...@r-project.org
I am new to R and I'd like to use the randomForest package for my thesis
(identifying important variables for more detailed analysis with other
software). I have found extremely well written and helpful information on
the usage of R.

Unfortunately it seems to be very difficult to find similarly detailed
tutorials for randomForest, and I just can't get it work with the
information on
http://rss.acs.unt.edu/Rdoc/library/randomForest/html/randomForest.html

Can anybody give me a hint where to find a more detailed tutorial? I am
especially interested in preparation of data to use with randomForest (how
to structure the csv-file) and which information is mandatory for the
package to work. Maybe someone has a simple copy-paste example to share and
experiment?

Thank you for your attention,

Dirk

--
---------------------------------------
Dirk Nemitz
Zeppelinstr. 11a
37083 Göttingen
Germany

Tel: +49 (0)551 492 32 51
Mobil: +49 (0)175 709 31 92
---------------------------------------

[[alternative HTML version deleted]]

Jim Porzak

unread,
Jul 22, 2008, 9:28:06 AM7/22/08
to Dirk Nemitz, r-h...@r-project.org
Dirk,

Check out Andy Liaw/Mat Wiener's article in RNews 2002-3
www.r-project.org/doc/Rnews/Rnews_2002-3.pdf

Also my decks at www.porzak.com/JimArchive :
1. http://www.porzak.com/JimArchive/JimPorzak_CIwithR_useR2006_tutorial.pdf
2nd part is on randomForest
2. http://www.porzak.com/JimArchive/JimPorzak_RFwithR_DMAAC_Jan07_webinar.pdf
is all on randomForest

The latter two focus on the use of randomForest in marketing & on the
binary response problem

For the variable selection problem you mention, check out Ramon
Diaz-Uriate's package varSelRF on your local CRAN.

HTH,
Jim Porzak
Responsys, Inc.
San Francisco, CA
http://www.linkedin.com/in/jimporzak

> ______________________________________________
> R-h...@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

______________________________________________
R-h...@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Hans W. Borchers

unread,
Jul 22, 2008, 1:41:34 PM7/22/08
to r-h...@r-project.org

The help pages for 'randomForest' are quite clear in stating how to use the
'randomForest()' function. Input is a data frame (not a csv-file) and a
formula involving attribute names.

There is a kind of tutorial for classification and clustering with Random
Forests on Leo Breiman's web page
<http://www.stat.berkeley.edu/users/breiman/RandomForests/cc_home.htm>. It
describes the older code on which the R package is built, not the
implementation of RandomForest from Salford systems, so more or less what
you want.

And there is another implementation of random forests in the 'party'
package, see the 'cforest()' function. (The package varSelRF has already
been mentioned in a previous post.)

Hans Werner Borchers
ABB Corporate Research

Dirk Nemitz wrote:
>
> I am new to R and I'd like to use the randomForest package for my thesis
> (identifying important variables for more detailed analysis with other
> software). I have found extremely well written and helpful information on
> the usage of R.
>
> Unfortunately it seems to be very difficult to find similarly detailed
> tutorials for randomForest, and I just can't get it work with the
> information on
> http://rss.acs.unt.edu/Rdoc/library/randomForest/html/randomForest.html
>
> Can anybody give me a hint where to find a more detailed tutorial? I am
> especially interested in preparation of data to use with randomForest (how
> to structure the csv-file) and which information is mandatory for the
> package to work. Maybe someone has a simple copy-paste example to share
> and
> experiment?
>
> Thank you for your attention,
>
> Dirk
>

> ---------------------------------------
> Dirk Nemitz
> Zeppelinstr. 11a
> 37083 Göttingen
> Germany
>
> Tel: +49 (0)551 492 32 51
> Mobil: +49 (0)175 709 31 92
> ---------------------------------------
>
>

--
View this message in context: http://www.nabble.com/randomForest-Tutorial-tp18588688p18594872.html
Sent from the R help mailing list archive at Nabble.com.

Reply all
Reply to author
Forward
0 new messages