Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

Bibliographic Notes

Caruana & Niculescu-Mizil (2006) present a comprehensive empirical comparison of many of the modern model builders. An older comparison is known as the Statlog comparison (King et al., 1995).

The ada package for boosting was implemented by Mark Culp, Kjell Johnson, and George Michailidis, and is described in Culp et al. (2006).

Random forests were introduced by Breiman (2001), building on the concept of bagging (Breiman, 1996) and the random subspace method for decision forests (Ho, 1998). Breiman observed that ``random forests do not overfit.''



Copyright © 2004-2008 Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
PDF version is properly formatted and forms a comprehensive book (draft with over 600 pages).
Brought to you by Togaware.