|
DATA MINING
Desktop Survival Guide by Graham Williams |
|
|||
Bibliographic Notes |
Caruana & Niculescu-Mizil (2006) present a comprehensive empirical comparison of many of the modern model builders. An older comparison is known as the Statlog comparison (King et al., 1995).
The ada package for boosting was implemented by Mark Culp, Kjell Johnson, and George Michailidis, and is described in Culp et al. (2006).
Random forests were introduced by Breiman (2001), building on the concept of bagging (Breiman, 1996) and the random subspace method for decision forests (Ho, 1998). Breiman observed that ``random forests do not overfit.''