Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

Priors

Sometimes the proportions of classes in a training set do not reflect their true proportions in the population. You can inform Rattle of the population proportions and the resulting model will reflect these.

The priors can be used to ``boost'' a particularly important class, by giving it a higher prior probability, although this might best be done through the Loss Matrix.

In Rattle the priors are expressed as a list of numbers that sum up to 1, and of the same length as the number of classes in the training dataset. An example for binary classification is 0.5,0.5.

The default priors are set to be the class proprtions as found in the training dataset.



Copyright © Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
PDF version is properly formatted and forms a comprehensive book (draft with over 600 pages).
Brought to you by Togaware.