Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

V

Validation Set: A portion of a dataset that is used to test the performance of a model as it is being built. See also Training Set and Refer[testset]Test Set.

Variance: The error in a model due to the fact that the model is built from only a sample of the data taken to represent the whole population. That is, those instances occasionally incorrectly classified by models built by the learning algorithm. Modelling error due to variance can be reduced using Bagging. Compare with Refer[bias]bias.



Copyright © Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
PDF version is properly formatted and forms a comprehensive book (draft with over 700 pages).
Brought to you by Togaware. This page generated: Tuesday, 23 December 2008