Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

List of Figures


  1. The R command line.
  2. Initial Rattle window.
  3. Loading the weather.csv file.
  4. Decision tree model of the weather dataset.
  5. Decision tree plot.
  6. Explore tab's distribution plots.
  7. Explore tab's distribution plots- Categorics.
  8. Initial steps of the data mining process (Tony Nolan)
  9. The data mining process
  10. A sample of plots
  11. Loading the weather.csv dataset.
  12. The CSV options of the Data tab.
  13. The CSV file chooser
  14. Data tab ARFF option
  15. Loading data through an ODBC database connection
  16. Teradata ODBC connection
  17. Netezza ODBC connection
  18. Netezza configuration
  19. Loading an R binary data file.
  20. Loading an already defined R data frame
  21. Selected region of a spreadsheet copied to the clipboard
  22. Loading an R data frame originally from the clipboard
  23. Data entry spreadsheet
  24. Select tab choosing Adjusted as a Risk variable.
  25. Loading the weather.csv dataset.
  26. The CSV option of the Data tab.
  27. The CSV file chooser
  28. Data tab ARFF option
  29. Loading data through an ODBC database connection.
  30. Netezza ODBC connection
  31. Loading an already defined R data frame XXXX update XXXX.
  32. Selected region of a spreadsheet copied to the clipboard.
  33. Loading an R data frame originally from the clipboard
  34. Loading an R binary data file.
  35. Select tab choosing Adjusted as a Risk variable.
  36. Missing value summary for a modified version of the audit dataset.
  37. Benford stratified by Marital and Gender.
  38. Mosaic plot of Age by Adjusted.
  39. Correlations between keywords in documents.
  40. Transform options.
  41. Selection of normalisations performed on Income.
  42. Original distribution of Age.
  43. Normalisations of Age.
  44. Selection of imputations.
  45. Imputation using the mode for missing values of Age.
  46. Binning Age.
  47. Distributions of binned Age.
  48. Turning Gender into an Indicator Variable.
  49. Selection of cleanup operations.
  50. External data change.
  51. KMeans Iteration Interface
  52. KMeans Iteration Plot
  53. Informational dialog.
  54. A sample Cost Curve for a random forest.
  55. Evaluate tab with Score option and a CSV file.
  56. Scores have been saved.
  57. Load and analyse score data using the Gnumeric spreadsheet.
  58. Distribution of scores displayed using Rattle.
  59. R command line under MS/Windows
  60. R Commander GUI
  61. R GUI using ESS for Emacs
  62. A simple time series plot of dates using traditional Rpackage[]graphics.
  63. A simple time series plot of dates using Rpackage[]ggplot2.
  64. An ordered monthly box plot.
  65. A approximate model of random data.
  66. Reduced example of an alternating decision tree.
  67. Audit risk chart from an alternating decision tree.
  68. Togaware's Rattle Gnome Data Mining interface.
  69. The Weka GUI chooser.
  70. Weka explorer viewing data.
  71. Import CSV data into Weka.
  72. Output from running J48 (C4.5).
  73. Fujitsu GhostMiner interface.
  74. Sample ODMiner interface to ODM.
  75. SAS Enterprise Miner interface (Version 4).
  76. Statistica Data Miner graphical interface.


Copyright © Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
PDF version is properly formatted and forms a comprehensive book (draft with over 700 pages).
Brought to you by Togaware. This page generated: Sunday, 13 September 2009