Data Mining: A Preprocessing Engine
This study is emphasized on different types of normalization. Each of which was tested against the ID3 methodology using the HSV data set. Number of leaf nodes, accuracy and tree growing time are three factors that were taken into account. Comparisons between different learning methods were accomplished as they were applied to each normalization method. A new matrix was designed to check for the best normalization method based on the factors and their priorities. Recommendations were concluded.
Copyright: © 2006 Luai A. Shalabi, Zyad Shaaban and Basel Kasasbeh. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 6,685 Views
- 5,648 Downloads
- 326 Citations
- induction decision tree
- rules generation