摘要
The C4. 5 decision tree (DT) can be applied in various fields and discovers knowledge for huma can contain numerous features, not all features are beneficial for classification in C4. 5 algorithm. Therefore, a novel scatter search-based approach (SS + DT) is proposed to acquire optimal parameter settings and to select the beneficial subset of features that result in better classification results. To evaluate the efficiency of the proposed SS + DT approach, datasets in the UCI (University of California, Irvine) Machine Learning Repository are utilized to assess the performance of the proposed approach. Experimental results demonstrate that the parameter settings for the C4. 5 algorithm obtained by the SS + DT approach are better than those obtained by other approaches. When feature selection is considered, classification accuracy rates on most datasets are increased. Therefore, the proposed approach can be utilized to identify effectively the best parameter settings for C4. 5 algorithm and useful features.
原文 | 英語 |
---|---|
頁(從 - 到) | 63-75 |
頁數 | 13 |
期刊 | Soft Computing |
卷 | 16 |
發行號 | 1 |
DOIs | |
出版狀態 | 已出版 - 01 2012 |
對外發佈 | 是 |