Instance selection by genetic-based biological algorithm

Zong Yao Chen, Chih Fong Tsai*, William Eberle, Wei Chao Lin, Shih Wen Ke

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)

Abstract

Instance selection is an important data pre-processing problem in data mining. Its aim is to reduce the size of a given dataset by filtering out noisy data, which may degrade mining performance. Genetic algorithms have proven to be an effective instance selection approach for improving the performance of data mining algorithms. However, current approaches pursue only the simplest evolutionary process, based on the simplest and most reasonable rules. In this paper, we introduce a novel instance selection algorithm, the genetic-based biological algorithm (GBA). GBA fits a “biological evolution” into the evolutionary process, in which the most streamlined process still complies with reasonable rules. In other words, after long-term evolution, organisms find the most efficient way to allocate resources and evolve. By closely simulating natural evolution in this way, the algorithm can be both efficient and effective. Our experiments compare GBA with five state-of-the-art algorithms over 50 datasets from different domains in the UCI Machine Learning Repository. The experimental results demonstrate that GBA outperforms these baselines, providing the lowest classification error rate and the smallest storage requirement. Moreover, GBA is computationally efficient, requiring only slightly more computation than a GA.
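To make the baseline idea concrete, the following is a minimal sketch of a plain genetic algorithm for instance selection, not the GBA itself, whose biological operators the abstract does not describe. Everything here is an assumption made for illustration: the function names, the bit-mask encoding (one bit per instance), the 1-nearest-neighbour fitness, and the weight `alpha` that trades accuracy against data reduction.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def one_nn_accuracy(data, kept):
    """Accuracy of 1-NN over all of `data`, using only the instances
    indexed by `kept` as neighbours. A point never votes for itself."""
    correct = 0
    for i, (x, y) in enumerate(data):
        candidates = [j for j in kept if j != i]
        if not candidates:
            continue  # no neighbour available: counted as an error
        nearest = min(candidates, key=lambda j: sq_dist(data[j][0], x))
        if data[nearest][1] == y:
            correct += 1
    return correct / len(data)


def select_instances(data, pop_size=20, generations=30,
                     p_mut=0.02, alpha=0.5, seed=0):
    """Plain GA over bit masks (assumed setup, not the paper's GBA).
    Returns the sorted indices of the instances to keep."""
    rng = random.Random(seed)
    n = len(data)

    def fitness(mask):
        kept = [i for i, keep in enumerate(mask) if keep]
        reduction = 1.0 - len(kept) / n
        return alpha * one_nn_accuracy(data, kept) + (1 - alpha) * reduction

    pop = [[rng.random() < 0.5 for _ in range(n)] for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]      # truncation selection
        children = [ranked[0][:]]              # elitism: keep the top mask
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)          # one-point crossover
            child = a[:cut] + b[cut:]
            # bit-flip mutation with probability p_mut per gene
            child = [(not g) if rng.random() < p_mut else g for g in child]
            children.append(child)
        pop = children
        best = max(pop + [best], key=fitness)
    return [i for i, keep in enumerate(best) if keep]


# Toy usage: two well-separated classes of ten points each.
toy = [((i * 0.1, 0.0), 0) for i in range(10)] + \
      [((5 + i * 0.1, 5.0), 1) for i in range(10)]
print(select_instances(toy))
```

The two terms of the assumed fitness mirror the two criteria the abstract reports, classification error and storage requirement. How GBA actually scores or evolves candidates is not stated here.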

Original language: English
Pages (from-to): 1269-1282
Number of pages: 14
Journal: Soft Computing
Volume: 19
Issue number: 5
Publication status: Published - May 2015
Externally published

Bibliographical note

Publisher Copyright:
© 2014, Springer-Verlag Berlin Heidelberg.
