Abstract
Feature selection is an important data preprocessing step for the construction of an effective bankruptcy prediction model. The prediction performance can be affected by the employed feature selection and classification techniques. However, there have been very few studies of bankruptcy prediction that identify the best combination of feature selection and classification techniques. In this study, two types of feature selection methods, including filter- and wrapper-based methods, are considered, and two types of classification techniques, including statistical and machine learning techniques, are employed in the development of the prediction methods. In addition, bagging and boosting ensemble classifiers are also constructed for comparison. The experimental results based on three related datasets that contain different numbers of input features show that the genetic algorithm as the wrapper-based feature selection method performs better than the filter-based one by information gain. It is also shown that the lowest prediction error rates for the three datasets are provided by combining the genetic algorithm with the naïve Bayes and support vector machine classifiers without bagging and boosting.
Original language | English |
---|---|
Article number | e12335 |
Journal | Expert Systems |
Volume | 36 |
Issue number | 1 |
DOIs | |
State | Published - 01 02 2019 |
Bibliographical note
Publisher Copyright:© 2018 John Wiley & Sons, Ltd
Keywords
- bankruptcy prediction
- data mining
- ensemble classifiers
- feature selection
- machine learning