Feature selection in single and ensemble learning-based bankruptcy prediction models

Wei Chao Lin, Yu Hsin Lu, Chih Fong Tsai*

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

48 Scopus citations

Abstract

Feature selection is an important data preprocessing step for the construction of an effective bankruptcy prediction model. The prediction performance can be affected by the employed feature selection and classification techniques. However, there have been very few studies of bankruptcy prediction that identify the best combination of feature selection and classification techniques. In this study, two types of feature selection methods, including filter- and wrapper-based methods, are considered, and two types of classification techniques, including statistical and machine learning techniques, are employed in the development of the prediction methods. In addition, bagging and boosting ensemble classifiers are also constructed for comparison. The experimental results based on three related datasets that contain different numbers of input features show that the genetic algorithm as the wrapper-based feature selection method performs better than the filter-based one by information gain. It is also shown that the lowest prediction error rates for the three datasets are provided by combining the genetic algorithm with the naïve Bayes and support vector machine classifiers without bagging and boosting.

Original languageEnglish
Article numbere12335
JournalExpert Systems
Volume36
Issue number1
DOIs
StatePublished - 01 02 2019

Bibliographical note

Publisher Copyright:
© 2018 John Wiley & Sons, Ltd

Keywords

  • bankruptcy prediction
  • data mining
  • ensemble classifiers
  • feature selection
  • machine learning

Fingerprint

Dive into the research topics of 'Feature selection in single and ensemble learning-based bankruptcy prediction models'. Together they form a unique fingerprint.

Cite this