Abstract
Missing value imputation (MVI) has been studied for several decades being the basic solution method for incomplete dataset problems, specifically those where some data samples contain one or more missing attribute values. This paper aims at reviewing and analyzing related studies carried out in recent decades, from the experimental design perspective. Altogether, 111 journal papers published from 2006 to 2017 are reviewed and analyzed. In addition, several technical issues encountered during the MVI process are addressed, such as the choice of datasets, missing rates and missingness mechanisms, and the MVI techniques and evaluation metrics employed, are discussed. The results of analysis of these issues allow limitations in the existing body of literature to be identified based upon which some directions for future research can be gleaned.
Original language | English |
---|---|
Pages (from-to) | 1487-1509 |
Number of pages | 23 |
Journal | Artificial Intelligence Review |
Volume | 53 |
Issue number | 2 |
DOIs | |
State | Published - 01 02 2020 |
Bibliographical note
Publisher Copyright:© 2019, Springer Nature B.V.
Keywords
- Data mining
- Imputation
- Incomplete dataset
- Missing values
- Supervised learning