Missing value imputation: a review and analysis of the literature (2006–2017)

Wei Chao Lin, Chih Fong Tsai*

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

312 Scopus citations

Abstract

Missing value imputation (MVI) has been studied for several decades being the basic solution method for incomplete dataset problems, specifically those where some data samples contain one or more missing attribute values. This paper aims at reviewing and analyzing related studies carried out in recent decades, from the experimental design perspective. Altogether, 111 journal papers published from 2006 to 2017 are reviewed and analyzed. In addition, several technical issues encountered during the MVI process are addressed, such as the choice of datasets, missing rates and missingness mechanisms, and the MVI techniques and evaluation metrics employed, are discussed. The results of analysis of these issues allow limitations in the existing body of literature to be identified based upon which some directions for future research can be gleaned.

Original languageEnglish
Pages (from-to)1487-1509
Number of pages23
JournalArtificial Intelligence Review
Volume53
Issue number2
DOIs
StatePublished - 01 02 2020

Bibliographical note

Publisher Copyright:
© 2019, Springer Nature B.V.

Keywords

  • Data mining
  • Imputation
  • Incomplete dataset
  • Missing values
  • Supervised learning

Fingerprint

Dive into the research topics of 'Missing value imputation: a review and analysis of the literature (2006–2017)'. Together they form a unique fingerprint.

Cite this