Missing Value Imputation Designs and Methods of Nature-Inspired Metaheuristic Techniques : A Systematic Review

Po Chan, Chiu and Ali, Selamat and Ondrej, Krejcar and Kuok, King Kuok and Siti Dianah, Abdul Bujang and Hamido, Fujita (2022) Missing Value Imputation Designs and Methods of Nature-Inspired Metaheuristic Techniques : A Systematic Review. IEEE Access, 10. pp. 61544-61566. ISSN 2169-3536

[img] PDF
Missing Value - Copy.pdf

Download (208kB)
Official URL: https://ieeexplore.ieee.org/document/9771309

Abstract

Missing values are highly undesirable in real-world datasets. The missing values should be estimated and treated during the preprocessing stage. With the expansion of nature-inspired metaheuristic techniques, interest in missing value imputation (MVI) has increased. The main goal of this literature is to identify and review the existing research on missing value imputation (MVI) in terms of nature-inspired metaheuristic approaches, dataset designs, missingness mechanisms, and missing rates, as well as the most used evaluation metrics between 2011 and 2021. This study ultimately gives insight into how the MVI plan can be incorporated into the experimental design. Using the systematic literature review (SLR) guidelines designed by Kitchenham, this study utilizes renowned scientific databases to retrieve and analyze all relevant articles during the search process. A total of 48 related articles from 2011 to 2021 were selected to assess the review questions. This review indicated that the synthetic missing dataset is the most popular baseline test dataset to evaluate the effectiveness of the imputation strategy. The study revealed that missing at random (MAR) is the most common proposed missing mechanism in the datasets. This review also indicated that the hybridizations of metaheuristics with clustering or neural networks are popular among researchers. The superior performance of the hybrid approaches is significantly attributed to the power of optimized learning in MVI models. In addition, perspectives, challenges, and opportunities in MVI are also addressed in this literature. The outcome of this review serves as a toolkit for the researchers to develop effective MVI models.

Item Type: Article
Uncontrolled Keywords: Missing value, missing data, imputation, incomplete dataset, metaheuristic, systematic review.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Gani
Date Deposited: 07 Sep 2022 02:26
Last Modified: 07 Sep 2022 02:26
URI: http://ir.unimas.my/id/eprint/39556

Actions (For repository members only: login required)

View Item View Item