Memory-Based Sand Cat Swarm Optimization for Feature Selection in Medical Diagnosis
The rapid expansion of medical data poses numerous challenges for Machine Learning
(ML) tasks due to their potential to include excessive noisy, irrelevant, and redundant features. As
a result, it is critical to pick the most pertinent features for the classification task, which is referred
to as Feature Selection (FS). Among the FS approaches, wrapper methods are designed to select
the most appropriate subset of features. In this study, two intelligent wrapper FS approaches are
implemented using a new meta-heuristic algorithm called Sand Cat Swarm Optimizer (SCSO). First,
the binary version of SCSO, known as BSCSO, is constructed by utilizing the S-shaped transform
function to effectively manage the binary nature in the FS domain. However, the BSCSO suffers
from a poor search strategy because it has no internal memory to maintain the best location. Thus,
it will converge very quickly to the local optimum. Therefore, the second proposed FS method is
devoted to formulating an enhanced BSCSO called Binary Memory-based SCSO (BMSCSO). It has
integrated a memory-based strategy into the position updating process of the SCSO to exploit and
further preserve the best solutions. Twenty one benchmark disease datasets were used to implement
and evaluate the two improved FS methods, BSCSO and BMSCSO. As per the results, BMSCSO acted
better than BSCSO in terms of fitness values, accuracy, and number of selected features. Based on the
obtained results, BMSCSO as a FS method can efficiently explore the feature domain for the optimal
feature set.