Diagnosing Diabetes Mellitus Using Machine Learning Techniques
high for an extended length of time. It is a major cause of death with high mortality rates and the
second leading cause of total years lived with disability worldwide. Its seriousness comes from its
long-term complications, including nephropathy, retinopathy, and neuropathy leading to kidney
failure, poor vision and blindness, and peripheral sensory loss, respectively. Such conditions are
life-threatening and affect patients? quality of life. Therefore, this paper aims to identify the most
relevant features in the diagnosis of DM and identify the best classifier that can efficiently diagnose
DM based on a set of relevant features. To achieve this, four different feature selection methods
have been utilized. Moreover, twelve different classifiers that belong to six learning strategies
have been evaluated using two datasets and several evaluation metrics such as Accuracy, Precision,
Recall, F1-measure, and ROC area. The obtained results revealed that the correlation attribute
evaluation method would be the best choice to handle the task of feature selection and ranking
for the considered datasets, especially when considering the Accuracy metric. Furthermore, MultiClassClassifier
would be the best classifier to handle Diabetes datasets, especially when considering
True Positive, precision, and Recall metrics.