Jurnal Teknologi dan Sistem Komputer
Volume 8, Issue 4, Year 2020 (October 2020)

Tree-based homogeneous ensemble model with feature selection for diabetic retinopathy prediction

Tamunopriye Ene Dagogo-George (Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin)
Hammed Adeleye Mojeed (Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin)
Abdulateef Oluwagbemiga Balogun (Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin)
Modinat Abolore Mabayoje (Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin)
Shakirat Aderonke Salihu (Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin)



Article Info

Publish Date
31 Oct 2020

Abstract

Diabetic Retinopathy (DR) is a condition that emerges from prolonged diabetes, causing severe damages to the eyes. Early diagnosis of this disease is highly imperative as late diagnosis may be fatal. Existing studies employed machine learning approaches with Support Vector Machines (SVM) having the highest performance on most analyses and Decision Trees (DT) having the lowest. However, SVM has been known to suffer from parameter and kernel selection problems, which undermine its predictive capability. Hence, this study presents homogenous ensemble classification methods with DT as the base classifier to optimize predictive performance. Boosting and Bagging ensemble methods with feature selection were employed, and experiments were carried out using Python Scikit Learn libraries on DR datasets extracted from UCI Machine Learning repository. Experimental results showed that Bagged and Boosted DT were better than SVM. Specifically, Bagged DT performed best with accuracy 65.38 %, f-score 0.664, and AUC 0.731, followed by Boosted DT with accuracy 65.42 %, f-score 0.655, and AUC 0.724 when compared to SVM (accuracy 65.16 %, f-score 0.652, and AUC 0.721). These results indicate that DT's predictive performance can be optimized by employing the homogeneous ensemble methods to outperform SVM in predicting DR.

Copyrights © 2020






Journal Info

Abbrev

JTSISKOM

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

Jurnal Teknologi dan Sistem Komputer (JTSiskom, e-ISSN: 2338-0403) adalah terbitan berkala online nasional yang diterbitkan oleh Departemen Teknik Sistem Komputer, Universitas Diponegoro, Indonesia. JTSiskom menyediakan media untuk mendiseminasikan hasil-hasil penelitian, pengembangan dan ...