Ressa Isnaini Arumnisaa
Unknown Affiliation

Comparison of Ensemble Learning Method: Random Forest, Support Vector Machine, AdaBoost for Classification Human Development Index (HDI)
Ressa Isnaini Arumnisaa; Arie Wahyu Wijayanto
Sistemasi: Jurnal Sistem Informasi, Vol 12, No 1 (2023)
Publisher : Program Studi Sistem Informasi Fakultas Teknik dan Ilmu Komputer

DOI: 10.32520/stmsi.v12i1.2501

Abstract

Classification in supervised learning is a way to find patterns in data whose classes are already known. Within machine learning classification, an ensemble classifier combines multiple models with the aim of improving model accuracy and optimizing classification performance. This study compares algorithms that work with ensemble learning, including Random Forest, Support Vector Machine (SVM), and AdaBoost. The data used is the Human Development Index (HDI) of districts/cities in Indonesia. The other variables, which are strongly related to human development, are GRDP per capita, gross enrollment rate, net enrollment rate, labor force participation rate, unemployment rate, poverty rate, poverty depth, poverty severity, and average consumption per capita. HDI was chosen because, apart from being an important macroeconomic variable for describing the condition of human resources in Indonesia, it already has a clear classification defined by Badan Pusat Statistik (BPS), so supervised learning can be applied. The models are compared using accuracy, specificity, sensitivity, and the kappa statistic. The analysis flow starts with data preprocessing, resampling, and cross-validation, followed by modeling with the Random Forest, Support Vector Machine (SVM), and AdaBoost algorithms. The final stage is model evaluation, comparing the best models for classifying districts/cities according to HDI. The results showed that the Random Forest model had the best performance compared to the Support Vector Machine (SVM) and AdaBoost models, with an accuracy of 85.23%, specificity of 71.63%, sensitivity of 95.05%, and a kappa coefficient of 0.7698. Based on this research, an ensemble classifier can be developed to help classify Human Development Index scores in Indonesia.

Keywords: AdaBoost, Random Forest, Support Vector Machine, Ensemble Learning, Human Development Index
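
The four evaluation measures named in the abstract can all be derived from a confusion matrix. The following is a minimal sketch in pure Python, not the authors' code: the function name `evaluate`, the macro-averaging of sensitivity and specificity across classes, and the toy 3-class matrix are all assumptions made for illustration, and the numbers are not taken from the study.

```python
from typing import Dict, List


def evaluate(cm: List[List[int]]) -> Dict[str, float]:
    """Compute accuracy, macro sensitivity, macro specificity, and Cohen's kappa.

    cm[i][j] is the number of samples whose true class is i and whose
    predicted class is j. Assumes every class has at least one true sample
    and at least one sample belonging to another class.
    """
    k = len(cm)
    n = sum(sum(row) for row in cm)
    row_tot = [sum(cm[i]) for i in range(k)]                        # true counts per class
    col_tot = [sum(cm[i][j] for i in range(k)) for j in range(k)]   # predicted counts per class

    # Accuracy: share of samples on the diagonal.
    accuracy = sum(cm[i][i] for i in range(k)) / n

    # One-vs-rest sensitivity and specificity for each class.
    sens, spec = [], []
    for c in range(k):
        tp = cm[c][c]
        fn = row_tot[c] - tp
        fp = col_tot[c] - tp
        tn = n - tp - fn - fp
        sens.append(tp / (tp + fn))
        spec.append(tn / (tn + fp))

    # Cohen's kappa: observed agreement corrected for chance agreement p_e.
    p_e = sum(row_tot[c] * col_tot[c] for c in range(k)) / (n * n)
    kappa = (accuracy - p_e) / (1 - p_e)

    return {
        "accuracy": accuracy,
        "sensitivity": sum(sens) / k,   # macro average (assumption)
        "specificity": sum(spec) / k,   # macro average (assumption)
        "kappa": kappa,
    }


# Hypothetical confusion matrix for three classes (illustrative only).
toy_cm = [
    [8, 2, 0],
    [1, 7, 2],
    [0, 1, 9],
]
metrics = evaluate(toy_cm)
print({name: round(value, 4) for name, value in metrics.items()})
```

Kappa is useful alongside accuracy because it discounts the agreement a classifier would reach by chance given the class proportions, which matters when HDI categories are unevenly represented.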