Multilabel Classification Model for SDG Research Publications Using Multilevel and Hierarchical Approaches (Model Klasifikasi Multilabel pada Publikasi Penelitian SDG dengan Pendekatan Multilevel dan Hierarki)
Berliana Sugiarti Putri; Lya Hulliyyatus Suadaa; Efri Diah Utami
Jurnal Nasional Teknik Elektro dan Teknologi Informasi, Vol 14 No 1: February 2025
Publisher: Departemen Teknik Elektro dan Teknologi Informasi, Fakultas Teknik, Universitas Gadjah Mada

DOI: 10.22146/jnteti.v14i1.16265

Abstract

The growing number of research publications makes it harder to identify how those publications relate to implementation of the sustainable development goals (SDGs). Research publications have not yet been categorized into SDG levels. The Center for Research and Community Service (Pusat Penelitian dan Pengabdian Masyarakat, PPPM) at Politeknik Statistika (Polstat) STIS needs such a categorization to monitor lecturers' implementation of the SDGs. This study aimed to implement and evaluate problem transformation methods and machine learning classification algorithms, using multilevel and hierarchical approaches, to categorize research publications into SDG levels. The problem transformation methods were binary relevance, label powerset (LP), and classifier chains. The classification algorithms were logistic regression (LR) and support vector machine (SVM). Three inputs were compared: titles only, abstracts only, and titles combined with abstracts. The best filter model, which separated SDG from non-SDG publications, used titles with SVM and reached an accuracy of 0.8634. The best level model, which assigned publications to SDG levels, used titles, LP, and SVM with the multilevel approach. It classified data into the four pillars, goals, targets, and indicators of the SDGs with accuracies of 0.8067, 0.7501, 0.6792, and 0.6194, respectively. Although the other inputs carry more comprehensive information, title inputs yielded the best accuracy, which the results attribute to the simultaneous use of English and Indonesian. Future research can modify the model to use a single input language to optimize the term frequency-inverse document frequency (TF-IDF) process, so that words with the same meaning in different languages are not treated as distinct important terms.
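
The difference between two of the problem transformation methods named in the abstract can be illustrated with a small sketch. It is not the authors' code: the label names (`goal_1`, `goal_2`, `goal_3`), the four example publications, and all function names are hypothetical, and the sketch only shows how multilabel targets are reshaped before a classifier such as LR or SVM would be trained. Binary relevance turns the label sets into one 0/1 column per label, while label powerset treats each distinct combination of labels as a single class. Classifier chains are not shown.

```python
from typing import Dict, FrozenSet, List, Sequence, Set, Tuple


def binary_relevance_targets(
    label_sets: Sequence[Set[str]],
) -> Tuple[List[str], List[List[int]]]:
    """Binary relevance: one independent 0/1 target column per label."""
    labels = sorted({label for labels_i in label_sets for label in labels_i})
    matrix = [
        [1 if label in labels_i else 0 for label in labels]
        for labels_i in label_sets
    ]
    return labels, matrix


def label_powerset_targets(
    label_sets: Sequence[Set[str]],
) -> Tuple[Dict[FrozenSet[str], int], List[int]]:
    """Label powerset: each distinct label combination becomes one class id."""
    class_ids: Dict[FrozenSet[str], int] = {}
    targets: List[int] = []
    for labels_i in label_sets:
        key = frozenset(labels_i)
        # Assign the next free id the first time a combination is seen.
        class_ids.setdefault(key, len(class_ids))
        targets.append(class_ids[key])
    return class_ids, targets


def decode_powerset_class(
    class_ids: Dict[FrozenSet[str], int], class_id: int
) -> Set[str]:
    """Map a predicted powerset class id back to its label set."""
    for labels, cid in class_ids.items():
        if cid == class_id:
            return set(labels)
    raise KeyError(class_id)


# Hypothetical goal-level labels for four publications (illustrative only).
publications = [
    {"goal_1", "goal_2"},
    {"goal_3"},
    {"goal_1", "goal_2"},
    {"goal_2"},
]

labels, br_matrix = binary_relevance_targets(publications)
lp_classes, lp_targets = label_powerset_targets(publications)

print(labels)      # ['goal_1', 'goal_2', 'goal_3']
print(br_matrix)   # [[1, 1, 0], [0, 0, 1], [1, 1, 0], [0, 1, 0]]
print(lp_targets)  # [0, 1, 0, 2]
```

In this sketch, binary relevance would train three separate binary classifiers (one per column), whereas label powerset would train a single multiclass classifier over three classes and decode its prediction back into a label set with `decode_powerset_class`.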