Garuda - Garba Rujukan Digital

p-Index (2021 - 2026): 2.439

This author has published in the following journals:

JOIN (Jurnal Online Informatika); Journal of Machine Learning and Soft Computing; Jurnal Keuangan dan Akuntansi Terapan (KUAT); Jurnal Dharmabakti Nagri; International Journal of Information Technology and Computer Science Applications (IJITCSA); Jurnal Informatika: Jurnal Pengembangan IT; Prosiding Seminar Nasional CORISINDO; Jurnal Riset Informatika dan Teknologi Informasi (JRITI)

Tb Ai Munandar

Unknown Affiliation

Author-ID : 644211

Humanities; Computer Science & IT; Decision Sciences, Operations Research & Management; Economics, Econometrics & Finance; Education; Engineering; Social Sciences; Other

Published: 17 Documents

Articles

K-Means-Based Pseudo-Labeling Technique in Supervised Learning Models for Regional Classification Based on Types of Non-Communicable Diseases
Surbakti, Herison; Munandar, Tb Ai
JOIN (Jurnal Online Informatika) Vol 10 No 2 (2025)
Publisher : Department of Informatics, UIN Sunan Gunung Djati Bandung

DOI: 10.15575/join.v10i2.1609

Non-Communicable Diseases (NCDs) pose a critical threat to global public health, with Indonesia experiencing significant challenges due to high mortality rates and uneven regional distribution. In Banten Province, limited access to labeled health data hampers effective, data-driven intervention strategies. This study proposes a semi-supervised learning approach to develop a regional classification model for NCDs. The methodology begins with K-Means clustering applied to data from 254 community health centers (Puskesmas) to generate pseudo-labels. Various cluster configurations (k=2 to 8) were evaluated, with the optimal result being two clusters based on a silhouette score of 0.735. These clusters were then used to create a semi-labeled dataset for supervised learning. Eight classification algorithms—CN2 Rule Inducer, k-Nearest Neighbor (kNN), Logistic Regression, Naïve Bayes, Neural Network, Random Forest, Support Vector Machine (SVM), and Decision Tree—were trained and compared. Among them, the Neural Network model achieved the highest performance, with an AUC of 0.999 and an MCC of 0.976, indicating excellent stability and predictive accuracy. The findings validate the effectiveness of semi-supervised learning for health classification tasks when labeled data is scarce. This approach can serve as a valuable decision-support tool for regional health planning and targeted interventions, enhancing the precision and efficiency of public health responses.
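
The pipeline described in the abstract (K-Means pseudo-labels, a silhouette-based choice of k, then a supervised classifier) can be sketched roughly as follows. This is a minimal standard-library Python illustration, not the authors' implementation: the toy data, the farthest-first initialization, the function names, and the 1-nearest-neighbour classifier (standing in for the kNN model the abstract lists) are all assumptions made for the example.

```python
import math


def kmeans(points, k, iters=50):
    """Lloyd's algorithm with a deterministic farthest-first initialization
    (an assumption of this sketch). Returns one cluster index per point."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points,
                           key=lambda p: min(math.dist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point joins its nearest center.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient over all points."""
    clusters = sorted(set(labels))
    if len(clusters) < 2:
        return -1.0
    total = 0.0
    for i, p in enumerate(points):
        mean_dist = {}
        for c in clusters:
            others = [q for j, q in enumerate(points)
                      if labels[j] == c and j != i]
            if others:
                mean_dist[c] = sum(math.dist(p, q) for q in others) / len(others)
        own = labels[i]
        if own not in mean_dist:
            continue  # singleton cluster contributes a silhouette of 0
        a = mean_dist[own]
        b = min(d for c, d in mean_dist.items() if c != own)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def pseudo_label(points, k_values=range(2, 9)):
    """Try each k and keep the clustering with the best silhouette score."""
    best = None
    for k in k_values:
        labels = kmeans(points, k)
        score = silhouette(points, labels)
        if best is None or score > best[1]:
            best = (k, score, labels)
    return best


def predict_1nn(train_points, train_labels, query):
    """Supervised step: 1-nearest-neighbour trained on the pseudo-labels."""
    nearest = min(range(len(train_points)),
                  key=lambda i: math.dist(train_points[i], query))
    return train_labels[nearest]


# Hypothetical toy data: two well-separated grids of 12 points each.
blob = [(float(x), float(y)) for x in range(4) for y in range(3)]
points = blob + [(x + 20.0, y + 20.0) for x, y in blob]

best_k, best_score, pseudo = pseudo_label(points)
print(best_k, round(best_score, 3))
print(predict_1nn(points, pseudo, (1.2, 0.9)))
```

The cluster indices returned by `pseudo_label` play the role of the pseudo-labels; any of the classifiers named in the abstract could replace `predict_1nn` in the supervised step.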

Co-Authors: Damara, Rian Dwi Budi Srisulistiowati Dwipa Handayani Fathurrazi, Ahmad Kapriadi, Engkap Karyaningsih, Dentik Khairunnisa Fadhilla Ramdhania Kristian Vieri, Jhon Muhammad Fairuzabadi Noeman, Achmad Noe’man, Achmad No’eman, Achmad Pratama Yusuf, Ajif Yunizar Priatna, Wowon Primanda, Ferdy Hartanto Primanda Rahmaddyan, Reyhan Tri Ramdhania, Khairunnisa Fadhilla Retno Wulandari Rizki Surya Pratama, Daffa Sani, Ardila Sri Lestari, Tyastuti Suhendar, Akip Surbakti, Herison
