Jurnal Masyarakat Informatika
Vol 16, No 2 (2025): November 2025

Comparative Evaluation of Machine Learning Algorithms with Data Balancing Approach and Hyperparameter Tuning in Predicting Thyroid Disorder Recurrence

Darnell Ignasius (Universitas Dian Nuswantoro)
Rhyan David Levandra (Universitas Dian Nuswantoro)
Ramadhan Rakhmat Sani (Universitas Dian Nuswantoro)
Ika Novita Dewi (Universitas Dian Nuswantoro)



Article Info

Publish Date
30 Nov 2025

Abstract

This research evaluates and compares the performance of five machine learning algorithms (Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, and Gradient Boosting) in predicting thyroid disease recurrence using patient data. The analysis was conducted on the Thyroid Disease Dataset from the UCI Machine Learning Repository. The methodology includes data preprocessing, normalization, and class balancing with the Synthetic Minority Over-sampling Technique (SMOTE). Additionally, hyperparameter tuning was conducted using GridSearchCV to optimize model performance. The results demonstrate that ensemble-based models, specifically Random Forest and Gradient Boosting, consistently outperform the other algorithms in terms of accuracy and robustness. These models achieve 95–96% accuracy across various scenarios.A key finding is that SMOTE significantly improves recall for minority classes, highlighting its value in imbalanced medical datasets.

Copyrights © 2025






Journal Info

Abbrev

jmasif

Publisher

Subject

Computer Science & IT

Description

JURNAL MASYARAKAT INFORMATIKA - JMASIF is a Journal published by the Department of Informatics, Universitas Diponegoro invites lecturers, researchers, students (Bachelor, Master, and Doctoral) as well as practitioners in the field of computer science and informatics to contribute to JMASIF in the ...