Sinkron : Jurnal dan Penelitian Teknik Informatika
Vol. 9 No. 1 (2025): Research Article, January 2025

Thyroid Disease Prediction Using Random Forest with KNNImputer for Missing Values

Pratama, Raffy Nicandra Putra (Unknown)
Winarno, Sri (Unknown)
Wijaya, Tan Nicholas Octavian (Unknown)



Article Info

Publish Date
08 Jan 2025

Abstract

Thyroid disease is a health dysfunction that requires immediate and accurate diagnosis. This research seeks to design a classification model based on the Random Forest algorithm to detect the type of thyroid disease utilizing data from the UCI Repository. In the data processing stage, KNNImputer is used to handle missing data by calculating the average value of the nearest neighbors based on Euclidean distance, thus ensuring better data quality for model training. The developed model was evaluated utilizing the confusion matrix, which showed an accuracy of 98%, with precision, recall, and F1 score values ​​reached 98% based on weighted avg.These results corroborate that the proposed model is highly reliable in detecting various types of thyroid diseases, such as Negative, Hypothyroid, and Hyperthyroid. This research makes an important contribution to the application of data mining technology for medical diagnosis, while proving that optimal data processing through methods such as KNN Imputer can significantly improve model performance.

Copyrights © 2025






Journal Info

Abbrev

sinkron

Publisher

Subject

Computer Science & IT

Description

Scope of SinkrOns Scientific Discussion 1. Machine Learning 2. Cryptography 3. Steganography 4. Digital Image Processing 5. Networking 6. Security 7. Algorithm and Programming 8. Computer Vision 9. Troubleshooting 10. Internet and E-Commerce 11. Artificial Intelligence 12. Data Mining 13. Artificial ...