Garuda - Garba Rujukan Digital

Building of Informatics, Technology and Science

Vol 6 No 2 (2024): September 2024

Nugraha, Najmi Cahaya (Unknown)
Hikmayanti, Hanny (Unknown)
Indra, Jamaludin (Unknown)
Juwita, Ayu Ratna (Unknown)

Publish Date
09 Sep 2024

It is estimated that at least 17 million Indonesians suffer from thyroid disorders. Interestingly, nearly 60% of those living with a thyroid disorder do not receive a diagnosis. Thus, it is necessary to carry out research that applies methods to predict thyroid disease. Before applying prediction methods, it is crucial to implement classification methods to obtain an accurate prediction model. However, to achieve optimal classification results and to avoid inaccuracies, a balance in the used data is required. Data imbalance is a condition where the ratio between classes in the data is uneven, which can result in the generated model becoming biased. The main objective of the research is to present a solution that can improve the accuracy of early detection of thyroid diseases through addressing data imbalance and implementing appropriate classification algorithms. The research methodology began with the collection and analysis of a dataset consisting of 9172 data points. Preprocessing was then performed, resulting in 5321 training data points and 1331 test data points. The testing phase employed 7 different classification algorithms with 7 different resampling methods and evaluation using a confusion matrix. This research achieved the highest accuracy rate of 98%, obtained from the combination of the Random Forest Algorithm and the Random Over Sampling method. It can be concluded that the combination of the Random Forest Algorithm with the Random Over Sampling resampling method can improve early detection accuracy for thyroid diseases.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Building of Informatics, Technology and Science

Website

Abbrev

bits

Publisher

Forum Kerjasama Pendidikan TInggi

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...

Article Info

Abstract

Implementasi Metode Resampling Dalam Menangani Data Imbalance Pada Klasifikasi Multiclass Penyakit Thyroid

Article Info

Abstract