Journal of Computer Networks, Architecture and High Performance Computing
Vol. 7 No. 3 (2025): Articles Research July 2025

The implementation of the Random Forest Algorithm with Resampling and Without Resampling on the Hepatitis C Disease Dataset

Hendrayana, I Gede (Unknown)
Dewi, Ni Putu Dita Ariani Sukma (Unknown)
Aryasa, Jiyestha Aji Dharma (Unknown)
Prayoga, I Made Ade (Unknown)
Raharjo, Rizki Anom (Unknown)



Article Info

Publish Date
05 Jul 2025

Abstract

This study evaluates the performance of Random Forest models for Hepatitis C classification using a dataset from Kaggle, focusing on addressing class imbalance through resampling techniques. We compare three approaches: baseline Random Forest without resampling, Random Forest with SMOTE+ENN (Synthetic Minority Oversampling Technique + Edited Nearest Neighbors), and Random Forest with SMOTE+OSS (Synthetic Minority Oversampling Technique + One-Sided Selection). Results show that the baseline model achieved high accuracy (0.9837) but failed to detect minority classes (e.g., suspect Blood Donor recall=0.00). SMOTE+ENN significantly improved performance, achieving perfect classification (precision=1.00, recall=1.00) for Hepatitis, Fibrosis, and Cirrhosis, while maintaining high accuracy (0.9919) and ROC AUC (0.9999). In contrast, SMOTE+OSS showed limitations in detecting Hepatitis (recall=0.00) and yielded lower precision for Fibrosis (0.44), indicating its undersampling approach may be too aggressive. The study highlights SMOTE+ENN as the most effective method for balancing class distribution and enhancing model robustness in medical diagnostics. These findings underscore the importance of selecting appropriate resampling techniques to improve minority class detection in imbalanced datasets, with implications for developing reliable AI-based diagnostic tools for Hepatitis C.

Copyrights © 2025






Journal Info

Abbrev

CNAPC

Publisher

Subject

Computer Science & IT Education

Description

Journal of Computer Networks, Architecture and Performance Computing is a scientific journal that contains all the results of research by lecturers, researchers, especially in the fields of computer networks, computer architecture, computing. this journal is published by Information Technology and ...