Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Journal of Computer Networks, Architecture and High Performance Computing

The implementation of the Random Forest Algorithm with Resampling and Without Resampling on the Hepatitis C Disease Dataset Hendrayana, I Gede; Dewi, Ni Putu Dita Ariani Sukma; Aryasa, Jiyestha Aji Dharma; Prayoga, I Made Ade; Raharjo, Rizki Anom
Journal of Computer Networks, Architecture and High Performance Computing Vol. 7 No. 3 (2025): Articles Research July 2025
Publisher : Information Technology and Science (ITScience)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47709/cnahpc.v7i3.6089

Abstract

This study evaluates the performance of Random Forest models for Hepatitis C classification using a dataset from Kaggle, focusing on addressing class imbalance through resampling techniques. We compare three approaches: baseline Random Forest without resampling, Random Forest with SMOTE+ENN (Synthetic Minority Oversampling Technique + Edited Nearest Neighbors), and Random Forest with SMOTE+OSS (Synthetic Minority Oversampling Technique + One-Sided Selection). Results show that the baseline model achieved high accuracy (0.9837) but failed to detect minority classes (e.g., suspect Blood Donor recall=0.00). SMOTE+ENN significantly improved performance, achieving perfect classification (precision=1.00, recall=1.00) for Hepatitis, Fibrosis, and Cirrhosis, while maintaining high accuracy (0.9919) and ROC AUC (0.9999). In contrast, SMOTE+OSS showed limitations in detecting Hepatitis (recall=0.00) and yielded lower precision for Fibrosis (0.44), indicating its undersampling approach may be too aggressive. The study highlights SMOTE+ENN as the most effective method for balancing class distribution and enhancing model robustness in medical diagnostics. These findings underscore the importance of selecting appropriate resampling techniques to improve minority class detection in imbalanced datasets, with implications for developing reliable AI-based diagnostic tools for Hepatitis C.