Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Scientific Journal of Informatics

Optimization of Random Forest Algorithm with SMOTE Method to Improve the Accuracy of Early Diabetes Prediction Nisa, Siti Khoirun; Barata, Mula Agung; Yuwita, Pelangi Eka
Scientific Journal of Informatics Vol. 12 No. 3: August 2025
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v12i3.22986

Abstract

Purpose: This research aims to examine the performance of the random forest algorithm in diabetes risk classification with data balancing using the Synthetic Minority Oversampling Technique (SMOTE) method to improve the representation of minority classes and increase the prediction accuracy value. Methods: The study used the Behavioral Risk Factor Surveillance System (BRFSS) dataset, obtained from Kaggle, which contains health-related survey data used to identify individuals at risk of diabetes. The Random Forest algorithm was applied to classify diabetes. To balance the data, the SMOTE method was used. The model’s performance was evaluated using 10-fold cross-validation by comparing result before and after SMOTE. Result: The results showed that the application of the SMOTE method improved the performance of the Random Forest classification model, especially in minority classes. Model performance in minority classes without SMOTE had poor evaluation metrics with precision of 49%, recall of 18%, and F1-score of 26%. After applying SMOTE, these values increased to precision of 96%, recall of 88%, and F1-score of 92%. Representing improvements of 47 percentage points in precision, 70 points in recall, and 66 points F1-score. The overall accuracy of the Random Forest model also increased from 86% to 92%, showing a 6 percentage point improvement. Novelty: This study use integrating the Random Forest algorithm with the SMOTE technique and validating the results using 10-fold cross-validation. The combination significantly improves minority class prediction performance in early diabetes detection, addressing the common limitations of previous studies in handling imbalanced datasets effectively.
Implementation of ANN Optimization with SMOTE and Backward Elimination for PCOS Prediction Ilmiyah, Miftakhul; Barata, Mula Agung; Yuwita, Pelangi Eka
Scientific Journal of Informatics Vol. 12 No. 1: February 2025
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v12i1.22886

Abstract

by women, making it potentially fatal owing to delayed diagnosis and treatment. With the advent of current technology, machine learning and medical care may become associated with disease prediction. The purpose of the study is to predict PCOS using an Artificial Neural Network (ANN) Deep Learning algorithm combined with Synthetic Minority Oversampling Technique (SMOTE) for data balancing and backward elimination for feature selection, aiming to provide a more accurate diagnosis of PCOS with high accuracy from thoose combination. Methods: ANN algorithm structure with three hidden layers, each with a ReLU activation function of 128, 64, and 32 neurons, a Dropout layer, an output layer with a sigmoid activation function, and an Adam learning rate. Result: Using the SMOTE approach for data balance and backward elimination feature selection, the research attributes are reduced to 18. And ANN algorithm predicts PCOS disease achieve an accuracy of 92%. Novelty: This study uses an ANN algorithm model combined with the SMOTE data balancing technique and a feature selection method using backward elimination. These methods and techniques have proven to have high accuracy. The results of this study are expected to be used as a more accurate diagnosis by medical professionals in predicting PCOS disease.