Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Journal of Dinda : Data Science, Information Technology, and Data Analytics

Implementation of Random Forest Algorithm with RFE and SMOTE on Cardiotocography Dataset Nur Taqwimi, Muhammad Ahsani; Wahono, Buang Budi; Mulyo, Harminto
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 5 No 2 (2025): August
Publisher : Research Group of Data Engineering, Faculty of Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20895/dinda.v5i2.1818

Abstract

Having a healthy baby is a dream for mothers. However, the high rate of maternal and fetal mortality is still a serious problem, so more accurate fetal health monitoring is needed to prevent pregnancy complications. One of the devices used is Cardiotocography (CTG), which produces data on fetal conditions. The CTG dataset used in this study faces challenges in the form of class imbalance and a high number of features, which can reduce classification performance. This study aims to overcome these challenges by implementing the Random Forest algorithm combined with the Synthetic Minority Oversampling Technique (SMOTE) technique for class balancing and Recursive Feature Elimination (RFE) for feature selection. The dataset used is "Fetal Health Classification" from the Kaggle platform, which consists of 2,126 data with three classes: Normal, Suspect, and Pathological. The test results show that the RFE method is able to reduce the number of features from 22 to 18, while SMOTE increases the proportion of minority data. The model built produces good classification performance with an accuracy value of 95%, precision 93%, recall 89%, and F1-score 91%. The ROC-AUC value for the Normal class is 0.9881, Suspect 0.9789, and Pathological 0.9985. Although the model is able to predict the Normal and Pathological classes with high accuracy, the performance on the Suspect class still needs to be improved. Overall, the integration of Random Forest with SMOTE and RFE has proven effective in improving the accuracy of fetal health classification.
Heart Failure Classification Using a Hybrid Model Based on SVM and Random Forest Abdilllah, Muh Sajid; Mulyo, Harminto; Wibowo, Gentur Wahyu Nyipto
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 5 No 2 (2025): August
Publisher : Research Group of Data Engineering, Faculty of Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20895/dinda.v5i2.2001

Abstract

This study discusses the development of a model to classify heart failure disease by combining two algorithms in the field of data mining: Support Vector Machine (SVM) and Random Forest (RF). The dataset used is the Heart Failure Prediction Dataset, consisting of 918 patient records containing medical information such as blood pressure, cholesterol levels, and heart rate. The research process began with data cleaning, normalization using MinMaxScaler, and data balancing with the SMOTE technique to equalize the number of cases between heart failure patients and non-patients. The data was then split into training and testing sets. Each model (SVM and RF) was tested individually and also combined into a hybrid model. Validation was performed using 5-Fold Cross Validation to ensure consistent results. The results show that SVM performed better in terms of precision for detecting heart failure after applying SMOTE, while RF remained stable even without data balancing. The hybrid model combining both algorithms achieved the best performance, with an accuracy of 91.20%, precision of 90.85%, recall of 92.44%, and an AUC score of 0.961. These results indicate that the hybrid model can detect heart failure more accurately and in a more balanced manner. With its high and consistent performance, this model is suitable for use as a decision support system in the medical field, particularly for early detection of heart failure.