Claim Missing Document
Check
Articles

Found 3 Documents
Search
Journal : Bulletin of Computer Science Research

Penerapan Metode ADASYN Dalam Mengatasi Imbalanced Data Untuk Klasifikasi Penyakit Stroke Menggunakan Support Vector Machine Alwaliyanto; Siska Kurnia Gusti; Iis Afrianty; Fadhilah Syafria
Bulletin of Computer Science Research Vol. 5 No. 4 (2025): June 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v5i4.612

Abstract

Stroke is one of the leading causes of death and disability worldwide, making it essential to develop classification models that can assist in early and accurate diagnosis. This study aims to implement the Support Vector Machine (SVM) algorithm with three types of kernels linear, polynomial, and Radial Basis Function (RBF) to classify stroke disease data. The Adaptive Synthetic Sampling (ADASYN) method is employed to address the class imbalance problem, while model training and evaluation are carried out using 5-Fold Cross-Validation to ensure stable and reliable results. The findings indicate that ADASYN successfully improves the model’s sensitivity to stroke cases (the minority class), as reflected by an increase in recall and F1-score, despite a slight decrease in overall accuracy a common trade-off in handling imbalanced data. The linear kernel (after ADASYN) achieved the best performance after imbalance handling, with an average AUC-ROC of 0.8333, recall of 0.7827, and F1-score of 0.2181 for the stroke class. Although the F1-score remains relatively low, it improved compared to the pre-ADASYN results, indicating better detection of stroke cases. The implementation was conducted using Google Colab, which also contributed to efficient data processing and visualization. Overall, the results demonstrate that the combination of SVM and ADASYN is effective in enhancing the model’s sensitivity to minority classes and is well-suited for medical data classification tasks, particularly in the early diagnosis of stroke using machine learning approaches.
Perbandingan Teknik Penyeimbang Kelas Pada Multi-Layer Perceptron (MLP) Berbasis Backpropagation Untuk Klasifikasi Diabetes Mellitus Robby Azhar; Siska Kurnia Gusti; Iis Afrianty; Elvia Budianita
Bulletin of Computer Science Research Vol. 5 No. 6 (2025): October 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v5i6.804

Abstract

Diabetes Mellitus (DM) is a chronic disease that can lead to serious complications if not detected early; therefore, early diagnosis is highly important. One of the methods that can be applied for early diagnosis is the classification technique in data mining. However, the classification process often faces challenges due to class imbalance, which can reduce model performance. This study aims to analyze the effect of class balancing techniques on the performance of the Backpropagation Neural Network (BPNN) in classifying DM cases. BPNN is a form of Multi-Layer Perceptron (MLP) with a simple structure and the ability to solve complex problems with good accuracy. The dataset used in this study is the Pima Indians Diabetes Dataset, consisting of 768 instances, including 500 non-diabetic and 268 diabetic cases. The research was conducted using three scenarios: without balancing, Synthetic Minority Over-sampling Technique (SMOTE), and Random Under Sampling (RUS). The BPNN model was designed with two architectural variations (one hidden layer and two hidden layers), three learning rate values (0.1, 0.01, and 0.001), and a varying number of neurons. The dataset was divided using the 10-Fold Cross Validation technique. The results show that applying SMOTE achieved the best performance, with an average accuracy of 90.89%, precision of 91.22%, recall of 90.89%, and F1-score of 90.89% on the BPNN architecture with one hidden layer. Furthermore, the single hidden layer architecture proved more stable than the two hidden layers, especially when the dataset size decreased due to RUS. Therefore, the combination of SMOTE and BPNN with one hidden layer provides better performance in classifying Diabetes Mellitus cases.
Penerapan Seleksi Fitur Information Gain dan Metode Backpropagation Neural Network Untuk Klasifikasi Atrisi Karyawan Dinyah Fithara; Elvia Budianita; Iis Afrianty; Siska Kurnia Gusti
Bulletin of Computer Science Research Vol. 6 No. 1 (2025): December 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v6i1.922

Abstract

Employee attrition management is a critical challenge for organizations as it involves costs, time, and the risk of decision-making errors. This problem requires a data-driven business strategy to achieve more accurate predictions of employees who are potentially at risk of termination. This study applies the Information Gain feature selection method and the Backpropagation Neural Network (BPNN) algorithm in the employee attrition classification process with the aim of increasing the accuracy and efficiency of the prediction model. BPNN is chosen due to its simpler architecture, faster training time, and greater stability for small to medium sized datasets.  With the assistance of Information Gain feature selection, BPNN is able to achieve optimal performance without requiring a complex architecture. The dataset used consist of 35 attributes and 1.470 employee records covering various factor such as age, income level, and employment status. The research stages include feature selection based on information gain values with specific thresholds, data partitioning using k-fold cross validation, and model training using BPNN with variations of learning rates and hidden neuron counts. The results show that the combination of Information Gain and BPNN improves classification accuracy compared to models without feature selection, achieving the highest average accuracy of 87.28% when using 25 selected attributes, with a BPNN configuration of learning rate 0.001, 35 hidden neurons, and 50 epochs. The attributes with the highest Information Gain score include JobLevel, OverTime, MaritalStatus, and MonthlyIncome. This study demonstrates that the proposed approach successfully enhances the prediction performance of employee attrition and can serve as a foundation for developing data-driven models that support employee retention efforts.