Claim Missing Document
Check
Articles

Found 30 Documents
Search

Prediksi Churn Pelanggan Telekomunikasi dengan Optimalisasi Seleksi Fitur dan Tuning Hyperparameter pada Algoritma Klasifikasi C4.5 Antoh, Soterio; Herteno, Rudy; Budiman, Irwan; Kartini, Dwi; Mazdadi, Muhammad Itqan
Jurnal Sistem Informasi Bisnis Vol 15, No 1 (2025): Volume 15 Number 1 Year 2025
Publisher : Diponegoro University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.14710/vol15iss1pp60-67

Abstract

In the telecommunications industry, predicting customer churn is crucial for maintaining business sustainability. High churn rates can negatively impact profitability, necessitating effective retention strategies. This research aims to enhance the accuracy of telecommunications customer churn prediction by optimizing the C4.5 classification algorithm through feature selection and hyperparameter tuning. The methods used include Information Gain for feature selection and hyperparameter tuning with Random Search and Grid Search. This study utilizes the Telco Customer Churn dataset from Kaggle, split into an 80:20 ratio for training and testing data. Six approaches are applied: (1) the basic C4.5 algorithm, (2) C4.5 with Information Gain, (3) C4.5 with Random Search, (4) C4.5 with Grid Search, (5) C4.5 with a combination of Information Gain and Random Search, and (6) C4.5 with a combination of Information Gain and Grid Search. The results indicate that the C4.5 algorithm alone achieves an accuracy of 74.09%, while applying Information Gain increases accuracy to 78.42%. Hyperparameter tuning with Random Search achieves the highest accuracy of 80.05%, whereas Grid Search reaches 77.71%. Combining Information Gain with Random Search results in an accuracy of 78.99%, while combining Information Gain with Grid Search yields an accuracy of 78.85%. These findings suggest that hyperparameter tuning using Random Search significantly improves accuracy compared to other methods, while Information Gain feature selection does not have a significant impact on performance in this context.
Implementation of PPCA Imputation, SMOTE-N Class Balancing in Hepatitis Classification Using Naïve Bayes Fathmah, Siti; Kartini, Dwi; Abadi, Friska; Budiman, Irwan; Mazdadi, Muhammad Itqan
JUITA: Jurnal Informatika JUITA Vol. 12 No. 2, November 2024
Publisher : Department of Informatics Engineering, Universitas Muhammadiyah Purwokerto

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30595/juita.v12i2.21528

Abstract

The availability of complete data in research is crucial, especially in the initial stages. The Hepatitis data used in this study encountered issues such as missing data and class imbalance, which hindered its optimal utilization. The method employed to address missing data was the PPCA imputation method. After filling in the missing data, the data was balanced using the SMOTE-N class balancing method and classified using Gaussian Naïve Bayes. The aim of this research was to compare the classification evaluation of hepatitis disease using Naive Bayes with the PPCA imputation approach and SMOTE-N class balancing. The best results from each scenario yielded an AUC value of 0.833 in the first scenario with an 80:20 data split for training and testing, and 0.875 in the second scenario with a 90:10 data split. The highest AUC value was obtained in the application of PPCA imputation with SMOTE-N class balancing using Naive Bayes classification. This demonstrates that the implementation of PPCA imputation with SMOTE-N class balancing has a better impact on the performance of Naïve Bayes classification.
Image Classification of Traditional Indonesian Cakes Using Convolutional Neural Network (CNN) Azizah, Azkiya Nur; Budiman, Irwan; Indriani, Fatma; Faisal, M. Reza; Herteno, Rudy
Computer Engineering and Applications Journal (ComEngApp) Vol. 13 No. 2 (2024)
Publisher : Universitas Sriwijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Indonesia is one of the countries famous for its traditional culinary. Traditional cakes in Indonesia are traditional snacks typical of the archipelago's culture which have a variety of textures, shapes, colors that vary and some are similar so that there are still many people who do not know the name of the cake from the many types of traditional Indonesian cakes. The problem can be solved by creating a traditional cake image recognition system that can be programmed and trained to classify various types of traditional Indonesian cakes. The Convolutional Neural Network method with the AlexNet architecture model is used in this research to predict various kinds of traditional Indonesian cakes. The dataset used in this research is 1846 datasets with 8 classes of cake images. This study trained the AlexNet model with several optimizers, namely, Adam optimizer, SGD, and RMSprop. The best parameters from the model testing results are at batchsize 16, epoch 50, learning rate 0.01 for SGD optimizer and learning rate 0.001 for Adam and RMSprop optimizers. Each optimizer tested produces different accuracy, precision, recall, and f1_score values. The highest test results that have been carried out on the image dataset of typical Indonesian traditional cakes are obtained by the Adam optimizer with an accuracy value of 79%.
Implementation of Chi-Square Feature Selection for Parkinson’s Disease Classification Using LightGBM Ahdyani, Annisa Salsabila; Budiman, Irwan; Kartini, Dwi; Farmadi, Andi; Mazdadi, Muhammad Itqan
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 19, No 3 (2025): July
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.107881

Abstract

Penyakit Parkinson merupakan penyakit yang disebabkan oleh kerusakan sel saraf otak dan termasuk penyakit yang jumlah kasusnya meningkat pesat di dunia. Salah satu cara yang dapat dilakukan untuk mencegah meningkatnya kasus penyakit Parkinson adalah dengan melakukan diagnosis melalui metode klasifikasi dengan pendekatan pembelajaran algoritmik. Penelitian ini mengimplementasikan teknik Chi-Square untuk pendekatan pemilihan fitur yang relevan dengan algoritma Light Gradient Boosting Machine (LightGBM) dalam klasifikasi penyakit Parkinson. Pemilihan fitur Chi-Square bertujuan untuk mengurangi fitur yang kurang relevan sehingga dapat meningkatkan hasil kinerja model. Selain itu, metode SMOTE diterapkan untuk menangani ketidakseimbangan data dan penyetelan hiperparameter guna menentukan kombinasi parameter yang optimal. Pengujian dilakukan terhadap sepuluh variasi jumlah fitur, dengan hasil terbaik diperoleh dengan menggunakan 200 fitur yang menghasilkan akurasi sebesar 96,05%. Dengan menggunakan metode Chi-Square, kinerja model LightGBM meningkat dibandingkan dengan kinerja tanpa pemilihan fitur. Penerapan kombinasi metode ini dapat meningkatkan kinerja model klasifikasi secara signifikan dan berpotensi untuk diterapkan dalam sistem pendukung diagnosis penyakit Parkinson.
Multimodal Biometric Recognition Based on Fusion of Electrocardiogram and Fingerprint Using CNN, LSTM, CNN-LSTM, and DNN Models Agustina, Winda; Nugrahadi, Dodon Turianto; Faisal, Mohammad Reza; Saragih, Triando Hamonangan; Farmadi, Andi; Budiman, Irwan; Parenreng, Jumadi Mabe; Alkaff, Muhammad
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 4 (2025): JUTIF Volume 6, Number 4, Agustus 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.4.5098

Abstract

Biometric authentication offers a promising solution for enhancing the security of digital systems by leveraging individuals' unique physiological characteristics. This study proposes a multimodal authentication system using deep learning approaches to integrate fingerprint images and electrocardiogram (ECG) signals. The datasets employed include FVC2004 for fingerprint data and ECG-ID for ECG signals. Four deep learning architectures—Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), CNN-LSTM, and Deep Neural Network (DNN)—are evaluated to compare their effectiveness in recognizing individual identity based on fused multimodal features. Feature extraction techniques include grayscale conversion, binarization, edge detection, minutiae extraction for fingerprint images, and R-peak–based segmentation for ECG signals. The extracted features are combined using a feature-level fusion strategy to form a unified representation. Experimental results indicate that the CNN model achieves the highest classification accuracy at 96.25%, followed by LSTM and DNN at 93.75%, while CNN-LSTM performs the lowest at 11.25%. Minutiae-based features consistently yield superior results across different models, highlighting the importance of local feature descriptors in fingerprint-based identification tasks. This research advances biometric authentication by demonstrating the effectiveness of feature-level fusion and CNN architecture for accurate and robust identity recognition. The proposed system shows strong potential for secure and adaptive biometric authentication in modern digital applications.
Implementation of Extra Trees Classifier and Chi-Square Feature Selection for Early Detection of Liver Disease Al Ghifari, Muhammad Akmal; Budiman, Irwan; Saragih, Triando Hamonangan; Mazdadi, Muhammad Itqan; Herteno, Rudy; Rozaq, Hasri Akbar Awal
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4261

Abstract

The imbalanced distribution of medical data poses challenges in accurately detecting liver disease, which is crucial as symptoms often remain unnoticed until advanced stages. This study examines the application of the Extra Trees Classifier algorithm and chi-square feature selection for early detection of liver disease. Compared to traditional methods like Random Forest and SVM, the Extra Trees Classifier offers enhanced computational efficiency and better handling of imbalanced datasets, while chi-square feature selection helps identify the most relevant medical indicators. The data consists of five medical variables likely to be laboratory test results from patient samples, with labels indicating classes A and B. The data is randomly divided with a ratio of 80% for each class. To address data imbalance, SMOTE technique was applied before the data was randomly split into a ratio of 80% for training and 20% for testing to ensure effective learning and testing of the model's performance. The results showed that with the help of chi-square feature selection, the Extra Trees Classifier algorithm could provide fairly accurate predictions in liver disease classification, with an accuracy of 82.6%, sensitivity of 85.5%, precision of 78.3%, and F1-Score of 81.7%. These results demonstrate significant improvement over existing methods, and the proposed approach can aid healthcare practitioners in making timely diagnostic decisions, potentially reducing mortality rates through early intervention in liver disease cases.
Game Development of Banjar Archive for Interactive Cultural Education Ultilizing Large Language Models Adi Mu'Ammar, Rifqi; Abadi, Friska; Budiman, Irwan; Adi Nugroho, Radityo; Turianto Nugrahadi, Dodon
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Vol. 10, No. 4, November 2025
Publisher : Universitas Muhammadiyah Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22219/kinetik.v10i4.2294

Abstract

The preservation of Banjar cultural heritage is threatened by globalization and the fading interest of younger generations. This study addressed these challenges by developing an interactive educational game using the Game Development Life Cycle (GDLC) framework and integrating Large Language Models (LLMs) for adaptive and immersive player interactions. The six stages of GDLC namely initiation, pre-production, production, testing, beta, and release were systematically applied, resulting in a game that blends dynamic narratives to engage players while educating them about Banjar culture. Black Box Testing verified 14 test scenarios that all passed successfully, ensuring system stability and reliability. Additionally, user experience evaluation using the Game Experience Questionnaire (GEQ) highlighted high levels of immersion (4.936), competence (4.448), flow (3.124) and positive affect (4.976) among players, with minimal reported tension (1), challenge (1.744) and negative affect (1.07). These results demonstrated that the game successfully balances educational goals with engaging gameplay, fostering meaningful connections to Banjar heritage. By leveraging LLM technology, the game enhances interactivity, offering an innovative approach to Banjar cultural preservation in the digital era. This research extends the existing body of knowledge on AI-driven gamification strategies in heritage conservation with a specific focus on Banjar culture.
K-Modes Clustering untuk Mengetahui Jenis Masakan Daerah yang Populer pada Website Resep Online (Studi Kasus: Masakan Banjar di cookpad.com) Indriani, Fatma; Budiman, Irwan
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 4 No 4: Desember 2017
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (1144.909 KB) | DOI: 10.25126/jtiik.201744548

Abstract

AbstrakPada makalah ini dipaparkan clustering pada data resep masakan daerah Banjar untuk mengetahui jenis makanan yang paling banyak di-post secara online oleh pengguna website recipe sharing. Pertama-tama data resep sebanyak 355 dikumpulkan dari suatu website resep, untuk selanjutnya dilakukan ekstraksi data bahan dan pembersihan. Metode clustering yang dipilih adalah k-modes karena cocok digunakan pada data kategorikal. Berdasar metode Elbow, jumlah cluster yang ideal adalah k=4 dan k=8. Jumlah cluster k=4 menghasilkan kelompok yang lebih umum, sedangkan k=8 menghasilkan kelompok yang lebih spesifik. Adapun kelompok yang berhasil diidentifikasi untuk k=4 adalah sayur asam, soto banjar, masakan gurih lain-lain, kue dan bubur manis. Sedangkan kelompok dengan jumlah cluster k=8 adalah sayur asam, soto banjar, kue basah, masakan gurih lain-lain, masak habang, bubur manis, kuah ketupat, dan masakan gurih asam. Evaluasi nilai purity menunjukkan nilai masing-masing 0,825 untuk k=4 dan 0,831 untuk k=8.Kata kunci: data mining, clustering, k-modes, resep masakan, bahanAbstractIn this paper, we cluster user-submitted recipes of Banjar regional cuisine to find out which type of cuisine are popular according to its ingredients. 355 recipes are collected from a recipe sharing website, then the ingredients extracted and cleaned. The clustering method chosen is k-modes because it is suitable for categorical data. Based on the Elbow method, the ideal number of clusters is k = 4 and k = 8. The number of clusters k = 4 produces more general cuisines group, whereas k = 8 produces more specific groups. The groups identified for k = 4 are (1) “sayur asam” (sour soup), (2)“soto banjar” (Banjar chicken soup), (3) savory dishes, and (4) sweet dishes. While the group with the number of clusters k = 8 consists of (1)“sayur asam” (sour soup)  (2) “soto banjar”, (3) Banjar sweet puddings, (4) various savory dishes, (5) “masak habang” (Banjar sweet chili dishes), (6) sweet porridge, (7) “kuah ketupat” (spicy coconut soup) and (8) various savory sour dishes. The purity of clusters are shown to be 0.825 for k=4 and 0.831 for k=8.Keywords: clustering, k-modes, data mining, recipe, ingredient
Kombinasi Seleksi Fitur Berbasis Filter dan Wrapper Menggunakan Naive Bayes pada Klasifikasi Penyakit Jantung Azizah, Siti Roziana; Herteno, Rudy; Farmadi, Andi; Kartini, Dwi; Budiman, Irwan
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 10 No 6: Desember 2023
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25126/jtiik.2023107467

Abstract

Penyakit jantung menjadi salah satu penyebab utama kematian bersama dengan penyakit lainnya. Dalam bidang teknologi, data mining dapat digunakan untuk mendiagnosa suatu penyakit yang bersumber dari data rekam medis pasien. Pada klasifikasi dataset medis, Naive Bayes merupakan salah satu metode terbaik yang digunakan. Tujuan dari penelitian ini adalah untuk mengetahui perbandingan hasil akurasi dari Naive Bayes menggunakan beberapa seleksi fitur yaitu Forward Selection, Backward Elimination, kombinasi union hasil seleksi fitur Forwad Selection dan Backward Elimination, Information Gain, Gain Ratio, dan kombinasi union hasil seleksi fitur Information Gain dengan Gain Ratio. Data yang digunakan dalam penelitian ini adalah data penyakit jantung yang didapatkan dari UCI Machine Learning Repository. Dari implementasi pemodelan yang akan dilakukan menghasilkan nilai akurasi tertinggi sebesar 91.80% pada algoritma Naive Bayes dengan kombinasi union hasil seleksi fitur Information Gain dan Gain Ratio menggunakan perbandingan data latih dan data uji 80:20. Sedangkan akurasi Naive Bayes dengan kombinasi union hasil seleksi fitur Forward Selection dan Backward Elimination hanya memiliki nilai akurasi sebesar 83.61%   Abstract Heart disease is one of the leading causes of death along with other diseases. In the field of technology, data mining can be used to diagnose a disease sourced from patient medical record data. In the classification of medical datasets, Naive Bayes is one of the best methods used. The purpose of this study is to determine the comparison of the accuracy results of Naive Bayes using several feature selections, namely Forward Selection, Backward Elimination, a combination of union of Forwad Selection and Backward Elimination feature selection results, Information Gain, Gain Ratio, and a combination of union of Information Gain feature selection results with Gain Ratio. The data used in this research is heart disease data obtained from the UCI Machine Learning Repository. From the implementation of modeling that will be carried out, the highest accuracy value is 91.80% in the Naive Bayes algorithm with a combination of union of Information Gain and Gain Ratio feature selection results using a ratio of training data and test data of 80:20. While the accuracy of Naive Bayes with a combination of union selection results of Forward Selection and Backward Elimination features only has an accuracy value of 83.61%.  
Gender Classification Based on Electrocardiogram Signals Using Long Short Term Memory and Bidirectional Long Short Term Memory Halim, Kevin Yudhaprawira; Nugrahadi, Dodon Turianto; Faisal, Mohammad Reza; Herteno, Rudy; Budiman, Irwan
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika Vol. 9 No. 3 (2023): September
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/jiteki.v9i3.26354

Abstract

Gender classification by computer is essential for applications in many domains, such as human-computer interaction or biometric system applications. Generally, gender classification by computer can be done by using a face photo, fingerprint, or voice. However, researchers have demonstrated the potential of the electrocardiogram (ECG) as a biometric recognition and gender classification. In facilitating the process of gender classification based on ECG signals, a method is needed, namely Long Short-Term Memory (LSTM) and Bidirectional Long Short-Term Memory (Bi-LSTM). Researchers use these two methods because of the ability of these two methods to deal with sequential problems such as ECG signals. The inputs used in both methods generally use one-dimensional data with a generally large number of signal features. The dataset used in this study has a total of 10,000 features. This research was conducted on changing the input shape to determine its effect on classification performance in the LSTM and Bi-LSTM methods. Each method will be tested with input with 11 different shapes. The best accuracy results obtained are 79.03% with an input shape size of 100×100 in the LSTM method. Moreover, the best accuracy in the Bi-LSTM method with input shapes of 250×40 is 74.19%. The main contribution of this study is to share the impact of various input shape sizes to enhance the performance of gender classification based on ECG signals using LSTM and Bi-LSTM methods. Additionally, this study contributes for selecting an appropriate method between LSTM and Bi-LSTM on ECG signals for gender classification.