Claim Missing Document
Check
Articles

Implementation of the Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) Method to Address Class Imbalance in Alzheimer’s Disease Magnetic Resonance Imaging (MRI) Datasets Alamudin, Muhammad Faiq; Mazdadi, Muhammad Itqan; Nugroho, Radityo Adi; Saragih, Triando Hamonangan; Muliadi, Muliadi; Athavale, Vijay Anant
Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol. 7 No. 3 (2025): August
Publisher : Jurusan Teknik Elektromedik, Politeknik Kesehatan Kemenkes Surabaya, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/ijeeemi.v7i3.109

Abstract

Class imbalance in medical imaging datasets often leads to biased machine learning models, particularly in Alzheimer’s disease (AD) diagnosis using MRI. This study proposes the use of Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to mitigate class imbalance in AD MRI datasets. Realistic MRI images were synthesized for underrepresented AD stages, and the quality of the generated data was quantitatively validatedusing the Fréchet Inception Distance (FID), with the lowest FID score recorded at 31.84, indicating a high degree of realism and diversity. The synthetic images were used to augment a dataset of 6,400 T1-weighted scans for training four Convolutional Neural Network (CNN) architectures: ResNet-50, AlexNet, VGG-16, and VGG-19. Results demonstrated statistically significant improvements in balanced accuracy across all models (p < 0.01 for all comparisons). The AlexNet + WGAN-GP combination achieved the highest accuracy of 98.54%, representing a mean improvement of 4.76% (95% CI: 2.45% to 6.98%) over its baseline. Significant gains were also observed for ResNet-50, VGG-16, and VGG-19. These enhancements were consistent across multiple evaluation metrics, including precision, recall, F1-score, and AUC. These findings confirm that WGAN-GP is a highly effective and statistically validated strategy for boosting the diagnostic accuracy of CNN models in Alzheimer's disease classification
Automatic Analysis of Natural Disaster Messages on Social Media Using IndoBERT and Multilingual BERT Safitri, Yasmin Dwi; Faisal, Mohammad Reza; Kartini, Dwi; Saragih, Triando Hamonangan; Abadi, Friska; Bachtiar, Adam Mukharil
Telematika Vol 18, No 2: August (2025)
Publisher : Universitas Amikom Purwokerto

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35671/telematika.v18i2.3140

Abstract

Information about natural disasters disseminated through social media can serve as an important data source for mitigation processes and early warning systems. Social media platforms, such as X (formerly known as Twitter), have become primary channels for conveying real-time information, especially during disaster emergencies. With the large amount of unstructured disaster-related text that must be processed, the main challenge is accurately filtering and classifying messages into three categories: eyewitness, non-eyewitness, and don’t know. This research aims to compare the performance of four BERT-based natural language processing models, namely IndoBERT, IndoBERT with Masked Language Modeling (MLM), Multilingual BERT, and Multilingual BERT with MLM, in classifying Indonesian-language disaster messages. The dataset used in this study was obtained from previous research and publicly available data on GitHub, consisting of annotated messages related to floods, earthquakes, and forest fires. The method applied is a deep learning approach using the hold-out technique with an 80:20 ratio for training and testing data, and the same ratio applied to split the training data into training and validation subsets, with stratification to maintain balanced class proportions. In addition, variations in batch size were explored to evaluate their effect on model performance stability. The results show that the IndoBERT model achieved the highest performance on the flood and earthquake datasets, with accuracies of 80.67% and 81.50%, respectively. Meanwhile, IndoBERT with MLM pre-training recorded the highest accuracy on the forest fire dataset, 88.33%. Overall, IndoBERT demonstrated the most consistent and superior performance across datasets compared to the other models. These findings indicate that IndoBERT has strong capabilities in understanding Indonesian disaster-related text, and the results can be used as a foundation for developing automatic classification systems to support real-time disaster monitoring and early warning applications
Multimodal Biometric Recognition Based on Fusion of Electrocardiogram and Fingerprint Using CNN, LSTM, CNN-LSTM, and DNN Models Agustina, Winda; Nugrahadi, Dodon Turianto; Faisal, Mohammad Reza; Saragih, Triando Hamonangan; Farmadi, Andi; Budiman, Irwan; Parenreng, Jumadi Mabe; Alkaff, Muhammad
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 4 (2025): JUTIF Volume 6, Number 4, Agustus 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.4.5098

Abstract

Biometric authentication offers a promising solution for enhancing the security of digital systems by leveraging individuals' unique physiological characteristics. This study proposes a multimodal authentication system using deep learning approaches to integrate fingerprint images and electrocardiogram (ECG) signals. The datasets employed include FVC2004 for fingerprint data and ECG-ID for ECG signals. Four deep learning architectures—Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), CNN-LSTM, and Deep Neural Network (DNN)—are evaluated to compare their effectiveness in recognizing individual identity based on fused multimodal features. Feature extraction techniques include grayscale conversion, binarization, edge detection, minutiae extraction for fingerprint images, and R-peak–based segmentation for ECG signals. The extracted features are combined using a feature-level fusion strategy to form a unified representation. Experimental results indicate that the CNN model achieves the highest classification accuracy at 96.25%, followed by LSTM and DNN at 93.75%, while CNN-LSTM performs the lowest at 11.25%. Minutiae-based features consistently yield superior results across different models, highlighting the importance of local feature descriptors in fingerprint-based identification tasks. This research advances biometric authentication by demonstrating the effectiveness of feature-level fusion and CNN architecture for accurate and robust identity recognition. The proposed system shows strong potential for secure and adaptive biometric authentication in modern digital applications.
Implementation of Extra Trees Classifier and Chi-Square Feature Selection for Early Detection of Liver Disease Al Ghifari, Muhammad Akmal; Budiman, Irwan; Saragih, Triando Hamonangan; Mazdadi, Muhammad Itqan; Herteno, Rudy; Rozaq, Hasri Akbar Awal
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4261

Abstract

The imbalanced distribution of medical data poses challenges in accurately detecting liver disease, which is crucial as symptoms often remain unnoticed until advanced stages. This study examines the application of the Extra Trees Classifier algorithm and chi-square feature selection for early detection of liver disease. Compared to traditional methods like Random Forest and SVM, the Extra Trees Classifier offers enhanced computational efficiency and better handling of imbalanced datasets, while chi-square feature selection helps identify the most relevant medical indicators. The data consists of five medical variables likely to be laboratory test results from patient samples, with labels indicating classes A and B. The data is randomly divided with a ratio of 80% for each class. To address data imbalance, SMOTE technique was applied before the data was randomly split into a ratio of 80% for training and 20% for testing to ensure effective learning and testing of the model's performance. The results showed that with the help of chi-square feature selection, the Extra Trees Classifier algorithm could provide fairly accurate predictions in liver disease classification, with an accuracy of 82.6%, sensitivity of 85.5%, precision of 78.3%, and F1-Score of 81.7%. These results demonstrate significant improvement over existing methods, and the proposed approach can aid healthcare practitioners in making timely diagnostic decisions, potentially reducing mortality rates through early intervention in liver disease cases.
Prediction of Life Expectancy of Lung Cancer Patients After Thoracic Surgery Using Decision Tree Algorithm and Adaptive Synthetic Sampling Erdi, Muhammad; Mazdadi, Muhammad Itqan; Nugroho, Radityo Adi; Farmadi, Andi; Saragih, Triando Hamonangan; Rozaq, Hasri Akbar Awal
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4724

Abstract

This research focuses on predicting the life expectancy of lung cancer patients after undergoing thoracic surgery, using a decision tree classification algorithm (C4.5) combined with adaptive synthetic sampling to handle data imbalance. Data imbalance in the lung cancer patient dataset is a major obstacle in obtaining accurate prediction results, especially in identifying minority classes. Data imbalance in the lung cancer patient dataset is a major obstacle in obtaining accurate prediction results, especially in identifying minority classes. By applying ADASYN, the data distribution becomes more even, thus improving the performance of the C4.5 model. The results showed that combining these methods increased the prediction accuracy from 67% to 87%. In addition, the precision, recall, and f1-score for minority classes have significantly improved, which were previously difficult to identify by the model. Thus, combining the C4.5 algorithm and the ADASYN technique proved effective in dealing with the challenge of data imbalance and resulted in better prediction in the case of lung cancer. This study is expected to contribute to the field of medical classification and serve as a reference for further research on similar cases.
Performance Comparison of AdaBoost, LightGBM, and CatBoost for Parkinson's Disease Classification Using ADASYN Balancing Anshari, Muhammad Ridha; Saragih, Triando Hamonangan; Muliadi, Muliadi; Kartini, Dwi; Indriani, Fatma; Rozaq, Hasri Akbar Awal; Yıldız, Oktay
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4726

Abstract

Parkinson's disease is a neurodegenerative condition identified by the decline of neurons that produce dopamine, causing motor symptoms such as tremors and muscle stiffness. Early diagnosis is challenging as there is no definitive laboratory test. This study aims to improve the accuracy of Parkinson's diagnosis using voice recordings with machine learning algorithms, such as AdaBoost, LightGBM, and CatBoost. The dataset used is Parkinson's Disease Detection from Kaggle, consisting of 195 records with 22 attributes. The data was normalized with Min-Max normalization, and class imbalance was resolved with ADASYN. Results show that ADASYN-LightGBM and ADASYN-CatBoost have the best performance with 96.92% accuracy, 97.10% precision, 96.92% recall, and 96.92% F1 score. This improvement suggests that combining boosting methods and data balancing techniques can improve the accuracy of Parkinson's diagnosis. These results demonstrate the effectiveness of ADASYN in addressing data imbalance and improving the performance of boosting algorithms for medical classification problems. The findings contribute to the development of intelligent diagnostic systems in the field of medical informatics and computer science. These findings are essential for developing more accurate and efficient diagnostic tools, supporting early diagnosis and better management of Parkinson's disease.
Seleksi Fitur Hybrid Grey Wolf Optimization dan Particle Swarm Optimization pada Distance Biased Naive Bayes untuk Klasifikasi Kanker Payudara Ratna Septia Devi; Triando Hamonangan Saragih; Mohammad Reza Faisal
Jurnal Informatika Polinema Vol. 10 No. 2 (2024): Vol 10 No 2 (2024)
Publisher : UPT P2M State Polytechnic of Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33795/jip.v10i2.4737

Abstract

Kanker payudara adalah penyebab utama kematian akibat kanker tertinggi kedua di dunia. Pasien Kanker payudara terus mengalami peningkatan dan menjadi masalah kesehatan yang cukup serius di seluruh dunia, termasuk juga di Indonesia. Diagnosis dini adalah salah satu pendekatan terbaik untuk mencegah penyakit ini semakin meningkat dan berkembang. Machine learning dapat melakukan penambangan data menggunakan serangkaian fitur pada sebuah data. Penelitian ini menggunakan dataset public dari UCI machine learning repository yaitu Breast Cancer Wisconsin (Diagnostic). Pada dataset ini memiliki atribut sebanyak 32 fitur, namun banyaknya fitur pada sebuah data juga akan memperlambat waktu komputasi dari metode klasifikasi yang digunakan. Pada penelian ini, akan dilakukan seleksi fitur menggunakan metode Hybrid Grey Wolf Optimization dan Particle Swarm Optimization (HGWOPSO) untuk memilih fitur yang paling informatif dan signifikan untuk digunakan pada klasifikasi. Metode klasifikasi yang digunakan adalah Distance Biased Naive Bayes (DBNB) yang terdiri dari dua modul yaitu Weighted Naïve Bayes Module (WNBM) dan Distance Reinforcement Module (DRM). Dari penelitian ini, didapatkan performa akurasi tertinggi pada model DBNB tanpa seleksi fitur sebesar 94,90%, DBNB dengan GWO sebesar 95,08%, DBNB dengan PSO sebesar 95,25%, dan DBNB dengan HGWOPSO sebesar 96,13%. Dapat disimpulkan bahwa model DBNB dengan seleksi fitur HGWOPSO mengalami peningkatan dibandingkan dengan DBNB tanpa seleksi fitur maupun dengan seleksi fitur individualnya.
Penyeimbangan Kelas SMOTE dan Seleksi Fitur Ensemble Filter pada Support Vector Machine untuk Klasifikasi Penyakit Liver Nugraha, Muhammad Amir; Mazdadi, Muhammad Itqan; Farmadi, Andi; Muliadi; Saragih, Triando Hamonangan
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 10 No 6: Desember 2023
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25126/jtiik.2023107234

Abstract

Liver merupakan salah satu organ penting dalam tubuh manusia yang berperan dalam proses metabolisme tubuh. Mengutip artikel dari situs American Liver Foundation, pada tahun 2020 sebanyak 51.642 orang dewasa di Amerika Serikat meninggal akibat penyakit liver. Data hasil tes fungsi liver dari laboratorium dapat digunakan untuk mendiagnosis penyakit liver. Klasifikasi penyakit liver pada pasien perlu dilakukan dengan baik karena hasilnya dapat membantu dalam diagnosis awal apakah seorang pasien mengidap penyakit liver. Berdasarkan penelitian sebelumnya, metode Support Vector Machine (SVM) paling baik dalam mengklasifikasikan pasien penyakit liver. Namun, SVM memiliki kelemahan ketika diterapkan pada dataset dengan kelas yang tidak seimbang dan tidak bekerja secara akurat ketika terlalu banyak fitur yang tidak relevan digunakan. Untuk menyeimbangkan kelas pada dataset, digunakan metode Synthetic Minority Oversampling Technique (SMOTE). Sedangkan untuk seleksi fitur dilakukan menggunakan metode Ensemble Filter, terdiri dari metode Information Gain, Gain Ratio, dan Relief-F untuk menangani fitur-fitur tidak relevan. Berdasarkan hasil pengujian, penerapan SMOTE dan Ensemble Filter pada metode klasifikasi SVM memberikan hasil terbaik dengan nilai accuracy sebesar 85% dan AUC sebesar 0,850. Pengujian tersebut dapat membuktikan jika SMOTE pada penyeimbangan kelas dan Ensemble Filter pada seleksi fitur dapat meningkatkan performa klasifikasi dari metode SVM.    Abstract   The liver is one of the important organs in the human body that plays a role in the body's metabolic processes. Quoting an article from the American Liver Foundation website, in 2020, as many as 51,642 adults in the United States died from liver disease. Liver function test data from the laboratory can be used to diagnose liver disease. Classification of liver disease in patients needs to be done well because the results can help in the initial diagnosis of whether a patient has liver disease. Based on previous research, the Support Vector Machine (SVM) method best classifies liver disease patients. However, SVM has weaknesses when applied to datasets with unbalanced classes and does not work accurately when too many irrelevant features are used. To class-balance the dataset, the Synthetic Minority Oversampling Technique (SMOTE) method is used. Meanwhile, feature selection is performed using the Ensemble Filter method, which consists of Information Gain, Gain Ratio, and Relief-F methods to handle irrelevant features. Based on the test results, the application of SMOTE and Ensemble Filter in SVM classification gives the best results with an accuracy value of 85% and an AUC of 0.850. The test can prove if SMOTE on class balancing and Ensemble Filter on feature selection can improve the classification performance of the SVM method.
Jatropha Curcas Disease Identification using Random Forest Saragih, Triando Hamonangan; Wijayaningrum, Vivi Nur; Haekal, Muhammad
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika Vol. 7 No. 1 (2021): April
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/jiteki.v7i1.20141

Abstract

As one of the most versatile plants, Jatropha curcas is spread in various regions around the world because of the great benefits it provides. However, Jatropha curcas is easily attacked by viruses which then cause damage to the plant, such as yellowing leaves and secreting sap, making it necessary to identify Jatropha curcas disease to deal with the problem as early as possible so that the losses incurred are not too large. An expert system was built to be able to identify Jatropha curcas disease by adopting expert knowledge. The use of the Random Forest algorithm as one of the classification algorithms was applied in this study. By using a random forest, several disease prediction classes are generated by each decision tree that has been formed. The disease class with the most votes was used as the final result. In this study, the data used were 166 data with 9 diseases and 30 symptoms. The results showed that Random Forest outperformed other algorithms such as Fuzzy Neural Network and Extreme Learning Machine with an accuracy of 98.002% using the composition of training data and test data of 70:30.
Automated Detection of COVID-19 Cough Sound using Mel-Spectrogram Images and Convolutional Neural Network Nafiz, Muhammad Fauzan; Kartini, Dwi; Faisal, Mohammad Reza; Indriani, Fatma; Saragih, Triando Hamonangan
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika Vol. 9 No. 3 (2023): September
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/jiteki.v9i3.26374

Abstract

COVID-19 disease is known as a new disease caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) variant. The initial symptoms of the disease commonly include fever (83-98%), fatigue or myalgia, dry cough (76-82%), and shortness of breath (31-55%). Given the prevalence of coughing as a symptom, artificial intelligence has been employed as a means of detecting COVID-19 based on cough sounds. This study aims to compare the performance of six different Convolutional Neural Network (CNN) models (VGG-16, VGG-19, LeNet-5, AlexNet, ResNet-50, and ResNet-152) in detecting COVID-19 using mel-spectrogram images derived from cough sounds. The training and validation of these CNN models were conducted using the Virufy dataset. Audio data was processed to generate mel-spectrogram images, which were subsequently employed as inputs for the CNN models. The AlexNet model, utilizing an input size of 227x227, exhibited the best performance with the highest Area Under the Curve (AUC) value of 0.930303. This study provides compelling evidence of the efficacy of CNN models in detecting COVID-19 based on cough sounds through the utilization of mel-spectrogram images. Furthermore, the study underscores the impact of input size on model performance. The primary contribution of this research lies in identifying the CNN model that demonstrates the best performance in COVID-19 detection based on cough sounds. Additionally, this study establishes the fundamental groundwork for selecting an appropriate CNN methodology for early detection of COVID-19.
Co-Authors AA Sudharmawan, AA Abadi, Friska Abdul Latief Abadi Abdullayev, Vugar Achmad Rizal Adawiyah, Laila Afifa, Ridha Ahmad Rusadi Arrahimi - Universitas Lambung Mangkurat) Ahmad Rusadi Arrahimi - Universitas Lambung Mangkurat) Ahmad Tajali Aida, Nor Ajwa Helisa Al Ghifari, Muhammad Akmal Alamudin, Muhammad Faiq Alfita Rakhmandasari Amelia Aditya Santika Andi Farmadi Andi Farmadi Anshari, Muhammad Ridha Anshory, Muhammad Naufal Ansyari, Muhammad Ridho Athavale, Vijay Anant Athavale, Vijay Annant Bachtiar, Adam Mukharil Bachtiar, Adam Mukharil Difa Fitria Dina Arifah Diny Melsye Nurul Fajri Diny Melsye Nurul Fajri Dodon Turianto Nugrahadi Dwi Kartini Dwi Kartini, Dwi Dzira Naufia Jawza Erdi, Muhammad Erlianita, Noor Faisal, Mohammad Reza Fatma Indriani Fatma Indriani Favorisen R. Lumbanraja Febrian, Muhamad Michael Friska Abadi Haekal, Muhammad Haekal, Muhammad Hafizah, Rini Hermiati, Arya Syifa Herteno, Rudy Huynh, Phuoc-Hai Ichwan Dwi Nugraha Indriani, Fatma Irwan Budiman Irwan Budiman Itqan Mazdadi, Muhammad Ivan Sitohang Jumadi Mabe Parenreng Keswani, Ryan Rhiveldi Lilies Handayani M. Khairul Rezki Mafazy, Muhammad Meftah Mariana Dewi Muhamad Fawwaz Akbar Muhammad Al Ichsan Nur Rizqi Said Muhammad Alkaff Muhammad Darmadi Muhammad Fauzan Nafiz Muhammad Haekal Muhammad Haekal Muhammad Ikhwan Rizki Muhammad Itqan Mazdadi Muhammad Mursyidan Amini Muhammad Nadim Mubaarok Muhammad Reza Faisal, Muhammad Reza Muhammad Rofiq Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Musyaffa, Muhammad Hafizh Nafiz, Muhammad Fauzan Noryasminda Nugraha, Muhammad Amir Nurcahyati, Ica Nurlatifah Amini Okta Muthia Sari Purwoko, Agus Putra, Aditya Maulana Perdana Radityo Adi Nugroho Rahmat Ramadhani Rahmat Ramadhani Rahmatullah, Satrio Wibowo Ramadhani, Rahmat Ratna Septia Devi Regina Reza Faisal, Mohammad Rizki, M. Alfi Rozaq, Hasri Akbar Awal Rudy Herteno Rudy Herteno Safitri, Yasmin Dwi Said, Muhammad Al Ichsan Nur Rizqi SALLY LUTFIANI Salsha Farahdiba Saputro, Setyo Wahyu Siena, Laifansan Siti Aisyah Solechah Siti Napi'ah Suci Permata Sari Sulastri Norindah Sari Tajali, Ahmad Totok Wianto Vivi Nur Wijayaningrum Wahyu Caesarendra Wayan Firdaus Mahmudy Winda Agustina Yanche Kurniawan Mangalik YILDIZ, Oktay Yusuf Priyo Anggodo Zamzam, Yra Fatria