Claim Missing Document
Check
Articles

Implementation of Chi-Square Feature Selection for Parkinson’s Disease Classification Using LightGBM Ahdyani, Annisa Salsabila; Budiman, Irwan; Kartini, Dwi; Farmadi, Andi; Mazdadi, Muhammad Itqan
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 19, No 3 (2025): July
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.107881

Abstract

Penyakit Parkinson merupakan penyakit yang disebabkan oleh kerusakan sel saraf otak dan termasuk penyakit yang jumlah kasusnya meningkat pesat di dunia. Salah satu cara yang dapat dilakukan untuk mencegah meningkatnya kasus penyakit Parkinson adalah dengan melakukan diagnosis melalui metode klasifikasi dengan pendekatan pembelajaran algoritmik. Penelitian ini mengimplementasikan teknik Chi-Square untuk pendekatan pemilihan fitur yang relevan dengan algoritma Light Gradient Boosting Machine (LightGBM) dalam klasifikasi penyakit Parkinson. Pemilihan fitur Chi-Square bertujuan untuk mengurangi fitur yang kurang relevan sehingga dapat meningkatkan hasil kinerja model. Selain itu, metode SMOTE diterapkan untuk menangani ketidakseimbangan data dan penyetelan hiperparameter guna menentukan kombinasi parameter yang optimal. Pengujian dilakukan terhadap sepuluh variasi jumlah fitur, dengan hasil terbaik diperoleh dengan menggunakan 200 fitur yang menghasilkan akurasi sebesar 96,05%. Dengan menggunakan metode Chi-Square, kinerja model LightGBM meningkat dibandingkan dengan kinerja tanpa pemilihan fitur. Penerapan kombinasi metode ini dapat meningkatkan kinerja model klasifikasi secara signifikan dan berpotensi untuk diterapkan dalam sistem pendukung diagnosis penyakit Parkinson.
Implementation of the Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) Method to Address Class Imbalance in Alzheimer’s Disease Magnetic Resonance Imaging (MRI) Datasets Alamudin, Muhammad Faiq; Mazdadi, Muhammad Itqan; Nugroho, Radityo Adi; Saragih, Triando Hamonangan; Muliadi, Muliadi; Athavale, Vijay Anant
Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol. 7 No. 3 (2025): August
Publisher : Jurusan Teknik Elektromedik, Politeknik Kesehatan Kemenkes Surabaya, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/ijeeemi.v7i3.109

Abstract

Class imbalance in medical imaging datasets often leads to biased machine learning models, particularly in Alzheimer’s disease (AD) diagnosis using MRI. This study proposes the use of Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to mitigate class imbalance in AD MRI datasets. Realistic MRI images were synthesized for underrepresented AD stages, and the quality of the generated data was quantitatively validatedusing the Fréchet Inception Distance (FID), with the lowest FID score recorded at 31.84, indicating a high degree of realism and diversity. The synthetic images were used to augment a dataset of 6,400 T1-weighted scans for training four Convolutional Neural Network (CNN) architectures: ResNet-50, AlexNet, VGG-16, and VGG-19. Results demonstrated statistically significant improvements in balanced accuracy across all models (p < 0.01 for all comparisons). The AlexNet + WGAN-GP combination achieved the highest accuracy of 98.54%, representing a mean improvement of 4.76% (95% CI: 2.45% to 6.98%) over its baseline. Significant gains were also observed for ResNet-50, VGG-16, and VGG-19. These enhancements were consistent across multiple evaluation metrics, including precision, recall, F1-score, and AUC. These findings confirm that WGAN-GP is a highly effective and statistically validated strategy for boosting the diagnostic accuracy of CNN models in Alzheimer's disease classification
Improving Diabetes Prediction Using Feedforward Neural Network with Adam Optimization and SMOTE Technique Wijaya Kusuma, Arizha; Mazdadi, Muhammad Itqan; Kartini, Dwi; Farmadi, Andi; Indriani, Fatma; P., Chandrasekaran
Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol. 7 No. 3 (2025): August
Publisher : Jurusan Teknik Elektromedik, Politeknik Kesehatan Kemenkes Surabaya, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/ijeeemi.v7i3.127

Abstract

Diabetes mellitus is a chronic metabolic disorder that demands early and accurate detection to prevent life-threatening complications. Traditional diagnostic procedures, such as blood glucose tests and oral glucose tolerance tests, are often invasive, time-consuming, and resource-intensive, making them less practical for widespread screening. This study aims to explore the potential of artificial intelligence, specifically Feedforward Neural Networks (FNN), in predicting diabetes based on clinical data from the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). The main contribution of this research lies in the application of the Adaptive Moment Estimation (Adam) optimization algorithm and the Synthetic Minority Oversampling Technique (SMOTE) to enhance the performance and generalization of the FNN on imbalanced medical datasets. The methodology involves preprocessing steps such as imputing zero values with feature means, normalizing input features using Min-Max scaling, and applying SMOTE to balance class distribution. Two model configurations were compared: a baseline FNN trained manually using full-batch gradient descent and a second FNN optimized using Adam. Experimental results demonstrated that the baseline model achieved an accuracy of 70.13%, precision of 56.06%, recall of 68.52%, and F1-score of 61.67%, while the Adam-optimized model achieved superior results with an average accuracy of 73.31%, precision of 60.97%, recall of 66.67%, and F1-score of 63.64% across ten independent runs. These findings indicate that combining adaptive optimization with oversampling significantly enhances the robustness and reliability of neural networks for medical classification tasks. In conclusion, the proposed method provides an effective framework for AI-assisted early diabetes detection and opens pathways for future development using deeper network architectures and explainable AI models for clinical applications.
Implementation of Extra Trees Classifier and Chi-Square Feature Selection for Early Detection of Liver Disease Al Ghifari, Muhammad Akmal; Budiman, Irwan; Saragih, Triando Hamonangan; Mazdadi, Muhammad Itqan; Herteno, Rudy; Rozaq, Hasri Akbar Awal
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4261

Abstract

The imbalanced distribution of medical data poses challenges in accurately detecting liver disease, which is crucial as symptoms often remain unnoticed until advanced stages. This study examines the application of the Extra Trees Classifier algorithm and chi-square feature selection for early detection of liver disease. Compared to traditional methods like Random Forest and SVM, the Extra Trees Classifier offers enhanced computational efficiency and better handling of imbalanced datasets, while chi-square feature selection helps identify the most relevant medical indicators. The data consists of five medical variables likely to be laboratory test results from patient samples, with labels indicating classes A and B. The data is randomly divided with a ratio of 80% for each class. To address data imbalance, SMOTE technique was applied before the data was randomly split into a ratio of 80% for training and 20% for testing to ensure effective learning and testing of the model's performance. The results showed that with the help of chi-square feature selection, the Extra Trees Classifier algorithm could provide fairly accurate predictions in liver disease classification, with an accuracy of 82.6%, sensitivity of 85.5%, precision of 78.3%, and F1-Score of 81.7%. These results demonstrate significant improvement over existing methods, and the proposed approach can aid healthcare practitioners in making timely diagnostic decisions, potentially reducing mortality rates through early intervention in liver disease cases.
Prediction of Life Expectancy of Lung Cancer Patients After Thoracic Surgery Using Decision Tree Algorithm and Adaptive Synthetic Sampling Erdi, Muhammad; Mazdadi, Muhammad Itqan; Nugroho, Radityo Adi; Farmadi, Andi; Saragih, Triando Hamonangan; Rozaq, Hasri Akbar Awal
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4724

Abstract

This research focuses on predicting the life expectancy of lung cancer patients after undergoing thoracic surgery, using a decision tree classification algorithm (C4.5) combined with adaptive synthetic sampling to handle data imbalance. Data imbalance in the lung cancer patient dataset is a major obstacle in obtaining accurate prediction results, especially in identifying minority classes. Data imbalance in the lung cancer patient dataset is a major obstacle in obtaining accurate prediction results, especially in identifying minority classes. By applying ADASYN, the data distribution becomes more even, thus improving the performance of the C4.5 model. The results showed that combining these methods increased the prediction accuracy from 67% to 87%. In addition, the precision, recall, and f1-score for minority classes have significantly improved, which were previously difficult to identify by the model. Thus, combining the C4.5 algorithm and the ADASYN technique proved effective in dealing with the challenge of data imbalance and resulted in better prediction in the case of lung cancer. This study is expected to contribute to the field of medical classification and serve as a reference for further research on similar cases.
Penyeimbangan Kelas SMOTE dan Seleksi Fitur Ensemble Filter pada Support Vector Machine untuk Klasifikasi Penyakit Liver Nugraha, Muhammad Amir; Mazdadi, Muhammad Itqan; Farmadi, Andi; Muliadi; Saragih, Triando Hamonangan
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 10 No 6: Desember 2023
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25126/jtiik.2023107234

Abstract

Liver merupakan salah satu organ penting dalam tubuh manusia yang berperan dalam proses metabolisme tubuh. Mengutip artikel dari situs American Liver Foundation, pada tahun 2020 sebanyak 51.642 orang dewasa di Amerika Serikat meninggal akibat penyakit liver. Data hasil tes fungsi liver dari laboratorium dapat digunakan untuk mendiagnosis penyakit liver. Klasifikasi penyakit liver pada pasien perlu dilakukan dengan baik karena hasilnya dapat membantu dalam diagnosis awal apakah seorang pasien mengidap penyakit liver. Berdasarkan penelitian sebelumnya, metode Support Vector Machine (SVM) paling baik dalam mengklasifikasikan pasien penyakit liver. Namun, SVM memiliki kelemahan ketika diterapkan pada dataset dengan kelas yang tidak seimbang dan tidak bekerja secara akurat ketika terlalu banyak fitur yang tidak relevan digunakan. Untuk menyeimbangkan kelas pada dataset, digunakan metode Synthetic Minority Oversampling Technique (SMOTE). Sedangkan untuk seleksi fitur dilakukan menggunakan metode Ensemble Filter, terdiri dari metode Information Gain, Gain Ratio, dan Relief-F untuk menangani fitur-fitur tidak relevan. Berdasarkan hasil pengujian, penerapan SMOTE dan Ensemble Filter pada metode klasifikasi SVM memberikan hasil terbaik dengan nilai accuracy sebesar 85% dan AUC sebesar 0,850. Pengujian tersebut dapat membuktikan jika SMOTE pada penyeimbangan kelas dan Ensemble Filter pada seleksi fitur dapat meningkatkan performa klasifikasi dari metode SVM.    Abstract   The liver is one of the important organs in the human body that plays a role in the body's metabolic processes. Quoting an article from the American Liver Foundation website, in 2020, as many as 51,642 adults in the United States died from liver disease. Liver function test data from the laboratory can be used to diagnose liver disease. Classification of liver disease in patients needs to be done well because the results can help in the initial diagnosis of whether a patient has liver disease. Based on previous research, the Support Vector Machine (SVM) method best classifies liver disease patients. However, SVM has weaknesses when applied to datasets with unbalanced classes and does not work accurately when too many irrelevant features are used. To class-balance the dataset, the Synthetic Minority Oversampling Technique (SMOTE) method is used. Meanwhile, feature selection is performed using the Ensemble Filter method, which consists of Information Gain, Gain Ratio, and Relief-F methods to handle irrelevant features. Based on the test results, the application of SMOTE and Ensemble Filter in SVM classification gives the best results with an accuracy value of 85% and an AUC of 0.850. The test can prove if SMOTE on class balancing and Ensemble Filter on feature selection can improve the classification performance of the SVM method.
Pelatihan Penerapan Teknologi Alat Produksi dan Pengemasan bagi UMKM Labuana Kota Banjarbaru Kalimantan Selatan Normaidah, Normaidah; Mazdadi, Muhammad Itqan; Ratnapuri, Prima Happy; Buih, Putri Helena Junjung
Jurnal Pengabdian UNDIKMA Vol. 6 No. 1 (2025): February
Publisher : LPPM Universitas Pendidikan Mandalika (UNDIKMA)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33394/jpu.v6i1.13538

Abstract

This community service aims to improve the skills of UMKM Labuana in applying production equipment technology and product packaging instruments. The methods used include initial surveys, followed by technology application training, and evaluation with observation methods on improving partner skills in the equipment used. The results of this service show that the increasing skills of UMKM Labuana in applying product production and packaging technology include installation, operation, and maintenance of equipment.
The Effectiveness of Data Imputations on Myocardial Infarction Complication Classification Using Machine Learning Approach with Hyperparameter Tuning Mazdadi, Muhammad Itqan; Saragih, Triando Hamonangan; Budiman, Irwan; Farmadi, Andi; Tajali, Ahmad
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika Vol. 10 No. 3 (2024): September
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26555/jiteki.v10i3.29479

Abstract

Complications from Myocardial Infarction (MI) represent a critical medical emergency caused by the blockage of blood flow to the heart muscle, primarily due to a blood clot in a coronary artery narrowed by atherosclerotic plaque. Diagnosing MI involves physical examination, electrocardiogram (ECG) evaluation, blood sample analysis for specific heart enzyme levels, and imaging techniques such as coronary angiography. Proactively predicting acute myocardial complications can mitigate adverse outcomes, and this study focuses on early prediction using classification methods. Machine learning algorithms such as Support Vector Machine (SVM), Random Forest, and XGBoost were employed to classify patient medical records accurately. Techniques like K-Nearest Neighbors (KNN) imputation, Iterative imputation, and Miss Forest were used to handle incomplete datasets, preserving vital information. Hyperparameter optimization, crucial for model performance, was performed using Bayesian Optimization, which minimizes the objective function by modeling past evaluations. The contribution to this study is to see how much influence data imputation has on classification using machine learning methods on missing data and to see how much influence the optimization method has when performing hyperparameter tuning. Results demonstrated that the Iterative Imputation method yielded excellent performance with SVM and XGBoost algorithms. SVM achieved 100% accuracy, precision, sensitivity, F1 score, and AUC. XGBoost reached 99.4% accuracy, 100% precision, 79.6% sensitivity, an F1 score of 88.7%, and an AUC of 0.898. KNN Imputation with SVM showed results similar to Iterative Imputation with SVM, while Random Forest exhibited poor classification outcomes due to data imbalance, causing overfitting.
Accurate Skin Tone Classification for Foundation Shade Matching using GLCM Features-K-Nearest Neighbor Algorithm Syahputra, Muhammad Reza; Mazdadi, Muhammad Itqan; Budiman, Irwan; Farmadi, Andi; Saputro, Setyo Wahyu; Rozaq, Hasri Akbar Awal; Sutaji, Deni
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4723

Abstract

Foundation shade matching remains a significant challenge in the beauty industry, particularly in Indonesia where consumers exhibit three distinct skin tone categories: ivory white, amber yellow, and tan. Manual foundation selection often results in mismatched shades, leading to customer dissatisfaction. This study presents a novel automated skin tone classification system combining Gray Level Co-Occurrence Matrix (GLCM) feature extraction with the K-Nearest Neighbor (KNN) algorithm. The GLCM method extracts four key texture features (contrast, homogeneity, energy, and entropy) from facial images, while KNN performs classification. A comprehensive dataset of 963 facial images was used, with 770 training and 193 test samples collected under controlled lighting conditions. After testing K values from 1 to 15, the optimal K=1 achieved 75.65% accuracy. Compared to baseline color histogram methods (60% accuracy), our GLCM-KNN approach demonstrates 15.65% improvement in classification performance. This research contributes to computer vision applications in beauty technology, enabling the development of mobile applications for virtual foundation try-on and personalized product recommendations. The findings have significant implications for the cosmetics industry, particularly for automated cosmetic shade matching systems and enhanced customer experience in online beauty retail. Further research is recommended to explore deep learning approaches and expand dataset diversity to improve accuracy.
Cross-Temporal Generalization of IndoBERT for Indonesian Hoax News Classification Riadi, Agus Teguh; Indriani, Fatma; Mazdadi, Muhammad Itqan; Faisal, Mohammad Reza; Herteno, Rudi
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, Oktober 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.5.4757

Abstract

The spread of hoaxes in digital media poses a major challenge for automated detection systems as language and topics evolve over time. Although Transformer-based models such as IndoBERT have demonstrated high accuracy in previous studies, their performance across different time periods remains underexplored. This study examines the cross-temporal generalization ability of IndoBERT for hoax news classification. The model was trained on labeled articles from 2018–2023 and tested on data from 2025 to evaluate its robustness against temporal distribution shifts. The results indicate high accuracy on similar-period data (99.67–99.89%) but a decrease on 2025 data (95.45–95.87%), with most errors occurring as false negatives in the hoax class. These findings highlight the impact of temporal distribution shifts on model reliability and underscore the importance of adaptive strategies such as periodic retraining and domain-based data augmentation. Practically, this model has the potential to assist social media platforms and government institutions in developing dynamic and time-adaptive hoax detection systems. The cross-temporal approach employed in this study also offers methodological innovation compared to conventional random validation, as it better reflects real-world conditions where misinformation patterns continually evolve.
Co-Authors AA Sudharmawan, AA Abdilah, Muhammad Fariz Fata Abdullayev, Vugar Ade Agung Harnawan, Ade Agung Adela Putri Ariyanti Afifa, Ridha Ahdyani, Annisa Salsabila Ahmad Rusadi Ahmad Rusadi Ahmad Rusadi Arrahimi - Universitas Lambung Mangkurat) Ahmad Rusadi Arrahimi - Universitas Lambung Mangkurat) Ahmad Shofi Khairian Ahmad Tajali Aidil Akbar Al Ghifari, Muhammad Akmal Alamudin, Muhammad Faiq Amalia, Raisa Andi - Farmadi Andi Farmadi Andi Farmadi Anna Khumaira Sari Anshory, Muhammad Naufal Ansyari, Muhammad Ridho Antoh, Soterio Ardiansyah Sukma Wijaya Athavale, Vijay Anant Athavale, Vijay Annant budiman, irwan Buih, Putri Helena Junjung Deni Sutaji Dina Arifah Djordi Hadibaya Dodon Turianto Nugrahadi Dwi Kartini Dwi Kartini Dwi Kartini, Dwi Dzira Naufia Jawza Erdi, Muhammad Faisal, Mohammad Reza Fathmah, Siti Fatma Indriani Fayyadh, Muhammad Naufaldi Fitriani, Karlina Elreine Fitrinadi Friska Abadi Haekal, Muhammad Hafizah, Rini Helma Herlinda Herteno, Rudi Herteno, Rudy Indriani, Fatma Irwan Budiman Irwan Budiman Irwan Budiman Irwan Budiman M. Apriannur M. Khairul Rezki Mafazy, Muhammad Meftah Maulana, Muhammad Rafly Alfarizqy Muflih Ihza Rifatama Muhamad Fawwaz Akbar Muhamad Ihsanul Qamil Muhammad Adika Riswanda Muhammad Khairin Nahwan Muhammad Mada Muhammad Mirza Hafiz Yudianto Muhammad Mursyidan Amini Muhammad Reza Faisal, Muhammad Reza Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Muliadi Nabella, Putri Noorhafizi, Muhammad Normaidah, Normaidah Nugraha, Muhammad Amir Nursyifa Azizah P., Chandrasekaran Patrick Ringkuangan Prastya, Septyan Eka Putri Nabella Radityo Adi Nugroho Rahmah, Indah Noor Rahmat Hidayat Rahmat Ramadhani Rahmat Ramadhani Rahmawati, Nanda Hesti Rahmawati, Nanda Putri Ramadhan, Mita Azzahra Ramadhani, Muhammad Irfan Ramadhani, Rahmat Ratnapuri, Prima Happy Riadi, Agus Teguh Rifki Izdihar Oktvian Abas Pullah Rifki Rinaldi Rizky, Muhammad Miftahur Rozaq, Hasri Akbar Awal Rozaq, Hasri Awal Akbar Rudy Herteno Saputra, Adryan Maulana Saputro, Setyo Wahyu Saragih, Triando Hamonangan Satrio Yudho Prakoso Setyo Wahyu Saputro Shalehah Syahputra, Muhammad Reza Tajali, Ahmad Totok Wianto Wahyu Dwi Styadi Wijaya Kusuma, Arizha Yanche Kurniawan Mangalik YILDIZ, Oktay Yoga Pambudi Yudha Sulistiyo Wibowo Zaini Abdan