
Journal : JOURNAL OF APPLIED INFORMATICS AND COMPUTING

Anemia Classification with Clinical Feature Engineering and SHAP Interpretation
Amalia, Ikhlasul; Rumini, Rumini
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

DOI: 10.30871/jaic.v9i5.10912

Abstract

Anemia is a global health issue that significantly affects quality of life and productivity. Early and accurate detection is essential to prevent more serious complications. This study aims to develop a machine learning model for anemia classification using the XGBoost algorithm and to compare its performance with Logistic Regression and Random Forest. The dataset was obtained from the Kaggle platform and consists of 1,421 samples and six clinical attributes: Gender, Hemoglobin (HGB), Mean Corpuscular Hemoglobin (MCH), Mean Corpuscular Hemoglobin Concentration (MCHC), Mean Corpuscular Volume (MCV), and Result. During feature engineering, a derived feature, the hemoglobin-to-MCV ratio (Hb/MCV), was added; it is medically relevant for distinguishing types of anemia. Evaluation results showed that XGBoost and Random Forest achieved an accuracy and F1-Score of 100%, while Logistic Regression achieved 98.9%. XGBoost was selected as the primary model because of its computational efficiency and its support for interpretation using SHAP (SHapley Additive exPlanations). SHAP visualization revealed that the Hb/MCV ratio and hemoglobin were the most influential features in classification. The model has the potential to serve as a decision support system for automated anemia screening and could be further integrated into clinical systems.
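The derived Hb/MCV feature described in the abstract can be illustrated with a minimal sketch. The field names (`hemoglobin`, `mcv`, `hb_mcv_ratio`) and the sample values are assumptions made for illustration; the abstract does not give the dataset's column labels or the authors' code.

```python
# Minimal sketch of a derived hemoglobin-to-MCV ratio feature.
# Field names are hypothetical; the abstract does not specify column labels.

def add_hb_mcv_ratio(record: dict) -> dict:
    """Return a copy of `record` with an added Hb/MCV ratio feature."""
    hgb = record["hemoglobin"]
    mcv = record["mcv"]
    if mcv <= 0:
        # A non-positive MCV is not physiologically meaningful and would
        # make the ratio undefined or misleading.
        raise ValueError("MCV must be positive")
    enriched = dict(record)  # copy so the input record is left unchanged
    enriched["hb_mcv_ratio"] = hgb / mcv
    return enriched


# Illustrative values only, not taken from the study's data.
sample = {"gender": 1, "hemoglobin": 12.0, "mch": 27.0, "mchc": 32.0, "mcv": 80.0}
enriched_sample = add_hb_mcv_ratio(sample)
print(enriched_sample["hb_mcv_ratio"])  # 0.15
```

Returning a copy rather than mutating the input keeps the raw attributes intact, which may be convenient when comparing models trained with and without the derived feature.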
Pap Smear Image Classification for Cervical Cancer Prediction with Transfer Learning on ResNet101 Architecture
Dewi, Sila Cahya; Rumini, Rumini
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

DOI: 10.30871/jaic.v9i5.10343

Abstract

Early detection of cervical cancer remains a pivotal strategy to improve clinical outcomes and mitigate mortality associated with this disease. This study introduces a robust deep learning framework employing the ResNet101 architecture to facilitate the automated classification of cervical cell images derived from Pap smear examinations. By leveraging transfer learning, the pre-trained ResNet101 model was fine-tuned to extract salient morphological features critical for distinguishing among diverse cervical cell categories. A comprehensive dataset of labeled Pap smear images, systematically expanded through augmentation techniques, was utilized to enhance model generalizability. The proposed approach achieved a remarkable classification accuracy of 99.6%, highlighting its effectiveness in reliably differentiating between normal and abnormal cellular structures. These findings substantiate the promise of deep residual networks coupled with transfer learning as a powerful tool in advancing computer-aided diagnostic systems, thereby reinforcing early screening initiatives for cervical cancer.
Comparative Study of Logistic Regression, Random Forest, and XGBoost for Bank Loan Approval Classification
Putra, Hamdika; Rumini, Rumini
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

DOI: 10.30871/jaic.v9i5.10862

Abstract

Bank loan approval plays a vital role in helping financial institutions minimize credit risk while supporting economic growth. Default prediction is a crucial aspect of banking credit risk management. This study compares three machine learning algorithms, Logistic Regression, Random Forest, and Extreme Gradient Boosting (XGBoost), for classifying bank loan approvals using a combination of application, previous application, and bureau datasets. The workflow includes data merging, cleaning, missing value imputation, handling of unknown values, feature engineering (such as converting day-based variables into years and calculating total submitted documents, income-to-annuity ratio, and employment-to-income ratio), encoding (label and one-hot), scaling (min-max normalization), feature selection based on correlation analysis, handling of class imbalance with SMOTE, and modeling and evaluation using Accuracy, Precision, Recall, F1-score, and AUC. The results show that Logistic Regression yields the highest AUC of 0.741498, outperforming Random Forest (0.713758) and XGBoost (0.715944). From a business perspective, implementing the best model reduced the Loss Given Default (LGD) by 39.77%, from $1,705,098,055.50 to $1,026,944,185.50. This finding confirms that simpler models remain competitive on imbalanced datasets when supported by appropriate preprocessing and balancing strategies.
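The reported 39.77% reduction follows from the two dollar figures in the abstract as a relative change. The short check below is a sketch of that arithmetic only; the function name `relative_reduction` is hypothetical and the code is not the authors' evaluation procedure.

```python
# Arithmetic check of the LGD reduction quoted in the abstract.
# `relative_reduction` is a hypothetical helper, not part of the study.

def relative_reduction(before: float, after: float) -> float:
    """Percentage by which `after` is lower than `before`."""
    if before <= 0:
        raise ValueError("`before` must be positive")
    return (before - after) / before * 100.0


lgd_before = 1_705_098_055.50  # LGD without the model, from the abstract
lgd_after = 1_026_944_185.50   # LGD with the best model, from the abstract

reduction_pct = relative_reduction(lgd_before, lgd_after)
print(f"{reduction_pct:.2f}%")  # 39.77%
```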