Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Jurnal Teknik Informatika (JUTIF)

Comparative Performance Evaluation of Linear, Bagging, and Boosting Models Using BorutaSHAP for Software Defect Prediction on NASA MDP Datasets Kartika, Najla Putri; Herteno, Rudy; Budiman, Irwan; Nugrahadi, Dodon Turianto; Abadi, Friska; Ahmad, Umar Ali; Faisal, Mohammad Reza
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 6 (2025): JUTIF Volume 6, Number 6, Desember 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.6.5393

Abstract

Software defect prediction aims to identify potentially defective modules early on in order to improve software reliability and reduce maintenance costs. However, challenges such as high feature dimensions, irrelevant metrics, and class imbalance often reduce the performance of prediction models. This research aims to compare the performance of three classification model groups—linear, bagging, and boosting—combined with the BorutaSHAP feature selection method to improve prediction stability and interpretability. A total of twelve datasets from the NASA Metrics Data Program (MDP) were used as test references. The research stages included data preprocessing, class balancing using the Synthetic Minority Oversampling Technique (SMOTE), feature selection with BorutaSHAP, and model training using five algorithms, namely Logistic Regression, Linear SVC, Random Forest, Extra Trees, and XGBoost. The evaluation was conducted with Stratified 5-Fold Cross-Validation using the F1-score and Area Under the Curve (AUC) metrics. The experimental results showed that tree-based ensemble models provided the most consistent performance, with Extra Trees recording the highest average AUC of 0.794 ± 0.05, followed by Random Forest (0.783 ± 0.06). The XGBoost model provided the best results on the PC4 dataset (AUC = 0.937 ± 0.008), demonstrating its ability to handle complex data patterns. These findings prove that BorutaSHAP is effective in filtering relevant features, improving classification reliability, and strengthening transparency and interpretability in the Explainable Artificial Intelligence (XAI) framework for software quality improvement.
Enhancing Classification of Self-Reported Monkeypox Symptoms on Social Media Using Term Frequency-Inverse Document Frequency Features and Graph Attention Networks Rizian, Rizailo Akfa; Budiman, Irwan; Faisal, Mohammad Reza; Kartini, Dwi; Indriani, Fatma; Ahmad, Umar Ali
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 6 (2025): JUTIF Volume 6, Number 6, Desember 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.6.5482

Abstract

Early detection of infectious diseases plays a crucial role in minimizing their spread and enabling timely intervention. In the digital era, social media has emerged as a valuable source of real-time health information, where individuals often share self-reported symptoms that can serve as early warning signals for disease outbreaks. However, textual data from social media is typically unstructured, noisy, and contextually diverse, posing challenges for conventional text classification methods. This study proposes a hybrid model combining Term Frequency–Inverse Document Frequency (TF-IDF) feature representation with a Graph Attention Network (GAT) to enhance the early detection of Monkeypox-related self-reported symptoms on Indonesian social media. A dataset of 3,200 tweets was collected through Tweet-Harvest and subsequently preprocessed and manually labeled, producing a balanced distribution between positive (51%) and negative (49%) samples. TF-IDF vectors were used to construct a document similarity graph via the k-Nearest Neighbors (k-NN) method with cosine similarity, enabling GAT to leverage both textual and relational information across posts. The model’s performance was evaluated using accuracy, precision, recall, and macro-F1, with macro-F1 serving as the primary indicator. The proposed TF-IDF + GAT model achieved 93.07% accuracy and a macro-F1 score of 93.06%, outperforming baseline classifiers such as CNN (92.16% macro-F1), SVM (85.73%), Logistic Regression (84.89%). These findings demonstrate the effectiveness of integrating classical text representations with graph-based neural architectures for improving social media based disease surveillance and supporting early epidemic response strategies.
Co-Authors Abdurahman, M Ammar Adam Aji Bhuwana Agung Nugroho Jati Agung Wicaksono Ahmad Rif'a Al Muttaqi Al Muttaqi, Ahmad Rif'a Al-muttaqi, Ahmad Rif`a Alam, Rifdo Shah Alexander William Setiawan Putra Aliwarman Tarihoran Almuchlisin Almuchlisin Amin, Miftah Amirul Andi Farmadi Angga Febrian Angga Rusdinar Anggun Fitrian Isnawati Antaufany Puji Rahmadha Ariep Maaulana Yassar Arif, Nuuruddin Hamid Arifin, Hafid Ikhsan Arjunaldi, Ari Asep Mulyana Ashri Dinimaharawati Awaludin, Asif Bagas Wara Rachmat Ramadhan Bangkit Indarmawan Nugroho Budhi Irawan Burhanuddin Dirgantara Burhanuddin Dirgantara Burhanuddin Dirgantoro Casi Setianingsih Danan Jaya, I Putu Yuda Danang Triantoro Murdiansyah Darmawan, Syukri Dedi Gunawan Degas Rinaldo Deni Rustandi Devi Pratami Diana, Firmansyah Dimas Chaidar Dirgantara, Fussy Mentari Dodon Turianto Nugrahadi Dwi Kartini, Dwi Dwiputra S, R Roger Dyah Pramesthi Larasati Eryzebuan, Yon Sigit Fairuz Azmi Fardilla Zardi Putri Fardilla Zardi Putri Fath Muhammad Isham Fatma Indriani Fauzi Sofyan Favian Dewanta Friska Abadi Fussy Mentari Dirgantara Ghiffari, Muhammad Emir Hanu Handriadma Henry Soleman Raubaba, Henry Soleman Herteno, Rudy Hidayatullah, Muhammad Faisal Ika Arum Puspita Ikbal Ramdani Indah Purnamasari Irsyad, Muhammad Arham Irwan Budiman Isham, Fath Muhammad Islam, Mushlih Nur Ivan Ramadhan Ramadhan JAROT BUDIASTO Jati Satria Wicaksana Jati Satria Wicaksana Kartika, Najla Putri Kevin Simangunsong Khadafi, Fadli Iksan Komarudin, Rudy Litasari Widyastuti, Litasari mal, Istik Manggala, Fajar Wira Cakra Mas'ud Adhi Saputra Maulana Akbar Dwijaya Mera Kartika Delimayanti Miftah Amirul Amin Mubarok, Ilham Maulana Mubaroq, Muhammad Raihan Muhammad Arham Irsyad Muhammad Emir Ghiffari Muhammad Ihsan Muhammad Ihsan Muhammad Nasrun Muhammad Reza Faisal, Muhammad Reza Muhammad Rifan Muhammad Solihin Muhammad Zain Imtiyaz Mukarom, Muhammad Alzed Mulki Syahputra Muharram Mustofa, Fahmi Charish Nana Febriana Nasution, Izhar Pinayungan Novianty, Astri Nugrahadi, Dodon Nugraheni, Ratna Astuti Nurul Anggraini Nurul Ikhsan Panji Tresna, Wildan Perdana Erick Oktafianto Prasetyo Yuda Pangestu Pratama, Rama Pratama, Yusup Diva Prayitno Abadi Prayitno Abadi Putra, Aldi Febrian Yuwono Putra, Bagus Ariantama Dewanto R Rogers Dwiputra Setiady Rahadian Nugraha Ramdani, Ikbal Randy Erfa Saputra Ratna Astuti Nugrahaeni resptiawan, reza rendian Reza Ilmi Reza Rendian Septiawan Rifdo Shah Alam Rizaldy, Rizqy Eka Putra Rizian, Rizailo Akfa Robby Reza Rohman, Aditya Rudi Purwo Wijayanto Saputra, Masud Adhi Septian Budi Asmara Setiady, R Rogers Dwiputra Setiady, R. Rogers Dwiputra Sofyan, Fauzi Sugiarto, Iyon Titok Sugirman, Gian Nugraha Sumbung, Frederik H Syafril, Tm. Alvian Syaiful Bahri , Muhamad Naufal Syazwani, Savira Denisa Sya’bani, Taufiq Salman Taufik Ismail Tengku Ahmad Riza Utsman Al Aydarus Wahyu Pamungkas Wahyu Pamungkas Wibowo, Muhammad Rizky Wicaksana, Jati Satria Wildan Panji Tresna Xaverius, Fransiskus Yon Sigit Eryzebuan Yuliantho Mardiansyah Yuniar Pristyan Chandra Yusup Diva Pratama ZK Abdurahman Baizal