Claim Missing Document
Check
Articles

Found 4 Documents
Search

A Analisis Perbandingan Kinerja Metode Ensemble Bagging dan Boosting pada Klasifikasi Bantuan Subsidi Listrik di Kabupaten/Kota Bogor Cintari, Nanda Putri; Alifviansyah, Kevin; Tsabitah, Dhiya Ulayya; Sartono, Bagus; Firdawanti, Aulia Rizki
The Indonesian Journal of Computer Science Vol. 13 No. 6 (2024): The Indonesian Journal of Computer Science (IJCS)
Publisher : AI Society & STMIK Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33022/ijcs.v13i6.4537

Abstract

The classification of electricity subsidy recipients is an crucial step to ensure that the government's social assistance program is distributed in a targeted manner, so an appropriate analysis method is needed. This research compares the Bagging and Boosting ensemble methods for the classification of households receiving electricity subsidies in Bogor Regency and City using Susenas 2023 data totaling 2002 households. The bagging method uses Random Forest and Extra Trees, while boosting includes CatBoost and LightGBM. The results showed that the Extra Trees method of bagging provided the best performance with 91% accuracy, 95% F1score, and 97% sensitivity. Factors such as ownership of electronic goods and modern facilities, such as ownership of air conditioners, laptops, and televisions are the most significant variables in influencing the classification of electricity subsidy recipients. With high accuracy and minimal bias, this model effectively supports data-driven policies for electricity subsidy distribution. This research is expected to be a strategic recommendation for the government to improve the effectiveness of the electricity subsidy program to be more efficient, well-targeted, and support the improvement of people's welfare.
Identification of Latent Dimensions of Digital Readiness and Typology of Districts/Cities in Indonesia Using PCA and K-Means Clustering Sari, Jefita Resti; Fahira, Fani; Zahra, Latifah; Fitrianto, Anwar; Alifviansyah, Kevin
Journal of Applied Informatics and Computing Vol. 9 No. 6 (2025): December 2025
Publisher : Politeknik Negeri Batam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30871/jaic.v9i6.11487

Abstract

Digital transformation is a key agenda in Indonesia’s national development that requires balanced readiness across regions. However, the level of digital readiness among districts and cities still varies widely, highlighting the need for a typology that can comprehensively describe existing disparities. This study aims to identify the latent dimensions of digital readiness and to develop a regional typology of Indonesian districts/cities using Principal Component Analysis (PCA) and K-Means clustering. The data were obtained from the 2024 Indonesian Digital Society Index (IMDI), which consists of four pillars—Infrastructure and Ecosystem, Digital Skills, Empowerment, and Employment—with ten sub-pillars. PCA reduced these correlated indicators into two main latent components, namely Digital Capacity and Participation and Digital Infrastructure Foundation, which together explain 70.4% of the total variance. Cluster validation using the Silhouette Score and Davies–Bouldin Index (DBI) showed that K = 2 yielded the best internal validity (Silhouette = 0.402; DBI = 0.906), but a three-cluster configuration (K = 3) was adopted to obtain a more interpretable typology of high-, medium-, and low-readiness regions (Silhouette = 0.346; DBI = 1.007). Spatial mapping reveals that high-readiness districts are concentrated in Java, Bali, and parts of Sumatra, whereas low-readiness areas dominate eastern Indonesia. These findings confirm persistent digital inequality across regions and provide a quantitative basis for targeted policy interventions, including infrastructure development, digital literacy programs, and innovation ecosystem strengthening, to support an inclusive digital transformation in Indonesia.
A Study on Multi-Class Topic Prediction for E-commerce Review Data Using Ensemble Learning Alifviansyah, Kevin
CAUCHY: Jurnal Matematika Murni dan Aplikasi Vol 11, No 1 (2026): CAUCHY: JURNAL MATEMATIKA MURNI DAN APLIKASI
Publisher : Mathematics Department, Universitas Islam Negeri Maulana Malik Ibrahim Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.18860/cauchy.v11i1.37941

Abstract

The exponential growth of e-commerce platforms has generated massive volumes of unstruc tured user reviews, necessitating advanced automated analysis methodologies to extract actionable insights for strategic decision-making. This study addresses multi-class text classi f ication challenges by integrating BERTopic-based topic modeling with ensemble learning algorithms to analyze Indonesian e-commerce reviews. A dataset comprising 24,000 customer reviews from Google Play Store underwent systematic preprocessing and topic extraction using BERTopic, yielding eight distinct thematic clusters reflecting application performance, product quality, pricing, delivery logistics, and service reliability. The dataset exhibited severe class imbalance with an imbalance ratio of 65:1, where the dominant class represented 76.02% of instances while minority classes constituted less than 2.12%. Hybrid resampling techniques combining undersampling and oversampling successfully reduced the imbalance ratio to 1.4:1. TF-IDF vectorization transformed preprocessed text into numerical features, followed by supervised classification using CatBoost and Extra Trees classifiers optimized through randomized hyperparameter search with stratified k fold cross-validation. CatBoost demonstrated superior performance, achieving balanced accuracy of 0.829, recall of 0.829, and AUC of 0.965, attributed to its ordered boosting mechanism and capacity for handling categorical and imbalanced data. Independent validation of 2025 data confirmed robust gen eralization with prediction confidence exceeding 0.90, revealing significant temporal evolution in which product-related topics emerged dominant at 70.35%, pricing concerns increased from 6.58% to 16.57%, while application issues decreased from 76.02% to 2.51%. This research establishes a methodologically rigorous framework integrating unsupervised topic discovery with supervised ensemble classification, demonstrating computational efficiency while providing scalable solutions for automated review categorization.
Multivariate Exploration of Food Security in the Sulampua Region Identification of Clusters and Dominant Dimensions of Food Security Saputra, Wawan; Alfiryal, Naufalia; Prasetya, I Putu Gde Inov Bagus; Fitrianto, Anwar; Alifviansyah, Kevin
Journal of Applied Food Technology Vol 12, No 2 (2025)
Publisher : Diponegoro University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17728/jaft.29754

Abstract

Food security is a strategic issue closely related to economic development, community welfare, and the achievement of sustainable development goals. The Food Security Index (FSI) is an important instrument for measuring food security conditions at the provincial and district/city levels. However, FSI performance in Indonesia still shows regional disparities, particularly in Sulawesi, Maluku, and Papua (Sulampua), which tend to have low scores. This study aims to explore patterns of food security and vulnerability in Sulampua through multivariate analysis and regional clustering using K-Means and K-Medoids (PAM) methods. The analysis begins with Principal Component Analysis (PCA) to reduce the dimensionality of FSI indicators and identify dominant factors contributing to data variation. The PCA results show that the first three components explain more than 77% of the variance, with dominant factors including poverty, food expenditure, basic infrastructure access, as well as health and nutrition indicators. The clustering analysis produces two main groups: cluster 1, which includes the majority of districts/cities in Sulawesi and Maluku with relatively better food security, and cluster 2, consisting of 16 districts/cities in Papua with significant food insecurity. Cluster validity evaluation indicates that the K-Medoids method performs better than K-Means, being more robust to outliers and producing more consistent cluster separation. This study contributes to the literature by providing multivariate visual exploration and regional classification based on FSI indicators, which can serve as a basis for formulating more targeted food security policies in the Sulampua region.