PERBANDINGAN K-MEDOIDS DAN CLARA (Clustering Large Application) PADA DATA POPULASI TERNAK DI INDONESIA [Comparison of K-Medoids and CLARA (Clustering Large Applications) on Livestock Population Data in Indonesia]
Ardhani, Rizky; Marshelle, Sean; Fitrianto, Anwar; Erfiani, Erfiani; Jumansyah, L. M. Risman Dwi
Jurnal Lebesgue : Jurnal Ilmiah Pendidikan Matematika, Matematika dan Statistika Vol. 5 No. 3 (2024)
Publisher : LPPM Universitas Bina Bangsa

DOI: 10.46306/lb.v5i3.764

Abstract

This study compares the K-Medoids and CLARA (Clustering Large Applications) methods on livestock population data for Indonesian districts and cities. K-Medoids clusters data around representative data points (medoids), assigning each object to the medoid nearest to it by distance. CLARA extends the K-Medoids approach to larger datasets by drawing several samples from the data, clustering each, and comparing the results. The analysis finds that the ideal number of clusters is three for CLARA and two for K-Medoids. The evaluation metrics are the Silhouette Score (SS), the Davies-Bouldin Index (DBI), and the Calinski-Harabasz Index (CHI). CLARA has an SS value of 0.66, against 0.62 for K-Medoids. The DBI values for CLARA and K-Medoids are 1.38 and 1.92, respectively, and the CHI values are 197.54 and 132.73. CLARA's SS value is closer to 1 and its CHI value is higher, both of which indicate better-separated clusters; its DBI value is lower, and a lower DBI denotes better performance. According to these findings, CLARA is the more effective method for cluster analysis of livestock population data in Indonesian districts and cities.
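The relationship between the two methods and the Silhouette Score can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the data are synthetic two-dimensional points rather than livestock counts, the K-Medoids routine is a simple alternating (assign, then re-pick medoids) variant rather than full PAM, and all function names, sample sizes, and cluster counts are assumptions chosen for illustration. Only the SS is computed; DBI and CHI are left out for brevity.

```python
import random
from math import dist

def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(dist(p, m) for m in medoids) for p in points)

def assign(points, medoids):
    """Label each point with the index of its nearest medoid."""
    return [min(range(len(medoids)), key=lambda j: dist(p, medoids[j]))
            for p in points]

def k_medoids(points, k, rng, max_iter=50):
    """Alternating k-medoids (a simple variant, not full PAM)."""
    medoids = rng.sample(points, k)
    for _ in range(max_iter):
        labels = assign(points, medoids)
        new_medoids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # New medoid: the member with the smallest total distance to
            # the other members; keep the old one if the cluster is empty.
            new_medoids.append(
                min(members, key=lambda c: sum(dist(c, q) for q in members))
                if members else medoids[j])
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids

def clara(points, k, rng, n_samples=5, sample_size=40):
    """CLARA: run k-medoids on random samples and keep the medoid set
    with the lowest cost measured on the full dataset."""
    best_cost, best_medoids = None, None
    for _ in range(n_samples):
        sample = rng.sample(points, min(sample_size, len(points)))
        medoids = k_medoids(sample, k, rng)
        cost = total_cost(points, medoids)
        if best_cost is None or cost < best_cost:
            best_cost, best_medoids = cost, medoids
    return best_medoids

def silhouette(points, labels):
    """Mean silhouette coefficient; values closer to 1 are better."""
    scores = []
    for i, p in enumerate(points):
        by_cluster = {}
        for j, q in enumerate(points):
            if i != j:
                by_cluster.setdefault(labels[j], []).append(dist(p, q))
        own = by_cluster.pop(labels[i], None)
        if not own or not by_cluster:
            scores.append(0.0)  # singleton cluster or a single cluster
            continue
        a = sum(own) / len(own)                                # cohesion
        b = min(sum(d) / len(d) for d in by_cluster.values())  # separation
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

# Synthetic data (an assumption): three Gaussian blobs of 60 points each.
rng = random.Random(0)
centers = [(0, 0), (10, 10), (0, 10)]
points = [(rng.gauss(cx, 1), rng.gauss(cy, 1))
          for cx, cy in centers for _ in range(60)]

km_medoids = k_medoids(points, 3, rng)
clara_medoids = clara(points, 3, rng)
km_ss = silhouette(points, assign(points, km_medoids))
clara_ss = silhouette(points, assign(points, clara_medoids))
print(f"K-Medoids SS: {km_ss:.2f}  CLARA SS: {clara_ss:.2f}")
```

Because CLARA only searches for medoids inside each sample, its cost per run depends on the sample size rather than the full dataset size, which is what makes it practical for large datasets.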
Feature selection in supervised machine learning: a case study of poverty dataset in West Java, Indonesia
Marshelle, Sean; Rahardiantoro, Septian; Kurnia, Anang
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 15, No 1: February 2026
Publisher : Institute of Advanced Engineering and Science

DOI: 10.11591/ijai.v15.i1.pp524-535

Abstract

West Java, one of the largest provinces in Indonesia with a population exceeding 50 million, reported a poverty rate of 7.62% in 2023. Data from the national socio-economic survey or survei sosial ekonomi nasional (SUSENAS) show that poverty is multidimensional, encompassing aspects of employment, education, sanitation, housing, food security, technology, and government assistance. Addressing this complexity requires identifying the most influential factors that determine household welfare. This study applies and compares three feature selection approaches—filter, wrapper, and embedded—to the SUSENAS dataset to evaluate their effectiveness in identifying key poverty determinants. By prioritizing variables with the strongest predictive power, the study provides an evidence-based framework for more efficient and targeted poverty alleviation strategies. Results indicate that the information filter method combined with random forest (RF) and the least absolute shrinkage and selection operator (LASSO) embedded method combined with logistic regression (LR) deliver the best performance, improving model accuracy while reducing more than 65% of irrelevant features. The selected indicators highlight critical sectors such as food security, housing, and access to technology, which can serve as short-term policy priorities. In the long term, broader interventions in education, employment, sanitation, and government support are recommended. These findings demonstrate how data-driven feature selection can guide effective policy design for reducing poverty in West Java.
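As an illustration of the filter approach, the sketch below ranks categorical features by information gain with respect to a binary poverty label and keeps the top-ranked ones. It assumes that the "information filter method" in the abstract refers to an information-gain criterion, which the abstract does not confirm. The feature names, the toy records, and the `select_top` helper are hypothetical and are not taken from the SUSENAS dataset; the wrapper and embedded (LASSO) approaches are not shown.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(feature, labels):
    """Reduction in label entropy after splitting on a categorical feature."""
    n = len(labels)
    groups = {}
    for value, y in zip(feature, labels):
        groups.setdefault(value, []).append(y)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional

def select_top(features, labels, keep):
    """Filter selection: rank features by information gain, keep the best."""
    ranked = sorted(features,
                    key=lambda name: information_gain(features[name], labels),
                    reverse=True)
    return ranked[:keep]

# Hypothetical toy data: 8 households, label 1 = poor, 0 = not poor.
poor = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "has_internet": [0, 0, 0, 0, 1, 1, 1, 1],  # perfectly separates the label
    "owns_house":   [0, 0, 0, 1, 1, 1, 1, 0],  # partially informative
    "region_code":  [0, 1, 0, 1, 0, 1, 0, 1],  # uninformative
}

selected = select_top(features, poor, keep=2)
print(selected)
```

A filter method such as this scores each feature independently of any model, whereas wrapper methods evaluate feature subsets by training a model on them, and embedded methods such as LASSO perform the selection during model fitting.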