Claim Missing Document
Check
Articles

Found 2 Documents
Search

Comparison of Methods for Applying the Data Mining Clustering Concept to Banten Provincial Government Poverty Data: Systematic Review Yuningsih, Irma; Dwi Aryani, Mina Winawati; Imelda, Imelda
Jurnal Mamangan Vol 13, No 1 (2024): Jurnal Ilmu Sosial Mamangan Accredited 2 (SK Dirjen Ristek Dikti No. 0173/C3/DT
Publisher : LPPM Universitas PGRI Sumatera Barat

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22202/mamangan.v13i1.8074

Abstract

An in-depth understanding of the phenomenon of poverty in Banten Province is the key to formulating effective strategies in overcoming this problem. In general, poverty in this area is caused by several interrelated and complex factors. This research provides a comparative analysis of the application of clustering data mining methods in calculating population poverty data in certain regions or provinces, especially Banten. This research aims to assess the consistency and effectiveness of various clustering methods used in previous research. Using a qualitative approach with a literature review, secondary data from relevant research was collected through systematic searching, reading, and note-taking. The data collected is then carried out through a process of data collection, filtering, presentation and drawing conclusions. The results show that most of the literature reviews show similar results regarding the effectiveness of various clustering methods in analyzing poverty data in Banten. This shows that there is a consensus among previous studies regarding the efficacy of clustering data mining techniques in overcoming problems related to poverty in the region. These findings contribute to a deeper understanding of the methodology used in poverty analysis and provide insights for policy makers and researchers to develop more effective strategies in poverty alleviation efforts. The conclusion of this research is that clustering data mining in analyzing poverty data in Banten Province has shown quite high consistency and effectiveness. Various literature studies that have been reviewed show that various clustering methods that have been applied previously provide similar results in identifying poverty patterns, understanding socio-economic structures, and providing valuable insights for developing more targeted policies.
Comparison of Geothermal Well Productivity Using KNN, SVM and Gradient Boost Methods Dwi Aryani, Mina Winawati; Abdullah, Indra Nugraha
Building of Informatics, Technology and Science (BITS) Vol 7 No 4 (2026): March 2026
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i4.9302

Abstract

Manual and conventional processing of geothermal well production data is computationally inefficient and requires several hours to days to generate productivity assessments, particularly when dealing with large-scale and non-linear operational datasets. The complexity of geothermal production parameters this paper such as wellhead pressure (WHP), enthalpy, steam flow, brine flow, total flow, and generated power this paper creates challenges for accurate and timely productivity classification at the well level. This study utilizes 74,912 daily production records collected from January 2018 to June 2024, comprising 13 operational and production-related attributes. The objective is to identify the most effective machine learning algorithm for classifying geothermal well productivity levels to support faster and more reliable operational decision-making. A comparative machine learning classification approach was conducted using K-Nearest Neighbors (K-NN), Support Vector Machine (SVM), and Gradient Boosting. Model evaluation was performed using three train–test split ratios: 70:30, 80:20, and 90:10. Two modelling scenarios were implemented: with and without Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance. The results indicate that the K-NN model achieved the highest classification performance, reaching 94.22% accuracy using the 90:10 split ratio without SMOTE. Gradient Boosting demonstrated stable performance across all ratios, with its best accuracy of 91.39% at the 70:30 split without SMOTE. In contrast, SVM produced the lowest performance, with a maximum accuracy of 79.78% at the 90:10 ratio without SMOTE. The application of SMOTE improved minority class recall, particularly for SVM, but generally reduced overall model accuracy for K-NN and Gradient Boosting. These findings demonstrate that classical machine learning algorithms, particularly K-NN, provide an efficient and accurate solution for geothermal well productivity classification. The proposed approach significantly reduces processing time compared to conventional analytical methods and supports data-driven decision-making in geothermal production forecasting and development planning.