Claim Missing Document
Check
Articles

Found 2 Documents
Search

Spam Message Classification Using the Naïve Bayes Algorithm Based on RapidMiner Muhamad Yusup; Mochamad Isham Fadillah; Rifky Adinanta Fauzanie; Risca Lusiana Pratiwi; Rani Irma Handayani; Euis Widanengsih
Journal of Artificial Intelligence and Engineering Applications (JAIEA) Vol. 5 No. 2 (2026): February 2026
Publisher : Yayasan Kita Menulis

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.59934/jaiea.v5i2.1811

Abstract

This study implements the Naïve Bayes algorithm for classifying spam and non-spam (ham) messages using the RapidMiner Studio platform. The dataset used was obtained from the SMS Spam Collection Dataset on the Kaggle platform, which consists of 5,759 messages with a distribution of 4,075 ham messages and 1,291 spam messages. The research stages included text pre-processing, model training, and performance evaluation using accuracy, precision, recall, and F1-score metrics. The experimental results showed that the Naïve Bayes model achieved an accuracy of 89.64% with a precision of 56.93%, a recall of 100%, and an F1-score of 72.56%. The research findings indicate that the Naïve Bayes algorithm is effective in detecting spam messages with adequate accuracy, and prove that RapidMiner is an efficient tool for implementing machine learning methods in text classification.
Clustering Provinces in Indonesia Based on Economic Indicators Using the K-Means Algorithm Ilham Ilyasa; Muhamad Fazri Sugara; Aziiz, Abdul; Rani Irma Handayani; Risca Lusiana Pratiwi; Euis Wida Nengsih
Journal of Artificial Intelligence and Engineering Applications (JAIEA) Vol. 5 No. 2 (2026): February 2026
Publisher : Yayasan Kita Menulis

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.59934/jaiea.v5i2.1812

Abstract

This study aims to analyze and classify the level of economic development in provinces in Indonesia using the K-Means algorithm. The data used includes three main indicators, namely Gross Regional Domestic Product (GRDP) per capita, percentage of poor population, and Human Development Index (HDI) in 2024 obtained from the Central Statistics Agency (BPS). The data was processed through normalization and analysis using the Elbow method to determine the optimal number of clusters. The results were evaluated using the Davies–Bouldin Index (DBI) to assess the level of separation and compactness between clusters. The results show that the most effective division consists of three groups representing high, medium, and low levels of development. Provinces such as DKI Jakarta and Riau are included in the high development cluster, Central Java and South Sulawesi are in the medium cluster, while Papua and East Nusa Tenggara are in the low cluster. These results show that machine learning methods, particularly K-Means, are capable of identifying patterns of regional economic inequality and provide a useful basis for the government in formulating more targeted and equitable development policies.