Claim Missing Document
Check
Articles

Found 1 Documents
Search

Optimizing Email Spam Classification Using Naïve Bayes and Principal Component Analysis Shinta Virgiana; Rudi Kurniawan; Tati Suprapti
Journal of Artificial Intelligence and Engineering Applications (JAIEA) Vol. 4 No. 2 (2025): February 2025
Publisher : Yayasan Kita Menulis

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.59934/jaiea.v4i2.803

Abstract

In the ever-evolving digital era, email spam filtering is an important challenge to maintain the security and comfort of email services. The Naïve Bayes algorithm is widely used for spam email classification because of its ability to manage large data, although there are still limitations in terms of accuracy, precision and recall. This research aims to improve spam email classification performance by combining Naïve Bayes and Principal Component Analysis (PCA) to optimize model accuracy and explore optimal parameters in the reduction dimension. The research methodology goes through the Knowledge Discovery in Database (KDD) stages which include selection, preprocessing, transformation using PCA, development of a classification model using Naïve Bayes, and evaluation of model performance. The dataset used consists of emails categorized as spam and non-spam. The experimental results show that the combination of Naïve Bayes and PCA achieves the highest accuracy of 99.24% with 7 principal components. The fixed number of components approach shows better performance compared to preserving variance, emphasizing the importance of selecting appropriate PCA parameters in improving the effectiveness of model classification. This research shows that PCA not only reduces the complexity of the dataset but also increases the efficiency of the classification algorithm.