Email has become an essential communication tool in everyday life. However, the ease of its use is also exploited by irresponsible parties to spread spam. This research aims to implement the Naive Bayes algorithm with Chi-square in classifying spam emails based on words and frequency. The dataset used in this research consists of 153 data. This data was processed using the classification method using the Naive Bayes algorithm with Chi-square through the Knowledge Discovery in Databases (KDD) process. The results show that the accuracy value is 81.00%, the precision value is 100%, the recall value is 65%, and the F1-score value is 79% using Naive Bayes with Chi-square. Furthermore, the evaluation results using the ROC curve show that the AUC value reaches 0.91, which is categorized as very good. This research shows that the Naive Bayes algorithm with Chi-square is successful in classifying spam emails based on words and frequency.
Copyrights © 2025