Garuda - Garba Rujukan Digital

P-Index (2021 - 2026): 0.23

Journals this author has published in: CoreID Journal

Dinah, Azalia Fathimah

Unknown Affiliation

Author-ID : 9719850

Computer Science & IT

Published: 1 document

Articles

Short Message Spam Classification using Decision Tree, Naive Bayes, and Logistic Regression
Aulia, Citra; Dinah, Azalia Fathimah; Zahratunnisa, Dzilan Nazira; Efendi, Rofik
CoreID Journal Vol. 3 No. 3 (2025): November 2025
Publisher : CV. Generasi Intelektual Digital

DOI: 10.60005/coreid.v3i3.146

The increasing use of Short Message Service (SMS) in digital communication has been accompanied by a rise in spam messages, which threaten user convenience and information security. This study presents a comparative analysis of three classical machine learning algorithms—Decision Tree, Naïve Bayes, and Logistic Regression—for SMS spam classification. The research follows the CRISP-DM methodology, including data collection, understanding, preparation, modeling, and evaluation. The dataset used is the SMS Spam Collection (A More Diverse Dataset) from Kaggle, comprising 5,574 SMS messages labeled as spam or ham. Text preprocessing is performed through cleaning operations and feature extraction using the Term Frequency–Inverse Document Frequency (TF-IDF) method. The models are evaluated using accuracy, precision, recall, F1-score, and Area Under the Curve (AUC) metrics. Experimental results indicate that Logistic Regression achieves the most balanced performance, with an accuracy of 97.13%, precision of 99.23%, recall of 80.75%, F1-score of 89.04%, and an AUC of 98.72%. Naïve Bayes demonstrates high efficiency and perfect precision but lower recall, while Decision Tree offers interpretability with comparatively lower classification performance. The results suggest that Logistic Regression is the most suitable model for lightweight and reliable SMS spam detection systems, balancing accuracy and misclassification risk. This study provides practical insights for implementing efficient spam filtering solutions and serves as a reference for future research in text classification and natural language processing, particularly for short-message communication.
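The F1-score quoted in the abstract can be checked against the quoted precision and recall, since F1 is the harmonic mean of the two. The following is a minimal sketch of that arithmetic only; the function name `f1_score` is chosen here for illustration and is not taken from the paper, and the sketch does not reproduce the authors' models or data.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given as fractions in [0, 1])."""
    if precision + recall == 0:
        # Conventionally F1 is taken as 0 when both inputs are 0.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract for Logistic Regression, as fractions.
reported_precision = 0.9923
reported_recall = 0.8075

f1_percent = round(f1_score(reported_precision, reported_recall) * 100, 2)
print(f1_percent)  # 89.04, matching the F1-score reported in the abstract
```

The gap between the high precision and the lower recall is what pulls the F1-score well below the accuracy figure: the harmonic mean is dominated by the smaller of its two inputs.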

Co-Authors: Aulia, Citra; Efendi, Rofik; Zahratunnisa, Dzilan Nazira
