Jurnal Nasional Teknologi Informasi dan Aplikasinya
Vol. 1 No. 4 (2023): JNATIA Vol. 1, No. 4, August 2023

Spam Text Classification with the Support Vector Machine Algorithm and Chi-Square - Table Revision

Getzbie Alfredo Tpoy
Agus Muliantara



Article Info

Publish Date
01 Aug 2023

Abstract

Spam messages are messages that contain false information, commonly concerning events, banking, insurance, bills, advertisements, and viruses. One way to address spam is to classify incoming messages, separating spam texts from legitimate (ham) texts. In this study, spam text classification was performed using the Support Vector Machine algorithm with Chi-Square feature selection. Chi-Square feature selection was applied at percentages of 20%, 40%, 60%, and 80%, and performance was measured by accuracy, precision, recall, and F1-Score. The best result was an accuracy of 98.82% with an F1-Score of 93.05%, obtained at a feature selection percentage of 60% using the RBF kernel. Feature selection at 20%, 40%, and 80% yielded accuracies of 97.93%, 98.29%, and 98.02%, respectively. All of these exceed the accuracy without feature selection, which was 97.57%.
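The feature selection step described in the abstract can be illustrated with a minimal sketch. It assumes the common 2x2 document-frequency form of the Chi-Square statistic (term presence versus class); the abstract does not state which variant the authors used. The toy corpus, the whitespace tokenizer, and the function names `chi_square_scores` and `select_top_percent` are hypothetical and are not taken from the paper. The SVM training step itself is not shown.

```python
import math
from collections import Counter


def chi_square_scores(docs, labels, positive="spam"):
    """Score each term with the 2x2 chi-square statistic (term presence vs. class)."""
    n = len(docs)
    n_pos = sum(1 for y in labels if y == positive)
    n_neg = n - n_pos
    pos_df, neg_df = Counter(), Counter()  # document frequencies per class
    for text, y in zip(docs, labels):
        terms = set(text.lower().split())  # assumption: naive whitespace tokenizer
        (pos_df if y == positive else neg_df).update(terms)

    scores = {}
    for term in set(pos_df) | set(neg_df):
        a = pos_df[term]   # positive-class docs containing the term
        b = neg_df[term]   # negative-class docs containing the term
        c = n_pos - a      # positive-class docs without the term
        d = n_neg - b      # negative-class docs without the term
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def select_top_percent(scores, percent):
    """Keep the highest-scoring `percent` of terms (ties broken alphabetically)."""
    k = max(1, math.ceil(len(scores) * percent / 100))
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


# Hypothetical toy corpus, for illustration only (not the paper's dataset).
docs = [
    "win free prize now",
    "free prize claim now",
    "meeting at noon today",
    "lunch at noon tomorrow",
]
labels = ["spam", "spam", "ham", "ham"]

scores = chi_square_scores(docs, labels)
for pct in (20, 40, 60, 80):  # the percentages evaluated in the study
    print(pct, select_top_percent(scores, pct))
```

In a full pipeline the retained terms would define the vocabulary used to vectorize each message before training the classifier, and the comparison across percentages would be repeated on held-out data.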

Copyrights © 2023






Journal Info

Abbrev

jnatia

Publisher

Subject

Computer Science & IT Engineering

Description

JNATIA (Jurnal Nasional Teknologi Informasi dan Aplikasinya) is a journal that focuses on the theory, practice, and methodology of all aspects of technology in the fields of computer science, informatics, and engineering, as well as productive and innovative ideas related to new technologies and information technology. This journal contains ...