Comparative Analysis of Email Spam Detection Using SVM with TF-IDF and Word2Vec on Multilingual Datasets
Katamsyi, Kaifa Ahlal; Akbar, Ahmad Taufiq; Nurkholis, Andi; Prapcoyo, Hari; Akbar, Bagus Muhammad; Saifullah, Shoffan
Paradigma - Jurnal Komputer dan Informatika Vol. 28 No. 1 (2026): March 2026 Period
Publisher : LPPM Universitas Bina Sarana Informatika

DOI: 10.31294/p.v28i1.12339

Abstract

The rapid growth of email communication has increased the prevalence of spam emails, which can disrupt productivity and compromise information security. This study presents a comparative analysis of two text representation methods—TF-IDF and Word2Vec—for spam email classification using a Support Vector Machine (SVM) with a Radial Basis Function kernel. The experiments utilized Indonesian and English email datasets totaling 5,421 emails, split into 75% training and 25% testing sets. Two scenarios were evaluated: baseline with default parameters and after hyperparameter optimization using Grid Search combined with K-Fold Cross Validation. The results indicate that TF-IDF consistently outperformed Word2Vec across both languages, achieving the highest accuracy of 0.9562 on the English dataset after tuning. Word2Vec showed substantial improvement following parameter adjustment, reducing the performance gap with TF-IDF. The findings highlight the importance of hyperparameter optimization for enhancing the quality of feature representations and improving classification performance. This study also demonstrates that TF-IDF provides more stable results across different linguistic contexts, while Word2Vec benefits significantly from careful tuning. The results provide practical insights for implementing efficient spam email detection systems in multilingual environments. Future research could explore additional classifiers, deep learning approaches, and contextual embeddings to further improve classification accuracy and robustness.
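
To make the TF-IDF side of the comparison concrete, the following is a minimal sketch of how TF-IDF vectors feed a Radial Basis Function kernel, the similarity measure an RBF-kernel SVM works with. It is not the authors' implementation: the function names `tfidf_vectors` and `rbf_kernel`, the smoothed IDF variant, the whitespace tokenizer, the `gamma` value, and the three toy emails are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, L2-normalised TF-IDF vectors) for a list of texts."""
    # Naive lowercase/whitespace tokenisation (an assumption, not the paper's preprocessing).
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed IDF; the abstract does not say which IDF variant was used.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0  # guard against empty documents
        vectors.append([v / norm for v in vec])
    return vocab, vectors


def rbf_kernel(x, y, gamma=1.0):
    """RBF kernel: exp(-gamma * squared Euclidean distance)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


# Hypothetical toy emails: two spam-like messages and one ordinary message.
emails = [
    "win a free prize now",
    "free prize claim now",
    "meeting agenda for monday",
]
vocab, vecs = tfidf_vectors(emails)
sim_spam = rbf_kernel(vecs[0], vecs[1])   # the two spam-like emails share terms
sim_mixed = rbf_kernel(vecs[0], vecs[2])  # no shared terms
print(sim_spam > sim_mixed)
```

In this toy setting the two spam-like emails share several terms, so their kernel value is higher than that of the spam and non-spam pair. The `gamma` parameter here is the kind of hyperparameter that a grid search with K-fold cross-validation, as described in the abstract, would tune.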