P-Index (2021 - 2026): 5.626
This author has published in the following journals:
International Journal of Electrical and Computer Engineering; Jurnal Sistem Komputer; Bulletin of Electrical Engineering and Informatics; Jurnal Informatika; Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI); Telematika : Jurnal Informatika dan Teknologi Informasi; Sinergi; Jurnal Teknologi Informasi dan Ilmu Komputer; JUITA : Jurnal Informatika; International Journal of Advances in Intelligent Informatics; Seminar Nasional Informatika (SEMNASIF); Register: Jurnal Ilmiah Teknologi Sistem Informasi; JURNAL NASIONAL TEKNIK ELEKTRO; Jurnal Teknologi dan Sistem Komputer; Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi); JIKO (Jurnal Informatika dan Komputer); Jurnal Sisfokom (Sistem Informasi dan Komputer); ILKOM Jurnal Ilmiah; Compiler; MATRIK : Jurnal Manajemen, Teknik Informatika, dan Rekayasa Komputer; Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI); GERVASI: Jurnal Pengabdian kepada Masyarakat; Systemic: Information System and Informatics Journal; Journal of Information Systems and Informatics; Buletin Ilmiah Sarjana Teknik Elektro; International Journal of Engineering, Technology and Natural Sciences (IJETS); Indonesian Journal of Electrical Engineering and Computer Science; International Journal of Advances in Data and Information Systems; Journal of Innovation Information Technology and Application (JINITA); Science in Information Technology Letters; Paradigma; Masyarakat Berkarya: Jurnal Pengabdian dan Perubahan Sosial; JuTISI (Jurnal Teknik Informatika dan Sistem Informasi)
Journal : Paradigma

Comparative Analysis of Email Spam Detection Using SVM with TF-IDF and Word2Vec on Multilingual Datasets
Katamsyi, Kaifa Ahlal; Akbar, Ahmad Taufiq; Nurkholis, Andi; Prapcoyo, Hari; Akbar, Bagus Muhammad; Saifullah, Shoffan
Paradigma - Jurnal Komputer dan Informatika Vol. 28 No. 1 (2026): March 2026 Period
Publisher : LPPM Universitas Bina Sarana Informatika

DOI: 10.31294/p.v28i1.12339

Abstract

The rapid growth of email communication has increased the prevalence of spam emails, which can disrupt productivity and compromise information security. This study presents a comparative analysis of two text representation methods—TF-IDF and Word2Vec—for spam email classification using a Support Vector Machine (SVM) with a Radial Basis Function kernel. The experiments utilized Indonesian and English email datasets totaling 5,421 emails, split into 75% training and 25% testing sets. Two scenarios were evaluated: baseline with default parameters and after hyperparameter optimization using Grid Search combined with K-Fold Cross Validation. The results indicate that TF-IDF consistently outperformed Word2Vec across both languages, achieving the highest accuracy of 0.9562 on the English dataset after tuning. Word2Vec showed substantial improvement following parameter adjustment, reducing the performance gap with TF-IDF. The findings highlight the importance of hyperparameter optimization for enhancing the quality of feature representations and improving classification performance. This study also demonstrates that TF-IDF provides more stable results across different linguistic contexts, while Word2Vec benefits significantly from careful tuning. The results provide practical insights for implementing efficient spam email detection systems in multilingual environments. Future research could explore additional classifiers, deep learning approaches, and contextual embeddings to further improve classification accuracy and robustness.
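To make the TF-IDF representation named in the abstract concrete, here is a minimal sketch in plain Python. It uses the unsmoothed textbook form (term frequency times ln(N / document frequency)); the abstract does not say which variant or library the authors used, so the function names `tokenize` and `tfidf` and the toy emails are illustrative assumptions, not the study's data or code.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace only.
    return text.lower().split()


def tfidf(corpus):
    """Return one {term: weight} dict per document.

    tf  = term count / number of tokens in the document
    idf = ln(N / df), where df is the number of documents containing the term
    (unsmoothed textbook form; an assumption, not the paper's stated setup).
    """
    docs = [tokenize(d) for d in corpus]
    n_docs = len(docs)

    # Document frequency: count each term at most once per document.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    vectors = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return vectors


# Toy emails, invented for illustration only.
emails = [
    "win a free prize now",
    "meeting agenda for monday",
    "free prize claim now",
    "project meeting notes",
]
vectors = tfidf(emails)
```

In this toy corpus, "win" appears in one email and "free" in two, so "win" receives the larger weight in the first email. A term that occurs in every document gets weight zero. Library implementations can differ from this form; scikit-learn's `TfidfVectorizer`, for example, applies idf smoothing and L2 normalization by default, so its numbers will not match this sketch.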
Co-Authors: Abdul Fadlil; Adityo Nugroho, Adityo; Afiqa, Nurul; Agus Sasmito Aribowo; Ahmad Taufiq Akbar; Ahmad Tri Hidayat; Aji Prasetya Wibawa; Akbar, Ahmad Taufiq; Akbar, Bagus Muhammad; Alek Setiyo Nugroho; Alfiani, Oktavia Dewi; Alin Khaliduzzaman; Alisya Amalia Putri Hasanah; Andi Muhammad Dirham Dewantara; Andi Nurkholis; Andiko Putro Suryotomo; Andri Pranolo; Anton Satria Prabuwono; Anton Yudhana; Arianti, Berliana Andra; Arief Hermawan; Awang Hendrianto Pratomo; Azlan, Faris Farhan; Azrul Mahfurdz; Bambang Yuwono; Betty Yel, Mesra; Budi Santosa; Devia, Elmi; Dharmawan, Tio; Dreżewski, Rafał; Drezewski, Rafal; Drezewski, Rafał; Dwi Wahyuningrum; Dwiyanto, Felix Andika; Faqihuddin Al-anshori; Ghazali, Ahmad Badaruddin; Haekal, Haekal; Herlina Jayadianti; Heru Cahya Rustamaji; Hidayat, Ahmad Tri; Humairoh, Nanda Lailatul; Ismail, Amelia Ritahani; Isna Nur Aini; Ivana Puspita Sari; Japkowicz, Nathalie; Judanti Cahyaning; Junaidi Junaidi; Kaswijanti, Wilis; Katamsyi, Kaifa Ahlal; Khaliduzzaman, Alin; Kusuma, M. Apriandi; Lean Karlo Tolentino; Luh Putu Ratna Sundari; Mubarak, Zulfikar Yusya; Muhammad Nur Hendra Alvianto; Nathalie Japkowicz; Nisa, Syed Qamrun; Noormaizan, Khairul Akmal; Nur Heri Cahyana; Nuril Anwar, Nuril; Nuryana, Zalik; Opi Irawansah, Opi; Prapcoyo, Hari; Putra, Agung Bella Utama; Putra, Seno Aji; Rabbimov Ilyos; Rabbimov, Ilyos; Rafal Drezewski; Rochmat Husaini; Rustamadji, Heru; Saidah, Andi; Santosa, Budi; Satya Ghifari Adipratama; Seno Aji Putra; Suhirman; SUHIRMAN SUHIRMAN; Sularso; Sularso, Sularso; Sunardi -; Sunardi Sunardi; Sunardi, Sunardi; Taufiq Akbar, Ahmad; Tri Andi, Tri; Tundo, Tundo; Tuti Purwaningsih, Tuti; Utomo, Agung Tri; Wahyu Adjie Saputra; Wilis Kaswidjanti; Wilis Kaswijanti; Yuhefizar Yuhefizar; Yuli Fauziah; Yuli Fauziyah