Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Jurnal Ilmiah Kursor

Comparison of Feature Extraction in Support Vector Machine (SVM) Based Sentiment Analysis System Rozi, Imam Fahrur; Maulidia, Irma; Hani’ah, Mamluatul; Arianto, Rakhmat; Yunianto, Dika Rizky; Ananta, Ahmadi Yuli
Jurnal Ilmiah Kursor Vol. 13 No. 1 (2025)
Publisher : Universitas Trunojoyo Madura

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21107/kursor.v13i1.417

Abstract

Sentiment analysis plays a crucial role in natural language processing by identifying and categorizing opinions or emotions conveyed in textual data. It is widely applied across diverse fields such as product review analysis, social media monitoring, and market research. To enhance the accuracy and reliability of sentiment classification, various methods and feature extraction techniques have been explored. This study investigates the use of Support Vector Machine (SVM) for sentiment analysis, comparing three feature extraction techniques: Term Frequency-Inverse Document Frequency (TF-IDF), Bag of Words (BoW), and Word2Vec. Our findings indicate that SVM performs effectively with all three feature extraction methods, with TF-IDF yielding the highest accuracy at 0.79. Although the BoW method showed competitive results, it slightly trailed TF-IDF in k-fold validation. Word2Vec, however, exhibited the lowest performance, achieving a maximum accuracy of 0.69. A comparative analysis of accuracy, precision, recall, and F1-score highlight the superiority of TF-IDF in delivering consistent and accurate results. Further statistical analysis using ANOVA revealed no significant differences between the models across any of the evaluation metrics. Additionally, the evaluation was conducted under several scenarios, including tests on balanced and imbalanced datasets, varying dataset sizes, and different CCC parameter values for SVM. These scenarios provided deeper insights into the factors influencing the system's performance, reinforcing that TF-IDF combined with SVM remains the most effective approach in this study.
Co-Authors Adhisuwignjo, Supriatna Aflah, Darin Zahira Agus Zainal Arifin Agus Zainal Arifin Akbar, Syafaat Alif Akbar Fitrawan Alif Akbar Fitrawan, Alif Akbar Ananta, Ahmadi Yuli Annisa Puspa Kirana Annisa Puspa Kirana Ariadi Retno Tri Hayati Ririd Arie Rachmad Syulistyo Arwin Datumaya Wahyudi Sumari Aryo Harto Aryo Harto Astrifidha Rahma Amalia Aziza, Nadia Layra Budi Harijanto, Budi Budiprasetyo, Gunawan Cahya Rahmad Candra Bella Vista Chastine Fatichah Christian Sri Kusuma Aditya Christian Sri Kusuma Aditya Christian Sri kusuma Aditya, Christian Sri kusuma Darin Zahira Aflah Deasy Sandhya E.I. Diana Purwitasari Diana Purwitasari Diana Purwitasari Dika Rizky Yunianto Dikky Rahmad Shafara Dwi Puspitasari Gunawan Budiprasetyo Hayati, Ariadi Retno Iftitah Hidayati Ika Kusumaning Putri Ika Kusumaning Putri Ilham Sinatrio Gumelar Imam Fahrur Rozi Irfan Thalib Alfarid Irsyad Arif Mashudi Komalasari, Nita Kusumaning, Ika Luqman Affandi Luqman Affandi M. Hasyim Ratsanjani Maulidia, Irma Moch Zawaruddin Abdullah Mochammad Hairullah Nita Komalasari Noprianto Noprianto Nurfaidah Nurfaidah Nurfaidah Nurfaidah Pratama, Muhammad Irgy Rahmad, Cahya Rahman, Muhammad Arif Rakhmat Arianto Rokhimatul Wakhidah Septianda Reza Maulana Shoumi, Milyun Ni’ma Sofyan Noor Arief Supriatna Adhisuwignjo Syafaat Akbar Triana Fatmawati Umi Laili Yuhana Vipkas Al Hadid Firdaus Vivi Nur Wijayaningrum Wilda Imama Sabilla Yan Watequlis Syaifudin Yogi Kurniaawan Yogi Kurniaawan, Yogi Yogi Kurniawan Yogi Kurniawan Yoppy Yunhasnawa