JOURNAL OF APPLIED INFORMATICS AND COMPUTING
Vol. 9 No. 5 (2025): October 2025

Comparison of Text Vectorization Methods for IMDB Movie Review Sentiment Analysis Using SVM

Mulyawan, Rifqi
Naparin, Husni
Fatihia, Wifda Muna



Article Info

Publish Date
06 Oct 2025

Abstract

Sentiment analysis is a Machine Learning task that classifies opinions expressed in text. IMDb is a platform where moviegoers worldwide find information and share their views, and audience reactions there often serve as a benchmark for a movie’s success. This study classifies reviews as positive or negative and evaluates the effectiveness of a Support Vector Machine (SVM) with four feature representation methods: (a) Bag of Words (BoW), (b) TF-IDF, (c) Word2Vec, and (d) Doc2Vec. After the textual data were preprocessed, each method was used to extract features for model training. The experimental results show that SVM with Word2Vec achieved the best overall performance, with an F1-Score of 0.8607 and an Accuracy of 0.8607, and also had the shortest training time (75.0 s). By comparison, BoW reached an F1-Score of 0.8219, TF-IDF 0.8520, and Doc2Vec 0.8440. Within this study, Word2Vec therefore provided the most effective feature representation for SVM-based sentiment classification.
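The comparison described in the abstract can be illustrated with a minimal, standard-library-only Python sketch. It is not the authors' implementation: the abstract does not specify the SVM kernel, hyperparameters, or preprocessing, so everything here is an assumption. That includes the whitespace tokenizer, the smoothed IDF formula, the linear SVM trained by subgradient descent on the hinge loss, the toy reviews, and all function names. Only BoW and TF-IDF are shown; Word2Vec and Doc2Vec need trained embedding models and are omitted.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumed preprocessing: lowercase and split on whitespace.
    return text.lower().split()

def bow_features(docs):
    # Bag of Words: raw term counts per document, stored as sparse dicts.
    return [dict(Counter(tokenize(d))) for d in docs]

def fit_idf(train_docs):
    # Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
    n = len(train_docs)
    doc_freq = Counter(tok for d in train_docs for tok in set(tokenize(d)))
    return {tok: math.log((1 + n) / (1 + df)) + 1.0 for tok, df in doc_freq.items()}

def tfidf_features(docs, idf):
    # TF-IDF with L2 normalization; tokens unseen during fitting are dropped.
    feats = []
    for d in docs:
        vec = {tok: cnt * idf[tok]
               for tok, cnt in Counter(tokenize(d)).items() if tok in idf}
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        feats.append({tok: v / norm for tok, v in vec.items()})
    return feats

def score(w, x):
    # Linear decision value w . x for a sparse feature dict (no bias term).
    return sum(w.get(tok, 0.0) * v for tok, v in x.items())

def train_linear_svm(X, y, lr=0.1, lam=0.01, epochs=50):
    # Deterministic subgradient descent on the L2-regularized hinge loss.
    # Labels must be +1 or -1. lr, lam, and epochs are illustrative values.
    w = {}
    for _ in range(epochs):
        for x, label in zip(X, y):
            violated = label * score(w, x) < 1.0
            for tok in w:                       # regularization shrinkage
                w[tok] *= 1.0 - lr * lam
            if violated:                        # hinge-loss subgradient step
                for tok, v in x.items():
                    w[tok] = w.get(tok, 0.0) + lr * label * v
    return w

def predict(w, X):
    return [1 if score(w, x) >= 0 else -1 for x in X]

def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f1_score(y_true, y_pred, positive=1):
    # Binary F1 for the positive class.
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Toy data standing in for IMDb reviews (hypothetical, not from the study).
train_docs = ["good movie", "great film", "bad movie", "awful film"]
train_labels = [1, 1, -1, -1]
test_docs = ["good film", "bad film"]
test_labels = [1, -1]

idf = fit_idf(train_docs)
featurizers = {
    "BoW": bow_features,
    "TF-IDF": lambda docs: tfidf_features(docs, idf),
}
results = {}
for name, featurize in featurizers.items():
    w = train_linear_svm(featurize(train_docs), train_labels)
    pred = predict(w, featurize(test_docs))
    results[name] = (accuracy(test_labels, pred), f1_score(test_labels, pred))
    print(name, "accuracy:", results[name][0], "F1:", results[name][1])
```

Holding the classifier fixed and swapping only the feature function mirrors the study's design, so any difference in scores comes from the representation. The IDF weights are fitted on the training documents only, so the held-out reviews do not influence the features.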

Copyright © 2025






Journal Info

Abbrev

JAIC

Subject

Computer Science & IT

Description

Journal of Applied Informatics and Computing (JAIC) Volume 2, Number 1, July 2018. Contains papers drawn from research in the field of Informatics Technology and Applied Computing, with e-ISSN: 2548-9828. There are 3 articles that have been substantively reviewed by the editorial team and ...