JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika)
Vol 10, No 2 (2025)

Effectiveness of Word2Vec and TF-IDF in Sentiment Classification on Online Investment Platforms Using Support Vector Machine

Rifaldy, Fadil
Sibaroni, Yuliant
Prasetiyowati, Sri Suryani



Article Info

Publish Date
05 Mar 2025

Abstract

Investing is increasingly popular in Indonesia, especially among millennials, and instruments such as deposits, gold, stocks, and online investment applications are in growing demand. This research focuses on sentiment classification of user reviews of the Nanovest online investment application on the Google Play Store using the Support Vector Machine (SVM) method. SVM is used because it can classify opinions into positive and negative sentiment classes with good accuracy. The study evaluates the effectiveness of three feature extraction approaches: Word2Vec, which converts the words in a text into numerical vectors; TF-IDF, which produces high-dimensional word weights; and a combined TF-IDF Weighted Word2Vec feature intended to produce richer vector representations. Tests were conducted using four SVM kernels: Linear, Polynomial, RBF, and Sigmoid. The results show that Word2Vec with the RBF kernel and a vector size of 300 produces the highest accuracy, 95.46%, and that the TF-IDF Weighted Word2Vec combination also performs well, with 95.29% accuracy on the RBF kernel. TF-IDF alone produced the lowest accuracy, 93.31%, on the Sigmoid kernel. These results indicate that Word2Vec and the combined feature extraction method are more effective than TF-IDF for sentiment classification.
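The combined TF-IDF Weighted Word2Vec feature mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common formulation in which a document vector is the TF-IDF-weighted average of its word vectors; the abstract does not give the authors' exact formula, so the smoothed IDF used here is an assumption. The function names `tfidf_weights` and `weighted_doc_vector`, the toy two-dimensional embeddings, and the sample reviews are all hypothetical stand-ins, not the study's data or a trained Word2Vec model.

```python
import math
from collections import Counter


def tfidf_weights(docs):
    """Return one {token: tf-idf weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each token.
    df = Counter(tok for doc in docs for tok in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({
            # Relative term frequency times a smoothed IDF (assumed variant).
            tok: (count / len(doc)) * (math.log((1 + n_docs) / (1 + df[tok])) + 1)
            for tok, count in tf.items()
        })
    return weights


def weighted_doc_vector(doc_weights, embeddings, dim):
    """TF-IDF-weighted average of the word vectors of one document."""
    total = [0.0] * dim
    weight_sum = 0.0
    for tok, w in doc_weights.items():
        vec = embeddings.get(tok)
        if vec is None:  # skip tokens that have no embedding
            continue
        for i in range(dim):
            total[i] += w * vec[i]
        weight_sum += w
    if weight_sum == 0.0:
        return total  # all-zero vector when no token is in the vocabulary
    return [x / weight_sum for x in total]


# Made-up 2-dimensional vectors standing in for trained Word2Vec embeddings.
toy_embeddings = {
    "good": [1.0, 0.0],
    "bad": [-1.0, 0.0],
    "app": [0.0, 1.0],
}
# Made-up tokenized reviews, not taken from the study's dataset.
reviews = [["good", "app"], ["bad", "app"], ["good", "good", "app"]]

weights = tfidf_weights(reviews)
features = [weighted_doc_vector(w, toy_embeddings, dim=2) for w in weights]
```

In this sketch the word "app" occurs in every review, so it receives a lower weight than "good" or "bad" and contributes less to each document vector. Feature vectors of this kind could then be passed to an SVM classifier, for example scikit-learn's `SVC` with `kernel="rbf"`; that pairing is an illustration, not a description of the authors' pipeline.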

Copyrights © 2025






Journal Info

Subject

Computer Science & IT Education

Description

JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), e-ISSN 2540-8984, was established to publish scientific work in the form of research articles or papers, particularly in the field of Information Technology. JIPI is a journal that is managed by the ...