Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Vol 8 No 6 (2024): December 2024

Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine

Yanuargi, Bayu (Unknown)
Ema Utami (Unknown)
Kusrini (Unknown)
Parikesit, Arli Aditya (Unknown)



Article Info

Publish Date
28 Dec 2024

Abstract

Visitor reviews play a crucial role in determining the success of a business, particularly those offering hospitality and services, such as hotels. The growth of internet technology has made it easier for guests to share their experiences, which can influence potential customers. Google Maps is one of the platforms used for giving and searching reviews This research uses data crawled from Google Maps Review using the playwright library. However, the large volume of reviews can make analysis and topic-based categorization—such as service quality, hotel location, and operational hours—challenging. To address this, DBSCAN is used to cluster reviews based on these topics. Clustering helps improve sentiment classification, making it more targeted and allowing a comparison of two machine learning algorithms: Naïve Bayes and Support Vector Machine (SVM). Naïve Bayes achieved higher accuracy (0.87) in the operational hours cluster, while SVM scored 0.78. However, SVM showed improved accuracy in the location (0.89) and service (0.88) clusters, with Naïve Bayes maintaining a stable 0.86 accuracy in both. Both models demonstrated an average training time of less than one second, excluding preprocessing.

Copyrights © 2024






Journal Info

Abbrev

RESTI

Publisher

Subject

Computer Science & IT Engineering

Description

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) dimaksudkan sebagai media kajian ilmiah hasil penelitian, pemikiran dan kajian analisis-kritis mengenai penelitian Rekayasa Sistem, Teknik Informatika/Teknologi Informasi, Manajemen Informatika dan Sistem Informasi. Sebagai bagian dari semangat ...