Claim Missing Document
Check
Articles

Evaluasi Efektivitas Support Vector Machine dan Random Forest dalam Klasifikasi Ulasan Pengguna Aplikasi Streaming Vidio Fastabiqul Khusna; Khothibul Umam; Siti Nur'aini; Maya Rini Handayani
Jurnal Sistem Informasi Vol. 12 No. 2 (2025)
Publisher : Universitas Serang Raya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30656/jsii.v12i2.10495

Abstract

Perkembangan pesat platform streaming telah menghasilkan banyak ulasan pengguna yang dapat dimanfaatkan sebagai sumber masukan untuk pengembangan aplikasi. Penelitian ini dilakukan untuk mengevaluasi kinerja algoritma Support Vector Machine (SVM) dan Random Forest (RF) dalam mengklasifikasikan sentimen ulasan pengguna terhadap aplikasi Vidio. Sebanyak 1.000 ulasan berbahasa Indonesia dikumpulkan menggunakan teknik web scraping dan diberi label sentimen berdasarkan rating bintang, di mana rating 1–2 dikategorikan sebagai sentimen negatif dan 3–5 sebagai sentimen positif. Data ulasan diproses melalui beberapa tahap preprocessing, seperti pembersihan teks, tokenisasi, penghapusan stopword, dan stemming, sebelum dikonversi menjadi representasi numerik menggunakan metode TF-IDF. Dataset dibagi menjadi 80% data latih dan 20% data uji. Kedua model dilatih dan dievaluasi menggunakan metrik accuracy, precision, recall, dan F1-score. Hasil penelitian menunjukkan bahwa performa yang lebih unggul diperoleh oleh algoritma SVM, dengan akurasi mencapai 76,11%, dibandingkan dengan RF yang memperoleh akurasi sebesar 71,67%. Selain itu, identifikasi ulasan dengan sentimen negatif juga dilakukan dengan lebih efektif oleh SVM. Temuan ini membuktikan bahwa klasifikasi sentimen ulasan aplikasi Vidio lebih tepat dilakukan menggunakan SVM, sehingga berpotensi mendukung otomatisasi analisis sentimen dan peningkatan kualitas layanan streaming. Hasil ini dapat diimplementasikan dalam sistem dashboard otomatis untuk mendeteksi keluhan pengguna secara real-time, memungkinkan pengembang Vidio meningkatkan pengalaman pengguna dengan respons yang lebih cepat dan tepat. Kata Kunci: Text Classification, User Sentiment, Support Vector Machine, Random Forest, Vidio.
EVALUASI HYPERPARAMTER TUNING PADA SUPPORT VECTOR MACHINE (SVM) DALAM KLASIFIKASI ULASAN HOTEL DI TRIPADVISOR Dewi, Fiashintha; Wibowo, Nur Cahyo Hendro; Handayani, Maya Rini; Umam, Khothibul
JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika) Vol 10, No 3 (2025)
Publisher : STKIP PGRI Tulungagung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29100/jipi.v10i3.7774

Abstract

Dengan adanya perkembangan teknologi para wisatawan sangat dimudahkan dalam mengakses informasi mengenai pemesanan kamar hotel. Dengan adanya hal tersebut, maka ulasan dari pengguna lain sangatlah penting untuk menemukan tempat yang mereka inginkan. Studi ini membahas tentang analisa ulasan para wisatawan mengenai hotel pada Tripadvisor. Tripadvisor adalah salah satu platform pan-duan wisata terbesar di dunia, yang menawarkan wisatawan untuk merencakan serta memperoleh perjalanan memuaskan. Data diambil melalui website Hugging Face yang kemudian dilanjutkan dengan proses pre-processing data. Dataset yang digunakan berjumlah 20.491 ulasan, terdiri dari 15.093 ulasan positif dan 5.938 ulasan negatif. Tujuan dari penelitian ini untuk mengevaluasi performa model SVM dalam melakukan klasifikasi sentimen pada ulasan hotel di Tripadvi-sor. Untuk mengoptimalkan performa model, dilakukan hyperparame-ter tuning menggunakan metode GridSearchCV. Hasil menunjukkan bahwa model default SVM memiliki akurasi 91%, namun recall pada kelas negatif masih rendah (0,75). Setelah tuning, akurasi sedikit menurun menjadi 90%, tetapi recall kelas negatif meningkat menjadi 0,77. Model terbaik diperoleh pada kombinasi parameter C = 10, gamma = 0,01, dan kernel = linear, dengan precision 0,92, recall 0,94, dan f1-score 0,80. Tuning terbukti meningkatkan keseimbangan klas-ifikasi antar kelas dan sensitivitas terhadap ulasan negatif. Hasil ini menegaskan pentingnya hyperparameter tuning dalam mengoptimal-kan performa dan generalisasi model SVM pada analisis sentimen dengan data yang tidak seimbang.
Identification of Buzzers in Skincare Reviews Using a Lexicon-Based Sentiment Analysis Method Pramesti, Arfiana Diah; Umam, Khothibul; Handayani, Maya Rini
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30871/jaic.v9i5.11005

Abstract

Along with the rapid development of digital technology, social media has become the main platform for consumers to share experiences about products, including skincare products. However, it is not uncommon for reviews provided by users to not reflect authentic experiences, but rather reviews created by certain parties, or buzzers, to manipulate public perception. The presence of buzzers in skincare reviews is important to consider, as they can affect consumer trust and influence purchasing decisions. This study aims to identify the presence of buzzers in skincare product reviews using a lexicon dictionary-based sentiment analysis. Of the 529 comments analyzed, 75 comments showed negative sentiment and 454 comments showed positive sentiment. The classification results revealed that 85.8% of the comments belonged to the non-buzzer category, while 14.2% were indicated as buzzers. Evaluation of the classification model showed high accuracy, reaching 93%, but performance in detecting buzzers was limited, with a recall metric of only 0.50. This shows that while the model managed to classify non-buzzer comments well, there are still difficulties in identifying buzzer comments, mostly due to data imbalance. This research emphasizes the importance of a proper analytical approach in detecting inauthentic reviews to ensure the information consumers receive remains accurate, transparent, and accountable.
Mapping the Polarity of Tourist Opinions on Indonesian Destinations through Google Maps Reviews Using Supervised Learning Methods Sa’adah, Siti Miftahus; Umam, Khothibul; Handayani, Maya Rini; Mustofa, Mokhammad Iklil
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30871/jaic.v9i5.9836

Abstract

The advancement of information technology has transformed how individuals seek information and plan their travels, notably through online reviews of tourist attractions on platforms like Google Maps. However, these reviews do not always align with visitors' expectations, necessitating further analysis to comprehend the underlying sentiments. The objective of this research is to inspect the performance of multiple machine learning algorithms in executing sentiment analysis on user generated reviews related to tourist attractions in Indonesia. The algorithms examined include Multinomial Naïve Bayes, Random Forest Classifier, Logistic Regression, Support Vector Machine, K-Nearest Neighbors, and Extra Trees Classifier. The research process encompasses data collection and labeling, data preprocessing, exploratory data analysis (EDA), Word Cloud visualization, feature extraction, classification implementation, and performance evaluation. Experimental results indicate that the K-Nearest Neighbors (KNN) algorithm attain the most accuracy and F1-score of 97%, indicating its effectiveness in categorizing text-based sentiment reviews sourced from the Google Maps platform.
Eksploitasi Anak sebagai Dampak Kemiskinan dalam Novel Kita Pergi Hari Ini Karya Ziggy Zezsyazeoviennazabrizkie (Kajian Sosiologi Sastra) Limparu, Stella Lasverina; Umam, Khothibul; Fadilah, Yuniardi
Wicara: Jurnal Sastra, Bahasa, dan Budaya Vol 4, No 2: Oktober 2025
Publisher : Program Studi Sastra Indonesia, Fakultas Ilmu Budaya, Universitas Diponegoro

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.14710/wjsbb.2025.27427

Abstract

This study aims to describe the structure and explain the issue of child exploitation as a consequence of poverty in the novel Kita Pergi Hari Ini (KPHI) using a literary sociology approach, along with the theories of fiction structure and child exploitation. Data were collected through literature study and note-taking techniques, and analyzed using data reduction, data presentation, and verification. The findings reveal that the main theme of the novel is the struggle of children to obtain a better life amidst poverty and exploitation. The main characters consist of five children and a recruiter named Nona Gigi, with a progressive plot driving the narrative. Poverty is portrayed through economic hardship and limited access to employment due to a system dominated by certain groups, leading parents to relinquish their responsibilities and hand over their children to exploitative parties. The forms of exploitation depicted in the novel include emotional, verbal, and physical abuse, with emotional abuse being the most dominant. The novel offers social criticism of a system that fails to protect children and emphasizes the importance of child protection and poverty alleviation to prevent further exploitation.Keywords: Kita Pergi Hari Ini, Child Exploitation, Poverty
Identifikasi Polaritas Sikap Pengguna Aplikasi X terhadap Coretax di Indonesia Menggunakan Algoritma Naïve Bayes Prasilda, Dina Rahma; Yuniarti, Wenty Dwi; Handayani, Maya Rini; Umam, Khothibul
JURNAL RISET KOMPUTER (JURIKOM) Vol. 12 No. 3 (2025): Juni 2025
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/jurikom.v12i3.8548

Abstract

The Core Tax Administration System (Coretax) was launched by the Directorate General of Taxes (DGT) in January 2025 as a technology-based integrated tax system. While its initial goal was to improve tax efficiency and compliance, Coretax faced technical challenges, including system errors, slow processing speed, and criticism from the public. The main platform used to address these challenges is the X app (formerly known as Twitter). This research aims to understand the public's views and responses to Coretax's services by analyzing user sentiment patterns seen on social media. The research identifies the polarity of user attitudes by utilizing natural language processing (NLP) and Naïve Bayes algorithms, applied to a dataset of 1,628 tweets collected between January and March 2025. The analyzed data reflects a wide range of public reactions that include both positive and negative opinions towards the Coretax implementation, both in terms of functionality and ease of use. The results show that the model has an accuracy rate of 93.07%, a precision value of 95%, a recall value of 96%, and an F1-Score value of 96%. The results of this study are expected to be able to provide precise mapping related to changes in public opinion towards Coretax, so that it can be a valuable source of information for application developers, policy makers in the field of taxation, and analysis in the technology sector in responding to the needs and expectations of society in the digital era.
Deteksi Dark patterns Biaya Layanan E-commerce Berdasarkan Perspektif Konsumen Menggunakan Algoritma Support Vector Machine Salmalina, Divana Taricha; Umam, Khothibul; Handayani, Maya Rini
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 6 No. 4 (2025): Juni 2025
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/json.v6i4.8563

Abstract

Perkembangan industri e-commerce di Indonesia belakangan ini dibayangkan pada fenomena meningkatnya keluhan konsumen terkait kebijakan biaya layanan yang dinilai kurang transparan, termasuk indikasi adanya praktik pola gelap . Penelitian ini bertujuan mengkaji persepsi konsumen terhadap isu tersebut melalui pendekatan analisis sentimen berbasis machine learning dan deteksi pola manipulatif. Data penelitian diperoleh dari ulasan pengguna di platform media sosial X yang kemudian diproses melalui serangkaian tahapan text mining meliputi pembersihan data, tokenisasi, stopword removal , dan stemming . Analisis sentimen menggunakan algoritma Support Vector Machine (SVM) menunjukkan hasil yang signifikan, dimana 55-78% ulasan di platform ketiga e-commerce (Shopee, Tokopedia, Lazada) tergolong negatif. Analisis TF-IDF mengidentifikasi kata kunci seperti "biaya", "layan" (layanan), dan "mahal" sebagai istilah paling dominan dalam ulasan negatif. Model SVM menunjukkan kinerja yang cukup baik dengan akurasi mencapai 87% dalam mengklasifikasikan sentimen negatif. Lebih lanjut, analisis tematik terhadap ulasan negatif berhasil mengidentifikasi indikasi pola gelap , khususnya dalam kategori biaya tersembunyi (biaya tersembunyi) dan menyelinap ke keranjang (penambahan produk tanpa disadari) yang muncul secara konsisten di semua platform. Temuan ini tidak hanya menegaskan adanya pola manipulatif yang berulang dalam industri e-commerce Indonesia, tetapi juga menegaskan urgensi bagi para pelaku industri untuk meningkatkan transparansi dalam kebijakan biaya. Secara praktis, hasil penelitian ini dapat menjadi bahan pertimbangan penting bagi regulator dalam merumuskan kebijakan perlindungan konsumen di era digital yang lebih komprehensif.
Application of SVM and Naive Bayes with PSO for the Classification of Saloka Amusement Park Reviews Putri, Indira Alifia; Umam, Khothibul; Handayani, Maya Rini; Mustofa, Hery
Journal La Multiapp Vol. 6 No. 6 (2025): Journal La Multiapp
Publisher : Newinera Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.37899/journallamultiapp.v6i6.2505

Abstract

Visitor opinions on tourist destinations can be evaluated through sentiment analysis based on textual reviews. This study aimed to compare the performance of Support Vector Machine (SVM) and Naive Bayes (NB) algorithms in classifying visitor sentiments toward reviews of Saloka Theme Park, while also assessing the impact of parameter optimization using Particle Swarm Optimization (PSO). A total of 740 reviews were collected from the Traveloka platform and underwent text preprocessing. The optimization process targeted key parameters of each algorithm to improve the F1-score. Experimental results showed that the unoptimized SVM achieved an accuracy of 89 percent, while NB reached 86 percent. After applying PSO, SVM's accuracy dropped to 84 percent, whereas NB improved to 85 percent with more balanced classification across sentiment classes. These results recommend the integration of Naive Bayes with Particle Swarm Optimization as a potential approach for sentiment classification of tourism reviews, particularly in the case study of Saloka Theme Park.
Perbandingan Model SpaCy dan BERT untuk Persebaran Penggemar di Platform X (Twitter) Rahmadani, Nurul; Umam, Khothibul; Dwi Yuniarti, Wenty; Rini Handayani, Maya
Jurnal Algoritma Vol 22 No 2 (2025): Jurnal Algoritma
Publisher : Institut Teknologi Garut

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33364/algoritma/v.22-2.2310

Abstract

This study was conducted to compare the performance of the SpaCy Named Entity Recognition (NER) model and the Bidirectional Encoder Representation from Transformers (BERT) model in identifying the distribution of Bernadya fans based on the mention of Geo-Political Entity (GPE) locations. The dataset used was collected from X users' tweets using a scraping method with Python and will be analyzed on both NER models. The SpaCy NER model will be built from scratch with manual annotation, while the BERT model will be built using the transforms approach. From the evaluation results, the SpaCy model achieved a precision of 1.00, a recall of 0.92, and an F1-score of 0.96 on the training data, as well as a recall of 0.98 and an F1-score of 0.99 on the test data. The BERT model recorded a precision of 1.00, a recall of 0.95 (training), and 1.00 (testing), with an F1-score of 0.98 and 1.00. The Spacy model can recognize more than two entities well in one test sentence. However, when tested with the entire dataset, it cannot consistently recognize GPE entities. Conversely, the BERT model is better at recognizing GPE entities, with 4 GPE entities identified, including: Karanganyar, Indonesia, Mongolia, and Bandung as regions capable of identifying GPE entities with the most mentions. Therefore, in this study, the BERT model is better at recognizing GPE entities from the dataset used.
Comparative Study of SVM and Decision Tree Algorithms on the Effect of SMOTE Technique on LinkAja Application Faruq, Muhammad Kholfan; Umam, Khothibul; Mustofa, Mokhamad Iklil; Mahfudh, Adzhal Arwani
Journal of Applied Informatics and Computing Vol. 9 No. 6 (2025): December 2025
Publisher : Politeknik Negeri Batam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30871/jaic.v9i6.9806

Abstract

The widespread adoption of digital wallets like LinkAja in Indonesia has led to a surge in user-generated reviews, which are valuable for assessing service quality. This study compares the classification performance of Support Vector Machine (SVM) and Decision Tree algorithms on user reviews from the LinkAja application. 7.000 reviews were gathered through web scraping and processed with standard text cleaning, tokenization, stopword removal, and stemming, resulting in 6,261 usable entries. These were divided into training and testing sets in a 70:30 ratio. The performance of each algorithm was evaluated both before and after the application of Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance. Prior to SMOTE, SVM recorded an accuracy of 77.97%, precision of 0.74, recall of 0.33, and F1 score of 0.45, while Decision Tree reached 72.01% accuracy, 0.50 precision, 0.62 recall, and 0.55 F1 score. After SMOTE, SVM accuracy slightly improved to 78.29%, with notable increases in recall (0.74) and F1 score (0.60); Decision Tree also saw an accuracy rise to 74.56% but experienced a slight decline in F1 score to 0.52. These findings demonstrate that SVM, particularly when used with SMOTE, offers better overall performance and class balance in classifying reviews with imbalanced sentiment distribution, making it more suitable than Decision Tree for this application.