Claim Missing Document
Check
Articles

Mapping the Polarity of Tourist Opinions on Indonesian Destinations through Google Maps Reviews Using Supervised Learning Methods Sa’adah, Siti Miftahus; Umam, Khothibul; Handayani, Maya Rini; Mustofa, Mokhammad Iklil
Journal of Applied Informatics and Computing Vol. 9 No. 5 (2025): October 2025
Publisher : Politeknik Negeri Batam

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30871/jaic.v9i5.9836

Abstract

The advancement of information technology has transformed how individuals seek information and plan their travels, notably through online reviews of tourist attractions on platforms like Google Maps. However, these reviews do not always align with visitors' expectations, necessitating further analysis to comprehend the underlying sentiments. The objective of this research is to inspect the performance of multiple machine learning algorithms in executing sentiment analysis on user generated reviews related to tourist attractions in Indonesia. The algorithms examined include Multinomial Naïve Bayes, Random Forest Classifier, Logistic Regression, Support Vector Machine, K-Nearest Neighbors, and Extra Trees Classifier. The research process encompasses data collection and labeling, data preprocessing, exploratory data analysis (EDA), Word Cloud visualization, feature extraction, classification implementation, and performance evaluation. Experimental results indicate that the K-Nearest Neighbors (KNN) algorithm attain the most accuracy and F1-score of 97%, indicating its effectiveness in categorizing text-based sentiment reviews sourced from the Google Maps platform.
THE PERCEPTIONS OF SEMARANG FIVE STAR HOTEL TOURISTS WITH SUPPORT VECTOR MACHINE ON GOOGLE REVIEWS Aufan, Muhammad Haikal; Handayani, Maya Rini; Nurjanna, Afifah Basmah; Wibowo, Nur Cahyo Hendro; Umam, Khotibul
Jurnal Teknik Informatika (Jutif) Vol. 5 No. 5 (2024): JUTIF Volume 5, Number 5, Oktober 2024
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2024.5.5.2025

Abstract

Travelers on the road sometimes need a hotel to rest. In choosing a hotel, they refer to the ratings or reviews written by users through reviews on Google. This is because not all star hotels provide facilities in accordance with user assessments. This study discusses the analysis of the opinions of tourists who have stayed in 5-star hotels in Semarang through a review of commentary data on Google. The 5-star hotels used as the research are Padma, Gumaya, Tentrem, Grand Candi, Ciputra, and PO. The dataset of the six hotels was obtained through a scraping process then followed by data pre-processing. The data was retrieved from Google Maps using the Chrome Instant Data Scrapper extension. Data preprocessing begins with case folding, tokenizing, filtering, and ends with stemming. Support Vector Machine (SVM) is implemented for sentimen classification process. The results from this study are the majority of 5-star hotel reviews in Semarang tend to have positive rather than negative sentimens. Our model was able to produce an accuracy of 0.87 to 0.98. The highest accuracy was achieved by Ciputra Hotel at 0.98 with 543 positive reviews.
Identifikasi Polaritas Sikap Pengguna Aplikasi X terhadap Coretax di Indonesia Menggunakan Algoritma Naïve Bayes Prasilda, Dina Rahma; Yuniarti, Wenty Dwi; Handayani, Maya Rini; Umam, Khothibul
JURNAL RISET KOMPUTER (JURIKOM) Vol. 12 No. 3 (2025): Juni 2025
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/jurikom.v12i3.8548

Abstract

The Core Tax Administration System (Coretax) was launched by the Directorate General of Taxes (DGT) in January 2025 as a technology-based integrated tax system. While its initial goal was to improve tax efficiency and compliance, Coretax faced technical challenges, including system errors, slow processing speed, and criticism from the public. The main platform used to address these challenges is the X app (formerly known as Twitter). This research aims to understand the public's views and responses to Coretax's services by analyzing user sentiment patterns seen on social media. The research identifies the polarity of user attitudes by utilizing natural language processing (NLP) and Naïve Bayes algorithms, applied to a dataset of 1,628 tweets collected between January and March 2025. The analyzed data reflects a wide range of public reactions that include both positive and negative opinions towards the Coretax implementation, both in terms of functionality and ease of use. The results show that the model has an accuracy rate of 93.07%, a precision value of 95%, a recall value of 96%, and an F1-Score value of 96%. The results of this study are expected to be able to provide precise mapping related to changes in public opinion towards Coretax, so that it can be a valuable source of information for application developers, policy makers in the field of taxation, and analysis in the technology sector in responding to the needs and expectations of society in the digital era.
Deteksi Dark patterns Biaya Layanan E-commerce Berdasarkan Perspektif Konsumen Menggunakan Algoritma Support Vector Machine Salmalina, Divana Taricha; Umam, Khothibul; Handayani, Maya Rini
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 6 No. 4 (2025): Juni 2025
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/json.v6i4.8563

Abstract

Perkembangan industri e-commerce di Indonesia belakangan ini dibayangkan pada fenomena meningkatnya keluhan konsumen terkait kebijakan biaya layanan yang dinilai kurang transparan, termasuk indikasi adanya praktik pola gelap . Penelitian ini bertujuan mengkaji persepsi konsumen terhadap isu tersebut melalui pendekatan analisis sentimen berbasis machine learning dan deteksi pola manipulatif. Data penelitian diperoleh dari ulasan pengguna di platform media sosial X yang kemudian diproses melalui serangkaian tahapan text mining meliputi pembersihan data, tokenisasi, stopword removal , dan stemming . Analisis sentimen menggunakan algoritma Support Vector Machine (SVM) menunjukkan hasil yang signifikan, dimana 55-78% ulasan di platform ketiga e-commerce (Shopee, Tokopedia, Lazada) tergolong negatif. Analisis TF-IDF mengidentifikasi kata kunci seperti "biaya", "layan" (layanan), dan "mahal" sebagai istilah paling dominan dalam ulasan negatif. Model SVM menunjukkan kinerja yang cukup baik dengan akurasi mencapai 87% dalam mengklasifikasikan sentimen negatif. Lebih lanjut, analisis tematik terhadap ulasan negatif berhasil mengidentifikasi indikasi pola gelap , khususnya dalam kategori biaya tersembunyi (biaya tersembunyi) dan menyelinap ke keranjang (penambahan produk tanpa disadari) yang muncul secara konsisten di semua platform. Temuan ini tidak hanya menegaskan adanya pola manipulatif yang berulang dalam industri e-commerce Indonesia, tetapi juga menegaskan urgensi bagi para pelaku industri untuk meningkatkan transparansi dalam kebijakan biaya. Secara praktis, hasil penelitian ini dapat menjadi bahan pertimbangan penting bagi regulator dalam merumuskan kebijakan perlindungan konsumen di era digital yang lebih komprehensif.
Klasifikasi Sentimen Masyarakat Terhadap Aplikasi Tiktok Menggunakan Algoritma Naive Bayes Nahdhudin, Muhammad; Wibowo, Nur Cahyo Hendro; Handayani, Maya Rini; Umam, Khotibul
Jurnal Teknologi informasi dan Ilmu Komputer Vol. 1 No. 3 (2025): Juli 2025
Publisher : Nolsatu Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.65258/jutekom.v1.i3.13

Abstract

Perkembangan pesat media telah menciptakan ruang digital baru bagi masyarakat untuk mengekspresikan opini, salah satunya melalui aplikasi TikTok. Aplikasi ini menjadi fenomena global dengan jutaan pengguna aktif, namun juga menuai pro dan kontra. Penelitian ini bertujuan untuk mengklasifikasikan sentimen masyarakat terhadap TikTok menggunakan algoritma Naive Bayes. Data yang digunakan  berupa 100.000 ulasan pengguna TikTok dari Google Play Store yang dikumpulkan melalui web scraping. Proses penelitian mencakup tahapan pra-pemrosesan teks, pelabelan sentimen berdasarkan rating, ekstraksi fitur menggunakan TF-IDF, dan klasifikasi menggunakan algoritma Multinominal Naive Bayes. Hasil evaluasi menunjukkan bahwa model memiliki akurasi sebesar 88,2% precision 89,1%, recall 87,4% dan F1-score 88,2%. Mayoritas ulasan pengguna bersentimen positif. Hasil ini menunjukkan bahwa Naive Bayes efektif dalam mengklasifikasi opini berbasis teks dalalam bahasa Indonesia. Penelitian ini dapat dijadikan acuan dalam memahami persepsi publik serta penerapan maching learning dalam analisis opini masyarakat.
Application of SVM and Naive Bayes with PSO for the Classification of Saloka Amusement Park Reviews Putri, Indira Alifia; Umam, Khothibul; Handayani, Maya Rini; Mustofa, Hery
Journal La Multiapp Vol. 6 No. 6 (2025): Journal La Multiapp
Publisher : Newinera Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.37899/journallamultiapp.v6i6.2505

Abstract

Visitor opinions on tourist destinations can be evaluated through sentiment analysis based on textual reviews. This study aimed to compare the performance of Support Vector Machine (SVM) and Naive Bayes (NB) algorithms in classifying visitor sentiments toward reviews of Saloka Theme Park, while also assessing the impact of parameter optimization using Particle Swarm Optimization (PSO). A total of 740 reviews were collected from the Traveloka platform and underwent text preprocessing. The optimization process targeted key parameters of each algorithm to improve the F1-score. Experimental results showed that the unoptimized SVM achieved an accuracy of 89 percent, while NB reached 86 percent. After applying PSO, SVM's accuracy dropped to 84 percent, whereas NB improved to 85 percent with more balanced classification across sentiment classes. These results recommend the integration of Naive Bayes with Particle Swarm Optimization as a potential approach for sentiment classification of tourism reviews, particularly in the case study of Saloka Theme Park.
Perbandingan Model SpaCy dan BERT untuk Persebaran Penggemar di Platform X (Twitter) Rahmadani, Nurul; Umam, Khothibul; Dwi Yuniarti, Wenty; Rini Handayani, Maya
Jurnal Algoritma Vol 22 No 2 (2025): Jurnal Algoritma
Publisher : Institut Teknologi Garut

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33364/algoritma/v.22-2.2310

Abstract

This study was conducted to compare the performance of the SpaCy Named Entity Recognition (NER) model and the Bidirectional Encoder Representation from Transformers (BERT) model in identifying the distribution of Bernadya fans based on the mention of Geo-Political Entity (GPE) locations. The dataset used was collected from X users' tweets using a scraping method with Python and will be analyzed on both NER models. The SpaCy NER model will be built from scratch with manual annotation, while the BERT model will be built using the transforms approach. From the evaluation results, the SpaCy model achieved a precision of 1.00, a recall of 0.92, and an F1-score of 0.96 on the training data, as well as a recall of 0.98 and an F1-score of 0.99 on the test data. The BERT model recorded a precision of 1.00, a recall of 0.95 (training), and 1.00 (testing), with an F1-score of 0.98 and 1.00. The Spacy model can recognize more than two entities well in one test sentence. However, when tested with the entire dataset, it cannot consistently recognize GPE entities. Conversely, the BERT model is better at recognizing GPE entities, with 4 GPE entities identified, including: Karanganyar, Indonesia, Mongolia, and Bandung as regions capable of identifying GPE entities with the most mentions. Therefore, in this study, the BERT model is better at recognizing GPE entities from the dataset used.
E EKSPLORASI PENDAPAT PENGGUNA APLIKASI DEEPSEEK DI PLAY STORE DENGAN NATURAL LANGUAGE PROCESSING putri lathifah, shofi; Khotibul Umam; Maya Rini Handayani
JIFOSI Vol. 6 No. 2 (2025): Enhancing Information System Security and Governance in the Digital Transformat
Publisher : UPN "Veteran" Jawa Timur

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Ulasan yang ditulis oleh pengguna di Google Play Store dapat sangat membantu untuk mengetahui bagaimana masyarakat menerima aplikasi. Dalam penelitian ini, kami menggunakan metode pemrosesan bahasa natural (NLP) untuk melihat bagaimana pengguna melihat aplikasi DeepSeek. Metode scraping dari Play Store digunakan untuk mengumpulkan data, yang kemudian dibersihkan dan diproses melalui proses normalisasi teks, tokenisasi, penghapusan stopword, dan stemming. Selanjutnya, kami mengklasifikasikan ulasan menjadi dua kelompok besar, yaitu ulasan positif dan negatif, menggunakan algoritma Naive Bayes. Hasil analisis menunjukkan bahwa sebagian besar ulasan bersifat positif, dengan beberapa kritik yang terkait dengan bug atau kinerja aplikasi. Studi ini menunjukkan bagaimana teknik pemrosesan bahasa natural dapat digunakan untuk menggali wawasan secara otomatis dan efektif dari data ulasan.
Analisis Performa Metode Machine Learning dalam Mengidentifikasi Penyebab Ulasan Rating Satu Aplikasi MyBluebird Azziizah, Almira Farradinda; Mustofa, Hery; Umam, Khothibul; Handayani, Maya Rini
Jurnal Ilmiah Global Education Vol. 6 No. 4 (2025): JURNAL ILMIAH GLOBAL EDUCATION
Publisher : LPPM Institut Pendidikan Nusantara Global

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.55681/jige.v6i4.4704

Abstract

This study addresses the increasing prevalence of negative user reviews for the MyBluebird ride-hailing application, focusing on the identification and classification of the main causes of one-star ratings. The research aims to compare the effectiveness of Support Vector Machine, Random Forest, and Naïve Bayes algorithms in classifying user complaints. Employing a quantitative experimental approach, the study utilizes a dataset of 1,399 one-star reviews collected purposively from Google Play Store. Data preprocessing includes cleaning, tokenization, and feature extraction using TF-IDF. The classification models are evaluated using accuracy, precision, recall, and F1-score metrics. Results indicate that Random Forest achieves the highest accuracy (90%), outperforming the other algorithms, with bugs/errors as the most frequent complaint, followed by driver performance, other issues, and price. The study concludes that machine learning-based classification can effectively map user dissatisfaction, though data imbalance remains a limitation. Future research should apply data balancing techniques and expand the dataset for broader generalization. Practical implications suggest that developers can utilize automated classification to improve service quality and address user needs more efficient.
Digital innovation in islamic guidance: Fuzzy mamdani-based stress level detection for muslim adolescents Amal, Muhammad Niltal; Handayani, Maya Rini
Jurnal Ilmu Dakwah Vol. 45 No. 2 (2025)
Publisher : Faculty of Dakwah and Communication, Walisongo State Islamic University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21580/jid.v45.2.28875

Abstract

Purpose - The purpose of this study is to address the urgent need for a reliable and accessible preliminary assessment tool for mental health issues among Indonesian adolescents, a demographic significantly impacted by recent societal changes and showing high rates of psychological distress (e.g., 34.9% experiencing mental problems per I-NAMHS 2022) Method - The methodology employed is the design and development of a web-based expert system utilizing Fuzzy Logic to diagnose stress symptoms. The system is structured with four core components (User Interface, Knowledge Base, Inference Mechanism, Working Memory) and incorporates Fuzzy Tsukamoto/Mamdani/Sugeno principles for mapping firm numerical symptom inputs to linguistic variables through Fuzzification and Defuzzification processes. Result  -  The result (anticipated/achieved) is a functional, dual-role web system (Admin/User) capable of providing objective, preliminary stress categorization, thereby serving as a first step toward early intervention. Implication – The implication of this work is twofold: clinically, it provides an immediate, non-expert-dependent tool for initial screening; ethically and religiously, it supports the collective obligation (fard kifayah) to safeguard youth mental well-being and maintain their fitrah. Originality/Value - The originality of this research lies in the specific integration of Fuzzy Logic reasoning within a web-based expert system tailored explicitly to the Indonesian adolescent context for stress awareness and preliminary diagnosis.*** Tujuan - Tujuan studi ini adalah untuk memenuhi kebutuhan mendesak akan alat penilaian awal yang andal dan mudah diakses untuk masalah kesehatan mental di kalangan remaja Indonesia, kelompok demografis yang sangat terdampak oleh perubahan sosial terkini dan menunjukkan tingkat gangguan psikologis yang tinggi (misalnya, 34,9% mengalami masalah mental menurut I-NAMHS 2022) Metode - Metode yang digunakan adalah desain dan pengembangan sistem pakar berbasis web yang memanfaatkan Logika Fuzzy untuk mendiagnosis gejala stres. Sistem ini terdiri dari empat komponen inti (Antarmuka Pengguna, Basis Pengetahuan, Mekanisme Inferensi, Memori Kerja) dan mengintegrasikan prinsip-prinsip Fuzzy Tsukamoto/Mamdani/Sugeno untuk memetakan masukan numerik gejala yang pasti ke variabel linguistik melalui proses Fuzzifikasi dan Defuzzifikasi. Hasil - Hasil (diperkirakan/tercapai) adalah sistem web fungsional dengan dua peran (Admin/Pengguna) yang mampu memberikan kategorisasi stres awal yang objektif, sehingga berfungsi sebagai langkah awal menuju intervensi dini. Implikasi – Implikasi dari penelitian ini dua arah: secara klinis, menyediakan alat skrining awal yang segera dan tidak bergantung pada ahli; secara etis dan agama, mendukung kewajiban kolektif (fard kifayah) untuk melindungi kesejahteraan mental remaja dan menjaga fitrah mereka. Orisinalitas/Nilai - Keaslian penelitian ini terletak pada integrasi spesifik logika fuzzy dalam sistem pakar berbasis web yang dirancang khusus untuk konteks remaja Indonesia dalam kesadaran stres dan diagnosis awal.