Claim Missing Document
Check
Articles

Effectiveness of Word2Vec and TF-IDF in Sentiment Classification on Online Investment Platforms Using Support Vector Machine Rifaldy, Fadil; Sibaroni, Yuliant; Prasetiyowati, Sri Suryani
JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika) Vol 10, No 2 (2025)
Publisher : STKIP PGRI Tulungagung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29100/jipi.v10i2.6055

Abstract

Investing in Indonesia is increasingly popular, especially among the millennial generation. investments such as deposits, gold, stocks, and online investment applications are increasingly in demand. This research focuses on the sentiment classification of user reviews of the Nanovest online investment application on the Google Play Store using the Support Vector Machine (SVM) method. SVM is used because it can classify opinions into positive and negative sentiment classes with good accuracy, by evaluating how effective Word2Vec features extraction that can convert words in a text into numerical vectors and TF-IDF that is capable of high-dimensional word weighting and TF-IDF Weighted Word2Vec combination features to produce richer vector representations. Tests were conducted using four SVM kernels namely Linear, Polynomial, RBF, and Sigmoid. The results show that Word2Vec with RBF kernel and 300 vector size produces the highest accuracy of 95.46%, the combination of TF-IDF Weighted Word2Vec also gives good performance with 95.29% accuracy on RBF kernel. However, TF-IDF alone resulted in the lowest accuracy of 93.31% on the Sigmoid kernel. This research shows that Word2Vec and combined feature extraction methods are effective in improving sentiment classification performance compared to TF-IDF.
Land Price Distribution Prediction in Jakarta Using Support Vector Machine with Feature Expansion and Kriging Interpolation Pilar Gautama, Hadid; Prasetiyowati, Sri Suryani; Sibaroni, Yuliant
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Vol. 10, No. 3, August 2025
Publisher : Universitas Muhammadiyah Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22219/kinetik.v10i3.2216

Abstract

Fluctuations in land prices over time are significant, especially in big cities, one of which is Jakarta. The increase in land prices is influenced by high demand, location-related needs, ease of access to various public facilities and population density. Uncontrolled prices and lack of information about the distribution of land prices cause buyers to acquire land that does not meet their needs. This study develops a land price distribution prediction system for Jakarta for 2025-2026 using Support Vector Machine (SVM) with time-based feature expansion and spatial interpolation. The SVM model with an RBF kernel demonstrated superior performance, achieving 93.14% accuracy for 2025 predictions using the t-1 model. For 2026 predictions, the t-2 model achieved 83.33% accuracy. This approach involves utilizing one to two years of historical data and systematically selected features, ensuring more accurate and relevant predictions. Ordinary kriging interpolation visualizations revealed a significant shift in land price distribution patterns, indicating a decline in affordable land availability and an increase in high-value properties across Jakarta. The integration of SVM and kriging interpolation, coupled with comprehensive evaluation metrics, provides a robust methodological framework for predicting urban land price distributions. This system offers practical implications for informed decision-making in Jakarta's dynamic land market, enabling stakeholders to make efficient, budget-based property decisions. The research contributes significantly to urban planning by providing a comprehensive tool for understanding and predicting land price trends, which can assist various stakeholders in making informed property investment decisions.
Fake News Detection with Hybrid CNN-SVM on Data AI and Technology Lesmana, Aditya; Sibaroni, Yuliant; Prasetyowati, Sri Suryani
Building of Informatics, Technology and Science (BITS) Vol 7 No 2 (2025): September 2025
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i2.7871

Abstract

The spread of fake news or hoaxes in this digital era, especially related to the issue of intelligence (AI) and Technology, is increasingly unsettling because it can trigger public misunderstanding and reduce trust in technological developments. News such as the claim that AI will lead to mass unemployment is a clear example of the spread of misleading information. Therefore, a system that can accurately detect fake news is needed. The purpose of this research is to develop a fake news detection system that is able to accurately identify hoaxes on topics related to AI and Technology. This study proposes a hybrid deep learning method that combines Convolutional Neural Network (CNN) and Support Vector Machine to improve the accuracy of hoax news detection. CNN is used to extract complex news text features, whereas SVM is used as a classifier because of its advantage of being able to separate classes within optimal margins. The selection of this method is based on the results of previous research which shows that each method has good performance, but has certain limitations. By combining the two, it is hoped that more optimal results can be obtained in detecting fake news, especially the topic of AI and Technology. The evaluation was carried out using news datasets related to AI and Technology that have gone through a process of preprocessing, feature extraction with TF – IDF, and feature expansion using Glove Embedding. The results obtained showed that the CNN-SVM hybrid model provided increased accuracy compared to using a single method.
Effectiveness of Bi-GRU and FastText in Sentiment Analysis of Shopee App Reviews Rahmanda, Rayhan Fadhil; Sibaroni, Yuliant; Prasetiyowati, Sri Suryani
Sinkron : jurnal dan penelitian teknik informatika Vol. 9 No. 1 (2025): Research Article, January 2025
Publisher : Politeknik Ganesha Medan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33395/sinkron.v9i1.14474

Abstract

E-commerce is proof of evolution in the economic field due to its flexibility to shop for various necessities of life anytime and anywhere. Shopee is one of the e-commerce platforms in demand by people from varied circles in Indonesia. Multiple reviews are shed publicly by Shopee users on the Google Play Store regarding shopping experiences, which can be positive or negative. This condition affects the decision of other users to shop at Shopee, thus impacting the increase or decrease in profits from Shopee itself. Therefore, user sentiment analysis is needed as a form of effort to maintain user trust in Shopee. This research aims to build a system to classify the sentiment of Shopee application users through reviews in the Google Play Store by utilizing the Bidirectional Gated Recurrent Unit (Bi-GRU) deep learning model. The dataset contains 9,716 reviews, including 3,937 positive and 5,779 negative sentiments. Several test scenarios were conducted to achieve the highest peak of performance, utilizing TF-IDF feature extraction, FastText feature expansion, and optimization using the Cuckoo Search Algorithm. Additionally, SMOTE resampling was utilized to correct the dataset’s uneven distribution. The combined test scenarios mentioned significantly improved the accuracy by 1.03% and F1-Score by 1.04% from the baseline, with the highest accuracy reaching 90.48% and the highest F1-Score of 90.16%.
Detection of Fraudulent Financial Statement based on Ratio Analysis in Indonesia Banking using Support Vector Machine Sibaroni, Yuliant; Ekaputra, Muhammad Novario; Prasetiyowati, Sri Suryani
JOIN (Jurnal Online Informatika) Vol. 5 No 2 (2020)
Publisher : Department of Informatics, UIN Sunan Gunung Djati Bandung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15575/join.v5i2.646

Abstract

This study proposes the use of ratio analysis-based features combined with the SVM classifier to identify fraudulent financial statements. The detection method used in this study applies a data mining classification approach. This method is expected to replace the expert in forensic accounting in identifying fraudulent financial statements that are usually done manually. The experimental results show that the proposed classifier model and ratio analysis-based features provide more than 90% accuracy results where the optimal number of features based on ratio analysis is 5 features, namely Capital Adequacy Ratio (CAR), (ANPB) to total earning assets and non-earning assets (ANP), Impairment provision on earning assets (CKPN) to earning assets, Return on Asset (ROA), and Return on Equity (ROE). The contribution of the study is to complement the research of fraudulent financial statements detection where the classifier method used here is different compare to other research. The selection of banking cases in Indonesia is also unique in this research which distinguishes it from other research because the financial reporting standards in each country can be different. 
Performance Analysis of ACO and FA Algorithms on Parameter Variation Scenarios in Determining Alternative Routes for Cars as a Solution to Traffic Jams Sibaroni, Yuliant; Prasetiyowati, Sri Suryani; Fairuz, Mitha Putrianty; Damar, Muhammad; Salis, Rafika
JOIN (Jurnal Online Informatika) Vol 7 No 1 (2022)
Publisher : Department of Informatics, UIN Sunan Gunung Djati Bandung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15575/join.v7i1.797

Abstract

This study proposes several alternative optimal routes on traffic-prone routes using Ant Colony Optimization (ACO) and Firefly Algorithm (FA). Two methods are classified as the metaheuristic method, which means that they can solve problems with complex optimization and will get the solution with the best results. Comparison of alternative routes generated by the two algorithms is measured based on several parameters, namely alpha and beta in determination of the best alternative route. The results obtained are that the alternative route produced by FA is superior to ACO, with an accuracy of 88%. This is also supported by the performance of the FA algorithm which is generally superior, where the resulting alternative route is shorter in distance, time, running time and  there is no influence on the alpha parameter value. But in each iteration, the number of alternative routes generated is less. The contribution of this research is to provide information about the best algorithm between ACO and FA in providing the most optimal alternative route based on the fastest travel time. The recommended alternative path is a path that is sufficient for cars to pass, because the selection takes into account the size of the road capacity.
LEARNING RATE AND EPOCH OPTIMIZATION IN THE FINE-TUNING PROCESS FOR INDOBERT’S PERFORMANCE ON SENTIMENT ANALYSIS OF MYTELKOMSEL APP REVIEWS Zaidan, Muhammad Naufal; Sibaroni, Yuliant; Prasetyowati, Sri Suryani
Jurnal Teknik Informatika (Jutif) Vol. 5 No. 5 (2024): JUTIF Volume 5, Number 5, Oktober 2024
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2024.5.5.2396

Abstract

With the advancement of the digital era, the growth of mobile applications in Indonesia is rapidly increasing, particularly with the MyTelkomsel app, one of the leading applications with over 100 million downloads. Given the large number of downloads, user reviews become crucial for improving the quality of services and products. This study proposes a sentiment analysis approach utilizing the Indonesian language model, IndoBERT. The main focus is on optimizing the learning rate and epochs during the fine-tuning process to enhance the performance of sentiment analysis on MyTelkomsel app reviews. The IndoBERT model, trained with the Indo4B dataset, is the ideal choice due to its proven capabilities in Indonesian text classification tasks. The BERT architecture provides contextual and extensive word vector representations, opening opportunities for more accurate sentiment analysis. This study emphasizes the implementation of fine-tuning with the goal of improving the model's accuracy and efficiency. The test results show that the model achieves a high accuracy of 96% with hyperparameters of batch size 16, learning rate 1e-6, and 3 epochs. The optimization of the learning rate and epoch values is key to refining the model. These results provide in-depth insights into user sentiment towards the MyTelkomsel app and practical guidance on using the IndoBERT model for sentiment analysis on Indonesian language reviews.
Geo-Sentiment Analysis of Public Opinion of X Users towards the Documentary Film Dirty Vote using the Bidirectional Long Short-Term Memory Method Salsabila, Syifa; Sibaroni, Yuliant; Prasetiyowati, Sri Suryani
Jurnal Teknik Informatika (Jutif) Vol. 6 No. 2 (2025): JUTIF Volume 6, Number 2, April 2025
Publisher : Informatika, Universitas Jenderal Soedirman

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52436/1.jutif.2025.6.2.4195

Abstract

Presidential elections held every five years, often generates significant public discourse. The 2024 presidential election saw the release of the documentary Dirty Vote, which raised allegations of electoral fraud and sparked polarized opinions on social media, especially on X. This study aims to analyze public sentiment toward Dirty Vote using geo-sentiment analysis and the Bidirectional Long Short-Term Memory (Bi-LSTM) model. Data were collected from geotagged tweets, with sentiment classified as positive, negative, or neutral. The research explored various data processing techniques, including TF-IDF for feature extraction, FastText for feature expansion, and balancing methods like SMOTE and class weighting to address data imbalance. Results showed that the baseline Bi-LSTM model achieved an accuracy of 71.57% and an F1-Score of 74.05%. When enhanced with TF-IDF and FastText, accuracy increased to 77.07%, though the F1-Score dropped slightly to 72.95%. Applying SMOTE resulted in a decrease in accuracy to 76.45%, but significantly improved the F1-Score to 74.93%. Exploratory data analysis revealed that negative sentiment was most concentrated in Java Island, particularly Jakarta, and peaked during February 2024, coinciding with the documentary's release and the election period. This study significantly contributes to understanding how geographic locations influence public opinion on sensitive political issues. A lack of understanding of geographically-based sentiment patterns can hinder identifying regional needs, leading to poorly targeted policies. By integrating data analysis methods with geographical approaches, this research provides deep insights for designing more effective, data-driven public intervention strategies and supports policymaking that is more responsive to the dynamics of public opinion.
Performance of Time-Based Feature Expansion in Developing ANN Classification Prediction Models on Time Series Data Sri Suryani Prasetiyowati; Yahya, Arnasli; Rohmawati, Aniq Atiqi
International Journal on Information and Communication Technology (IJoICT) Vol. 9 No. 2 (2023): Vol.9 No. 2 Dec 2023
Publisher : School of Computing, Telkom University

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

The prediction problem in most research is the main goal, to estimate future events related to the field under study. Research on classification that involves the prediction process in it, with spatial-time data and influenced by many features, such as the problem of disease spread, climate change, regional planning, environment, economic growth, requires methods that can predict while solving the problem of features and time. To obtain a time-based classification prediction model using many features, this research uses machine learning methods, one of which is Artificial Neural Network (ANN). The scenario carried out is to develop a t+r classification prediction model by expanding features based on the time t-r of the previous period. The performance of feature expansion in the development of ANN classification prediction models is determined based on the optimal accuracy value of the combination of t-r classification prediction models for the previous time period. By implementing the model on the data, it is found that the performance of time-based feature expansion in ANN classification ranges from 3.5% to 11%. While the optimal accuracy value is obtained from the feature expansion scenario of 3 to 5 time periods earlier.
Revealing the Impact of the Combination of Parameters on SVM Performance in COVID-19 Classification Prasetiyowati, Sri Suryani; Harini, Sri; Nur Fadila, Juniardi; Fahlena, Hilda
International Journal on Information and Communication Technology (IJoICT) Vol. 10 No. 1 (2024): Vol. 10 No.1 June 2024
Publisher : School of Computing, Telkom University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21108/ijoict.v10i1.965

Abstract

Non-linear SVM functions to modify the kernel in the SVM. Each kernel function in linear and non-linear SVMs has several parameters that are used in the classification process. SVM is a method that has advantages in classification, but there are still obstacles in selecting optimal parameters. This research investigates the effect of parameter variations on SVM classification performance on the COVID-19 dataset, using linear, RBF, Sigmoid and polynomial kernels. The analysis shows that the polynomial kernel is superior with the highest performance compared to other kernels. The highest accuracy of 77.57% was achieved with a combination of C values ??of 0.75 and Gamma of 0.75, and an F1-Score value of 76.67% indicating an optimal balance between precision and recall. The performance stability produced by the polynomial kernel provides advantages in classifying the COVID-19 dataset, with more controlled fluctuations compared to other kernels. The interaction between the C and Gamma parameters shows that a Gamma value of 0.75 consistently provides good results, while adjusting the C parameter shows more controlled performance variations. This confirms that appropriate Gamma parameter settings are key in improving the accuracy and consistency of SVM model predictions in this case.
Co-Authors Abduh Salam Adhe Akram Azhari Adhitya Aldira Hardy Aditya Andar Rahim Aditya Firman Ihsan Aditya Gumilar Akmal Muhamad Faza Aniq A. Rohmawati Aniq Atiqi Rohmawati Aqilla, Livia Naura arief rahman Arnasli Yahya Asramanggala, Muhammad Sulthon Aufa, Rizki Nabil Azmi Aulia Rahman Chamadani Faisal Amri Christina Natalia Claudia Mei Serin Sitio Damar, Muhammad Dede Tarwidi Derwin Prabangkara Diyas Puspandari Ekaputra, Muhammad Novario Elqi Ashok Erna Sri Sugesti Fairuz, Mitha Putrianty Fatha, Rizkialdy Fathin, Muhammad Ammar Fatri Nurul Inayah Gede Astawa Pradika Gilang Brilians Firmanesha Gusti Aji, Raden Aria Gutama, Soni Andika Hawa, Iqlima Putri Haziq, Muhammad Raffif Hilda Fahlena I Putu Ananda Miarta Utama Ibnu Muzakky M. Noor Indra Kusuma Yoga Indri Octavellia Wulanissa Irfani Adri Maulana Jauzy, Muhammad Abdurrahman Al Juniardi Nur Fadila Lesmana, Aditya Mahadzir, Shuhaimi Maharani, Anak Agung Istri Arinta Mardha Al Nazhfi Ali Mitha Putrianty Fairuz Muh. Kiki Adi Panggayuh Muhammad Alauddin Angka Kurniawan Muhammad Damar Muhammad Ghifari Adrian Muhammad Hadyan Baqi Muhammad Ikram Kaer Sinapoy Muhammad Novario Ekaputra Muldani, Muhamad Dika Nanda Ihwani Saputri Naufal Alvin Chandrasa Nenny Lisbeth Minarno Ni Made Dwipadini Puspitarini Nur Fadila, Juniardi Nuraena Ramdani Nurul Fajar Riani Pernanda Arya Bhagaskara S M Pilar Gautama, Hadid Purwanto, Brian Dimas Putra, Ihsanudin Pradana Putri, Pramaishella Ardiani Regita Rachmadania Irmanita Rafika Salis Rahmanda, Rayhan Fadhil Ridha Novia Ridho Isral Essa Rifaldy, Fadil Rizky Fauzi Ramadhani Rizky Yudha Pratama Rizky, Muhammad Zacky Faqia Salis, Rafika Salsabila, Syifa Sinaga, Astria M P Siti Uswah Hasanah Sri Harini Sri Harini Suhendar, Annisya Hayati Winico Fazry Wira Abner Sigalingging Yahya, Arnasli Yuliant Sibaroni Zaidan, Muhammad Naufal