Claim Missing Document
Check
Articles

Found 3 Documents
Search
Journal : Building of Informatics, Technology and Science

Perbandingan Algoritma NBC, SVM, Logistic Regression untuk Analisis Sentimen Terhadap Wacana KaburAjaDulu di Media Sosial X Rohman, Adib Annur; Trisnapradika, Gustina Alfa
Building of Informatics, Technology and Science (BITS) Vol 7 No 1 (2025): June (2025)
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i1.7261

Abstract

This research aims to analyze sentiment towards KaburAjaDulu discourse on X social media by utilizing Logistic Regression, Support Vector Machine (SVM), and Naive Bayes algorithms. Data was collected through a crawling process and resulted in 3,011 tweet data. Pre-processing stages include data cleaning, conversion of letters to lowercase, normalization, tokenization, stopword removal, and stemming. After preprocessing, the data was divided into two sentiment categories, namely positive and negative using a lexicon approach. The dataset is divided using an 80:20 scheme for training and test data, with feature representation utilizing the TF-IDF method. The modeling process is performed utilizing the three algorithms to be evaluated using accuracy, precision, recall, and f1-score metrics. As a solution to class inequality, the oversampling technique SMOTE (Synthetic Minority Over-sampling Technique) is applied. Based on the evaluation, it shows that before the application of SMOTE, Naive Bayes algorithm obtained 78.18% accuracy, 81.80% precision, 77.06% recall, and 77.35% f1-score; SVM obtained 85.63% accuracy, 86.49% precision, 85.68% recall, and 85.94% f1-score; while Logistic Regression obtained 83.05% accuracy, 85.31% precision, 82.47% recall, and 82.95% f1-score. After applying SMOTE, Naive Bayes improved to 81.90% accuracy, 82.27% precision, 81.67% recall, and 81.87% f1-score; SVM obtained 85.63% accuracy, 87.59% precision, 86.89% recall, and 87.13% f1-score; and Logistic Regression obtained 83.33% accuracy, 84.46% precision, 83.62% recall, and 83.88% f1-score. These findings prove that SVM has the most consistent and superior sentiment classification performance on this dataset, making an important contribution to the development of methods for analyzing people's views on social media platforms.
Optimasi Algoritma SVM dengan Teknik SMOTE dan Tuning Parameter pada Klasifikasi Balita Stunting Muttaqin, Muhammad Al Ghorizmi; Trisnapradika, Gustina Alfa
Building of Informatics, Technology and Science (BITS) Vol 7 No 3 (2025): December 2025
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i3.8330

Abstract

Stunting in toddlers is a chronic nutritional problem that has long-term impacts on human resource quality, including cognitive development and vulnerability to diseases. Brebes Regency is one of the priority areas for stunting management in Indonesia. This study aims to optimize the performance of the Support Vector Machine (SVM) algorithm in classifying stunting status among toddlers by addressing data imbalance using the Synthetic Minority Oversampling Technique (SMOTE) and parameter tuning. A total of 9,598 anthropometric samples collected from several community health center in Brebes were processed through stages of data cleaning, label encoding, outlier handling, standardization, and class splitting, and then divided into training data (80%) and testing data (20%). Two models were compared: the baseline SVM model and the optimized SVM model, which integrates SMOTE and parameter tuning through GridSearchCV. The results showed that the baseline model achieved an accuracy of 98.31%, but the recall for the stunting class was only 89.19%. After applying SMOTE and parameter tuning, the model’s performance improved, achieving an accuracy of 99.78% and a recall for the stunting class of 98.46%. This improvement demonstrates that the use of SMOTE and parameter tuning is highly effective in enhancing the model’s sensitivity toward the minority class. Therefore, this study shows that a comprehensive optimization approach can effectively support early detection of stunting, making it a valuable tool for more targeted health intervention planning.
Peningkatan Kinerja Model Naïve Bayes untuk Analisis Sentimen Komentar Terkait “Sound Horeg” Menggunakan SMOTE dan Tuning Parameter Kaisalana, Mustafid; Trisnapradika, Gustina Alfa
Building of Informatics, Technology and Science (BITS) Vol 7 No 3 (2025): December 2025
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i3.8554

Abstract

The phenomenon of “Sound Horeg” on online platforms has sparked diverse public sentiments, making sentiment analysis an essential tool for understanding public opinion. This study aims to classify user sentiments (positive/negative) related to “Sound Horeg” using the Naïve Bayes algorithm. The dataset used in this research exhibits significant class imbalance, with a predominance of negative sentiments. The methodology involves a series of text preprocessing stages, including case folding, tokenizing, normalization, lexicon-based sentiment labeling, stopword removal, stemming, and duplicate removal. The sentiment labeling process utilizes an Indonesian sentiment lexicon compiled from two sources lexicon_positif.csv and lexicon_negatif.csv containing predefined lists of words with positive and negative sentiment scores based on Indonesian public opinion lexicons. Subsequently, text features are extracted using the Term Frequency–Inverse Document Frequency (TF-IDF) method. To address data imbalance, the Synthetic Minority Oversampling Technique (SMOTE) is applied to the training data to balance the number of positive and negative samples. The Naïve Bayes model is then optimized using GridSearchCV to determine the best alpha value. Experimental results show that the unoptimized Naïve Bayes model achieved an accuracy of 73%, but struggled to classify minority classes (positive sentiments) due to data bias. After applying SMOTE and parameter tuning, the model’s performance improved significantly, demonstrating the effectiveness of these techniques in producing a more balanced and robust model. This study concludes that the Naïve Bayes algorithm, when optimized with SMOTE and hyperparameter tuning, is effective for Indonesian-language sentiment analysis, particularly on imbalanced datasets. Future work may include exploring other algorithms and employing broader sentiment lexicons and more complex linguistic features to further enhance model performance.