Claim Missing Document
Check
Articles

Found 3 Documents
Search

The Classification of Hate Comments on Twitter Using a Combination of Logistic Regression and Support Vector Machine Algorithm Damayanti, Nabila Putri; Prameswari, Della Egyta; Puspita, Wiyanda; Sundari, Putri Susi
Journal of Information System Exploration and Research Vol. 2 No. 1 (2024): January 2024
Publisher : shmpublisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joiser.v2i1.229

Abstract

This research was conducted to increase accuracy in classifying sentences containing hate speech and non-hate speech on Twitter. This is important to do because, as technology develops, it also comes with negative impacts, one of which is hate speech. This classification is carried out using a combination of Logistic Regression (LR) and Support Vector Machine (SVM) methods. This combination is based on the ease of implementation and speed of LR as well as SVM's ability to handle more complex and non-linear data. In this context, LR is used to model the probability that a comment on Twitter contains hate elements or not. The model can then provide probability predictions for each class, and a threshold can be set to determine the final class. This research shows that combining these methods can build a good classification model with an accuracy of 96%.
Increased accuracy in predicting student academic performance using random forest classifier Mulyana, Aditya Fajar; Puspita, Wiyanda; Jumanto, Jumanto
Journal of Student Research Exploration Vol. 1 No. 2: July 2023
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/josre.v1i2.169

Abstract

This research aims to classify the academic performance of students who are successful and who have dropped out of school with high accuracy so that these matters can be addressed quickly. Things like this need fast handling to find out what factors influence it. In addition, this research was conducted to test how good the random forest algorithm is in classifying a problem. Random forest, which includes an algorithm that is commonly used for classifying a problem. By using the random forest algorithm, the accuracy results will be better than a single decision tree. This algorithm is quite good at handling and managing large datasets. From this study it can be concluded that this method can provide good prediction accuracy with a fairly high level of accuracy, namely 89%. Utilization of this random forest can be an alternative in classifying student academic achievement. This algorithm can work well in handling large datasets. This study discusses how the use of Random Forest can work to classify students' academic performance.
Optimization of Logistic Regression Algorithm Using Grey Wolf Optimizer for Credit Card Fraud Detection Puspita, Wiyanda; Hakim, M. Faris Al
Scientific Journal of Informatics Vol. 12 No. 4: November 2025
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v12i4.26807

Abstract

Purpose: The advancement of digital technology has significantly changed the financial transaction system, but has also led to an increase in cybercrime, especially credit card fraud. This crime poses a significant financial threat, with reported losses reaching hundreds of millions of dollars annually. This study aims to improve the effectiveness of fraud detection using the Logistic Regression (LR) algorithm, which although widely used in binary classification, is still vulnerable to challenges with imbalanced data. The goal is to optimize LR using the Grey Wolf Optimizer (GWO) to improve accuracy and reliability. Methods: This research implements a Logistic Regression (LR) model whose hyperparameters are optimized using Grey Wolf Optimizer (GWO) algorithm. The model was trained and tested on a public Kaggle dataset containing 284,807 credit card transactions. Data preprocessing includes handling outliers using Interquartile Range (IQR) method and handling class imbalance using KMeansSMOTE. Evaluation metrics include accuracy, precision, recall, f1-score, and specificity based on confusion matrix. Result: The baseline LR model achieved 99.92% accuracy, 75.18% precision, 74.73% recall, 75.45% F1-score, and 99.96% specificity. After GWO optimization, the model improved to 99.94% accuracy, 85.96% precision, 83.08% recall, 84.01% F1-score, and 99.97% specificity, showing a significant performance boost. This represents a notable improvement in key metrics for fraud detection, with an increase of 14.3% in precision, 11.2% in recall, and 11.3% in the F1-score, demonstrating a more robust model. Novelty: This study proposed the application of the Grey Wolf Optimizer (GWO) for hyperparameter tuning of a Logistic Regression model in the context of fraud detection. Unlike conventional optimization techniques that can be computationally expensive, our GWO-based approach offers an efficient and effective method for discovering optimal model settings. The optimized model not only outperforms the baseline LR but also presents a scalable and powerful solution for financial institutions to improve the accuracy of their fraud detection systems.