Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Journal of Information Systems and Informatics

Sentiment Analysis on Coretax Data Using SVM and Random Forest with SMOTE and Tomek-Link Oktafiandi, Hery; Winarnie, Winarnie; Ramadhan, M. Fajar; Panjaitan, Febriyanti
Journal of Information System and Informatics Vol 7 No 3 (2025): September
Publisher : Universitas Bina Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51519/journalisi.v7i3.1279

Abstract

This study is motivated by the increasing adoption of digital tax platforms in Indonesia, particularly Coretax, which enables online tax reporting and payment. Understanding user sentiment is crucial for evaluating system effectiveness and identifying areas for improvement. However, sentiment data is often imbalanced, making it challenging to detect the sentiments of the minority class. This research evaluates the performance of Support Vector Machine (SVM) and Random Forest (RF) in classifying sentiment from Coretax related reviews collected between March and September 2025 from Twitter, YouTube, and the DJP application. Lexicon-based labeling and preprocessing were applied, followed by class balancing using Tomek-Link, SMOTE, and SMOTE-Tomek techniques. On the original data, SVM achieved an accuracy of 98.56%, while Random Forest reached 98.43%, both performing strongly on the majority class. However, minority class detection was improved through SMOTE and SMOTE-Tomek, albeit with a slight decrease in overall accuracy due to the risk of overfitting. The novelty of this study lies in its focus on Coretax 2025 data and a comparative analysis of multiple resampling techniques, providing practical insights into improving sentiment analysis performance on imbalanced digital tax data.
Machine Learning Classification of SCD, CHF, and NSR Using 15-Minute ECG-Derived HRV Features Panjaitan, Febriyanti; Ce, Win; Ramadhan, M. Fajar; Winarnie; Oktafiandi, Hery
Journal of Information System and Informatics Vol 8 No 2 (2026): April
Publisher : Asosiasi Doktor Sistem Informasi Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.63158/journalisi.v8i2.1557

Abstract

Heart disease remains one of the leading causes of mortality worldwide, making early detection essential for effective intervention. Heart Rate Variability (HRV) is widely used as a non-invasive marker for assessing cardiac conditions, and machine learning has shown potential in classifying heart diseases such as Sudden Cardiac Death (SCD) and Congestive Heart Failure (CHF). This study evaluates the performance of Support Vector Machine (SVM), Decision Tree (DT), and K-Nearest Neighbors (KNN) using 15-minute ECG signals comprising three 5-minute segments. The dataset consists of 53 subjects, generating 159 segments, including SCD, CHF, and Normal Sinus Rhythm (NSR). To prevent data leakage, a subject-wise split (80:20) is applied for training and testing. Two evaluation scenarios are considered: per-segment classification and combined 15-minute classification. Results indicate that SVM and DT achieve consistently high, stable performance with near-perfect accuracy, precision, recall, and F1-score, whereas KNN shows lower, more variable performance, particularly in segment-based analysis. The combined 15-minute approach provides more stable results, suggesting improved HRV representation and class separability. Although the results are promising, further validation with larger, more diverse datasets is required to ensure robustness and generalizability. This study highlights the potential of HRV-based machine learning while emphasizing the importance of appropriate temporal representation and rigorous evaluation design.