Claim Missing Document
Check
Articles

Found 2 Documents
Search

Improving Random Forest Performance for Sentiment Analysis on Unbalanced Data Using SMOTE and BoW Integration: PLN Mobile Application Case Study Rahmatullah, Muhammad Rifqi Fadhlan; Andono, Pulung Nurtantio; Affandy; Soeleman, M. Arief
Scientific Journal of Informatics Vol. 12 No. 1: February 2025
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v12i1.19295

Abstract

Purpose: This research aims to improve the accuracy of sentiment analysis on PLN Mobile app reviews by overcoming the challenge of data imbalance. This goal is important to provide a better understanding of user opinions and support PT PLN (Persero) in improving mobile application services. Methods: This research uses the Random Forest algorithm combined with Synthetic Minority Over-sampling Technique (SMOTE) to handle imbalanced data. Data is collected through web scraping reviews from the Google Play Store, followed by preprocessing processes such as data cleaning, stopword removal, tokenization, and stemming. Feature extraction is performed using the Bag of Words (BoW) method, and the data is tested with four sharing schemes. Result: The results showed that the 90%-10% sharing scheme gave the best performance with an accuracy of 81% and an average precision and recall of 0.79. This finding confirms that the larger the proportion of training data, the better the model performs sentiment classification. Novelty: This research's novelty lies in combining SMOTE with BoW and Random Forest to overcome data imbalance. This approach is a significant reference for future sentiment analysis research. It provides practical insights that PT PLN (Persero) can use to improve the quality of its application services.
Classification of Breast Cancer Histopathology Images with Attention-Based Multiple Instance Learning Method Setiyani, Safira Hasna; Noersasongko, Edi; Affandy
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Vol. 10, No. 4, November 2025
Publisher : Universitas Muhammadiyah Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22219/kinetik.v10i4.2310

Abstract

Breast cancer is one of the deadliest types of cancer among women worldwide. Early detection plays a crucial role in increasing the chances of successful treatment and reducing the risk of death. Various efforts have been made by both the general public and medical professionals to raise awareness, promote early screening, and ensure timely medical intervention. With advances in technology, the use of computer-based systems, particularly in the field of medical image analysis, has become increasingly important. One such application is histopathological image analysis to support the diagnostic process in breast cancer cases. Histopathological image classification has gained significant attention from researchers in recent years, and various machine learning and deep learning techniques have been applied to improve its accuracy. Convolutional Neural Networks (CNNs), as part of the deep learning framework, have shown promising results in identifying tissue patterns in histopathological images. However, despite their high accuracy, CNNs are often less interpretable, making it difficult to understand the reasoning behind their predictions—especially when dealing with subtle features such as small spots, dots, or fine lines that may be overlooked. This study addresses these limitations by proposing a method that not only classifies histopathological images with high accuracy but also enhances readability through localization techniques. The goal is to make the classification process more transparent and clinically useful. Using widely recognized datasets like BreakHIS, the proposed method achieves a classification accuracy of up to 97.50%, demonstrating its potential as a reliable tool in medical diagnostics and breast cancer research.