Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : BAREKENG: Jurnal Ilmu Matematika dan Terapan

OPTIMIZING LONG TEXT CLASSIFICATION PERFORMANCE THROUGH KEYWORD-BASED SENTENCE SELECTION: A CASE STUDY ON ONLINE NEWS CLASSIFICATION FOR INDONESIAN GDP GROWTH-RATE DETECTION Sholawatunnisa, Dinda Pusparahmi; Suadaa, Lya Hulliyyatus
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 2 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss2pp1081-1094

Abstract

Efficiently managing lengthy textual data, particularly in online news, is crucial for enhancing the performance of long text classification. This study delves into innovative approaches to streamline the Gross Domestic Product (GDP) computation process by harnessing modern data analytics, Natural Language Processing (NLP), and online news sources. Leveraging online news data introduces real-time information, promising to improve the accuracy and timeliness of economic indicators like GDP. However, handling the complexity of extensive textual data poses a challenge, demanding advanced NLP techniques. This research shifts from traditional word-weight-based methods to keyword-based extractive summarization techniques to address this. These tailored approaches ensure that selected sentences align precisely with specific keywords relevant to the research case, such as GDP growth rate detection. The study emphasizes the necessity of adapting summarization methods to capture information in unique research contexts effectively. According to classification results, the implementation of sentence selection successfully demonstrated improved performance in terms of classification accuracy. Specifically, there was an average accuracy increase of 0.0226 for machine learning and 0.0164 for transfer learning models. Additionally, in terms of computational efficiency, sentence selection also accelerates processing time during hyperparameter tuning and fine-tuning, as observed using the same computational resources.