Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Tech-E

Enhancing Sundanese News Articles Classification: A Comparative Study of Models and Feature Extraction Techniques A. Permana, Yadhi; Setiawan, Irwan; Diani, Fitri; Suprihanto
Tech-E Vol. 8 No. 2 (2025): TECH-E (Technology Electronic)
Publisher : Fakultas Sains dan Teknologi-Universitas Buddhi Dharma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31253/te.v8i2.3212

Abstract

This paper presents a comprehensive investigation into the classification of Sundanese news articles, focusing on the evaluation of various classification models and feature extraction methods. Using a dataset obtained from Sundanese news websites, this study conducts a systematic comparison of Naive Bayes and Logistic Regression classifiers combined with TF-IDF and Bag-of-Words feature extraction methods. The research process involves critical steps such as data preprocessing, model training, hyperparameter optimization, and performance assessment based on standard metrics, including accuracy, precision, recall, and F1-score. Results demonstrate high accuracy across all combinations, with the Logistic Regression model using Bag-of-Words feature extraction achieving the highest accuracy of 98.20%. Beyond model evaluation, the research delves into qualitative data analysis. Word clouds and TF-IDF weighting are employed to uncover prominent themes and topics within the news articles, highlighting recurring patterns in the Sundanese language. The study identifies key challenges, including the scarcity of annotated datasets for low-resource languages like Sundanese and the limitations of traditional models in capturing complex linguistic structures. Future opportunities are highlighted, such as leveraging deep learning models, including transformers, to enhance classification performance and address current limitations. Additionally, ensemble methods and domain-specific adaptations could further improve accuracy. Overall, this research contributes to advancing Sundanese language processing and provides a roadmap for future innovations in text classification and natural language processing applications.
Co-Authors -, Nilda ., Nuraksa A, Fauzan Adzima Ade Chandra Nugraha Andi Velahyati Baharuddin Armin Darmawan Asmal, Sapta Azzahra, Salsabila Dini Bagas Ristanto, Dionesius Bahrul Ngulum, Muhamat Bayu Putra, Fajar Tri Burstiando, Rizki Cahyani, Rani Caroline Felicita Aurelius Chaniago, Harmon Deria, Desy Dhedhy Yuliawan Diani, Fitri Dwi Handayani F, Andi Syahreza F, Widya Ananda Faqih, Achmad Zulva Ainun Farida Muchtadi, Farida Feryanto, Wahyu Gilas, Gilas Pradana Putra Hablinur Alkindi Halilah, Ii Haris, Syukri Agung Wijaya Harmon Harmon Hestika Huda, Dhimas Miftachul Husein Allsabah, M. Akbar Imam Basori Ismanto, Juli Jannah, Silvie Roikhatul Jonner Hutahaean Kahirulloh, Aldona Kartolo, Rachmat Kholis, Moh. Nur Kusumohadi, Catur Setyawan Lian Min, Joe Lusianti, Septyaning M, Weney Kanatya Mangngenre, Saiful Moh. Nurkholis Moh. Yunan Firmansyah MOKHAMMAD FIRDAUS, MOKHAMMAD Muchtadi, Farida I. Muhammad Aditya Irfaani Muhammad, Maula Sidi Nur Ahmad Muharram Nur Fadillah, Nur Nurrohman Nurrohman Panjaitan, Tutur Parade Tua Parenreng, Syarifuddin Mabe Perdana, Ferdy Aprilian Prasetya Kurniawan, Wing Purba, Radia Puspodari Ramadani, Moch. Rifqi Reo Prasetiyo Herpandika, Reo Prasetiyo Ricky Saputra Ryan Widhandi Sagitarius, Andri Prasetyo Saiful - Sholihati Amalia Slamet Junaidi Sri Surjani Tjahjawati Sugiarto Sugiyanto - Suprihanto Suprijanto Suprijanto Susilo, Teguh Budi Sutisna, Ma’mun Syamsul, Diniary Ikasari Tambunan, Daulat Marulitua Wisnuadhi, Bambang Yadhi Aditya Permana YAYAN FIRMANSYAH, YAYAN Yuwono, Diki Candra Zawawi, M. Anis