Garuda - Garba Rujukan Digital

p-Index From 2021 - 2026

3.421

P-Index

This Author published in this journals

All Journal IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Jurnal Teknologi Informasi dan Ilmu Komputer KLIK (Kumpulan jurnaL Ilmu Komputer) (e-Journal) JOIN (Jurnal Online Informatika) Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Indonesian Journal of Information System BAREKENG: Jurnal Ilmu Matematika dan Terapan Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI) Jurnal Aplikasi Statistika & Komputasi Statistik AL-ULUM: JURNAL SAINS DAN TEKNOLOGI Jurnal Nasional Teknik Elektro dan Teknologi Informasi Prosiding Seminar Nasional Official Statistics PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND OFFICIAL STATISTICS

Suadaa, Lya Hulliyyatus

Politeknik Statistika STIS

Author-ID : 2899463

Agriculture, Biological Sciences & Forestry Humanities Chemical Engineering, Chemistry & Bioengineering Computer Science & IT Control & Systems Engineering Decision Sciences, Operations Research & Management Economics, Econometrics & Finance Education Electrical & Electronics Engineering Energy Engineering Mathematics Mechanical Engineering Physics Social Sciences Transportation

Published : 24 Documents Claim Missing Document

Claim Missing Document

Articles

1 2 3

Penerapan Text Augmentation untuk Mengatasi Data yang Tidak Seimbang pada Klasifikasi Teks Berbahasa Indonesia Rahma, Iftitah Athiyyah; Suadaa, Lya Hulliyyatus
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 10 No 6: Desember 2023
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25126/jtiik.2023107325

Klasifikasi teks merupakan salah satu tugas yang fundamental dalam natural language processing (NLP). Dalam dunia nyata, data dan sumber daya yang tersedia untuk pengklasifikasian teks terbatas. Salah satu kendala pada data berlabel yang digunakan yaitu imbalanced data atau data yang tidak seimbang. Permasalahan data yang tidak seimbang memengaruhi kinerja dan keakuratan model karena model hanya terfokus pada data dengan label mayoritas. Sementara itu, data berlabel minoritas cenderung diklasifikasikan tidak tepat oleh model, padahal untuk beberapa kasus kemampuan model untuk memprediksi data dengan label minoritas lebih penting. Untuk mengatasinya, penelitian ini melakukan pendekatan oversampling yaitu menambah data untuk menyeimbangkan dataset. Penerapan oversampling pada data teks dikenal dengan text augmentation. Pada penelitian ini dilakukan dua teknik text augmentation yaitu synonym replacement dan back translation pada beberapa kondisi ketidakseimbangan dan skenario augmentasi terhadap dua dataset. Berdasarkan hasil eksperimen, augmentasi mampu meningkatkan skor F1 label minoritas. Augmentasi lebih signifikan dalam dataset kecil dan kondisi ketidakeimbangan yang parah. Hasil dari teknik back translation lebih baik dibandingkan dengan teknik synonym replacement. Selain itu, hasil penelitian menunjukkan bahwa skenario jumlah augmentasi juga berpengaruh terhadap kenaikan skor F1. Semakin banyak jumlah data augmentasi belum tentu memberikan hasil yang semakin baik karena terindikasi overfitting pada data latih. Kata-kata yang tidak normal atau tidak baku pada dataset teks informal memengaruhi proses augmentasi sehingga hasil teks sintetis yang diperoleh tidak sebaik pada dataset teks formal. Abstract Text classification is one of the fundamental tasks in natural language processing (NLP). However, data and resources for text classification are limited in actual application. One of the constraints on the dataset for text classification is imbalanced data, or the condition when one label has more data than the others. Imbalanced data affects the performance and accuracy of the model because the model only focuses on the majority label data. Meanwhile, the minority label data tends to be classified incorrectly by the model, even though, in some cases, the model's ability to predict data with minority labels is more important. To solve this problem, this research uses an oversampling approach to augment data and balance the dataset. The application of oversampling text data is known as text augmentation. This research uses two text augmentation techniques, synonym replacement and back translation, applied to several imbalance conditions and augmentation scenarios for two datasets. Based on experimental results, augmentation can increase the F1 score of the minority class. Augmentation is more significant in small datasets and severe imbalance conditions. The results of the back translation technique are better than synonym replacement. In addition, this study's results show that the number of augmentation scenarios affects an increase in F1-score. However, increasing the augmentation data cannot ensure the results are getting better. Furthermore, words that are not normal in informal text datasets affect the augmentation process, so the results of synthetic text are better than the formal text dataset.

Automated Essay Scoring Menggunakan Semantic Textual Similarity Berbasis Transformer Untuk Penilaian Ujian Esai Pradani, Kharisma Ayu; Suadaa, Lya Hulliyyatus
Jurnal Teknologi Informasi dan Ilmu Komputer Vol 10 No 6: Desember 2023
Publisher : Fakultas Ilmu Komputer, Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25126/jtiik.2023107338

Ujian berbasis esai seringkali digunakan untuk menguji pemahaman siswa dalam menyelesaikan permasalahan. Tak terkecuali dalam pelaksanaan ujian di Politeknik Statistika STIS. Dalam melakukan penilaian pada jawaban tipe ini, dibutuhkan waktu serta tenaga yang besar, dan sering kali menimbulkan ketidakkonsistenan dalam penilaian. Hal ini dapat terjadi salah satunya karena perbedaan cara penilaian yang dilakukan oleh orang yang berbeda. Oleh karena itu diperlukan penyelesaian yang bisa mengefektifkan waktu, tenaga serta menjaga kekonsistenan aspek penilaian, diantaranya yaitu dengan automated essay scoring (AES). AES merupakan suatu model yang dilatih untuk menilai suatu esai secara otomatis berdasarkan kemiripan jawaban dengan kunci jawaban. Pada penelitian ini, metode yang diusulkan untuk menghitung kemiripan semantik teks berbahasa Indonesia antara jawaban esai dan kunci jawabannya yaitu model berbasis Transformers IndoBERT. Sebagai baseline, digunakan teknik ekstraksi fitur Term Frequency - Inverse Document Frequency (TF-IDF) dan penghitungan kemiripan fitur menggunakan cosine similarity dan linear regression. Selanjutnya nilai kemiripan tersebut dikonversi ke rentang nilai yang diinginkan sebagai prediksi nilai dari setiap esai. Berdasarkan hasil evaluasi, diperoleh bahwa model fine-tuned IndoBERT merupakan model terbaik dengan nilai MAE dan RMSE sebesar 0.1285 dan 0.2001. Abstract Essay-based exams are often used to test students’ understanding of solving problems. However, assessing this type of answer takes a lot of time and effort and often results in inconsistencies. One of the reasons is the different ways between people while doing the assessment. Therefore, a solution is needed to streamline time, effort, and maintain consistency in aspects of assessment, including automated essay scoring (AES). AES is a model trained to assess an essay automatically based on the similarity of answers with the answer key. In this study, the method proposed to calculate the semantic similarity of Indonesian text between essay answers and answer keys is a model based on the Transformer BERT. As a baseline, the Term Frequency – Inverse Document Frequency (TF-IDF) feature extraction technique is used and calculating feature similarity using cosine similarity and linear regression. Then the similarity value is converted to the desired range of values as the predicted value of each essay. Based on the evaluation results, it was found that the fine-tuned IndoBERT model was the best model, with MAE and RMSE values of 0.1285 and 0.2001.

Pembangunan Dataset Sintetis Klasifikasi Baku Lapangan Usaha Indonesia 2020 dengan Generative Artificial Intelligence Silmi Kaffah, M. Ihsan; Rahman, Dimas Haafizh; Amnur, Muh. Alfian; Montolalu, Cloudya Qashwah; Siregar, Amir Mumtaz; Sinulingga, Geraldo Benedictus; Ayu Alistin, Zharifah Dhiya; Raihannur, Cut Indah; Putri Arivia, Anggi Marya; Rahmawati, Arih; Nauli Sihombing, Fiona Audia; Salsabiela, Rahmadika Kemala; Bahy, Sabastian Alfons; Suadaa, Lya Hulliyyatus; Choir, Achmad Syahrul
Seminar Nasional Official Statistics Vol 2025 No 1 (2025): Seminar Nasional Official Statistics 2025
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/semnasoffstat.v2025i1.2581

The limited quality datasets is a fundamental challenge in developing automatic classification of business description into the Indonesia Standard Industrial Classification (KBLI) using machine learning models. This research aims to develop a synthetic KBLI dataset using Generative AI via ChatGPT chatbot with a one-shot prompting technique. This technique is employed to generate business descriptions based on five-digit KBLI codes in order to address the limitations of labeled data and the variability of existing business descriptions. The dataset generated through prompt engineering and manual validation shows that 93,25% of the business descriptions align with the established KBLI standards. The average number of business descriptions per category demonstrates a fairly uniform distribution, ensuring sufficient representation for each five-digit code. This research makes a significant contribution in providing a dataset for training machine learning models in the automatic classification of business descriptions into the five-digit KBLI categories.

SPAN-LEVEL ASPECT-BASED SENTIMENT TRIPLET ANALYSIS IN GOVERNMENT APPLICATION REVIEWS Feza Raffa Arnanda; Lya Hulliyyatus Suadaa; Avi Rudianita Indah Dg Widya; Setia Irham Pramana
AL ULUM: JURNAL SAINS DAN TEKNOLOGI Vol 12, No 1 (2026)
Publisher : UPT Publication and Journal Management, Islamic University of Kalimantan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31602/jst.v12i1.22767

The government is enhancing digital public services through mobile applications in line with the Electronic-Based Government System (SPBE) 2018–2025 vision. To support continuous innovation, the Ministry of Administrative and Bureaucratic Reform (Kemenpan-RB) organize the Public Service Innovation Competition (KIPP). Understanding user complaints is essential, and aspect-based sentiment analysis, particularly Span-Level Aspect Sentiment Triplet Extraction (Span-ASTE), was applied to analyze government app reviews. A domain-specific dataset was developed with a Cohen’s Kappa of 0.817, indicating strong annotation reliability. IndoBERT-large achieved the highest F1-score of 0.76, while IndoBERT-lite-base provided an efficient alternative with an F1-score of 0.727. An aspect categorization model reached 0.86 accuracy. These models aim to improve public services, strengthen SPBE implementation, and enhance Indonesia’s E-Government Development Index ranking.

Co-Authors Adilla, Rahmi Elfa Amanda Tabitha Bulan Panjaitan Amnur, Muh. Alfian Arie Wahyu Wijayanto Avi Rudianita Indah Dg Widya Ayu Alistin, Zharifah Dhiya Bahy, Sabastian Alfons Berliana Sugiarti Putri Choir, Achmad Syahrul Cynthia As Bahri Efri Diah Utami Feza Raffa Arnanda Hana Raihanatul Jannah Huraira, Sabit Ibnu Santoso Iftitah Athiyyah Rahma Indah Simbolon Maghfiroh, Lutfi Rahmatuti Maulidya, Luthfi Monika, Anugerah Karta Montolalu, Cloudya Qashwah Muhammad Aziz Muhammad Farhan Muhammad Farhan Muhammad Huda Munaf, Alfatihah Reno Maulani Nuryaningsih Soekri Putri Nauli Sihombing, Fiona Audia Nicholas H Manurung Nugraha, Gede Putra Nur Ainun Daulay Pradani, Kharisma Ayu Pramana, Setia Putri Arivia, Anggi Marya Putri, Berliana Sugiarti Rahma, Iftitah Athiyyah Rahman, Dimas Haafizh Rahmawati, Arih Raihannur, Cut Indah Renata De La Rosa Manik Ridho, Farid Rifqi Ramadhan Rimawati, Yeni Rindang Bangun Prasetyo Rizka Maulida Yanti Salsabiela, Rahmadika Kemala Sari, Mutiara Indryan Septianugraha, Damar Setia Irham Pramana Sholawatunnisa, Dinda Pusparahmi Silmi Kaffah, M. Ihsan Sinulingga, Geraldo Benedictus Siregar, Amir Mumtaz Sugiri Sukma Andini Wilantika, Nori Wildannissa Pinasti Yanti, Rizka Maulida

Title

Found 24 Documents
Search

Abstract

Abstract

Abstract

Abstract

Title Search

Found 24 Documents Search

Abstract

Abstract

Abstract

Abstract

Title

Found 24 Documents
Search