Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Jurnal Algoritma

Optimasi Algoritma Pemilihan Soal pada POMDP Berbasis Advantage Actor-Critic untuk Model Ujian Adaptif Anggriani, Epri; Setyo Utomo, Fandy
Jurnal Algoritma Vol 22 No 2 (2025): Jurnal Algoritma
Publisher : Institut Teknologi Garut

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33364/algoritma/v.22-2.3004

Abstract

Evaluation in learning through assessment plays an important role as a measure of success and assesses student competency achievement. In this context, CAT as an IRT-based adaptive assessment solution has been widely used, but has technical limitations such as heuristic question selection, dependence on question banks, and being undimensional. In addition, to solve decision-making problems in the context of adaptive testing, a general approach that can be used is policy-based reinforcement learning, such as policy gradient, particularly the REINFORCE algorithm. However, this algorithm has limitations such as high gradient variance and lacks a state-value function evaluation mechanism, making it unable to provide direct feedback on the quality of the actions taken. The purpose of this study is to optimize adaptive decision making in the POMDP framework using the Advantage Actor-Critic (A2C) algorithm, one of the Reinforcement Learning approaches. The actor generates a question selection policy based on the belief state of the NCDM model, while the critic evaluates the quality of actions to maximize cumulative rewards. The results show that in an adaptive environment, A2C performs better than the baseline, with an accuracy of 0.952 and an average reward of 18.56 in 20-question episodes, and an accuracy of 0.934 and a reward of 22.58 in 25-question episodes. In contrast, the baseline only achieved an average accuracy of around 0.789 and 0.760 in the 20 and 25 question episodes, and a reward of 14.19 and 16.80 in the 20 and 25 question episodes. The results of the study show that optimization with A2C can improve the personalization of exam question selection. This study contributes to the development of a more effective adaptive exam model, while also opening up opportunities for further research.
Co-Authors Adiatma, Febriansyah Husni Adiya, Az Zahra Dwi Nur Afit Ajis Solihin Aisha Hukama Setyowati Aji Saeful Aji Septa, Adrian Ajis Solihin, Afit Amar Al Farizi Anas Nur Khafid Anggini, Melisa Anggraeni, Mutia Dwi Anggraini, Nova Anggriani, Epri Azhari Shouni Barkah Azmi, Mohd Sanusi Bagus Adhi Kusuma Baihaqi, Wiga Maulana Balit, Muhamad Naufal Burhanuddin Berlilana Berlilana Berlilana Burhanuddin Balit, Muhamad Naufal Churil Aeni, Agustina Chyntia Raras Ajeng Widiawati Darmono Dedi Purwanto, Dedi Didi Prasetyo Dwi Krisbiantoro, Dwi Dzaky Candy Fahrezy Fadhilah, Siti Nur Filanzi, Shendy Giat Karyono Giat Karyono Hanif Hidayatulloh Hendra Marcos, Hendra hidayatulloh, hanif Ilham, Rifqi Arifin Imam Tahyudin Indriyani, Ria Jamie Mayliana Alyza Kafilla, Princess Iqlima Kusuma, Bagus Adhi Kusuma, Velizha Sandy Lasmedi Afuan Lubna, Zuhriyatul Lukita, Dita Maharani, Titi Safitri Maulana Baihaqi, Wiga Mohd Fairuz Iskandar Othman Mohd Nazrin Muhammad Mohd Sanusi Azmi Muaziz, Imam Muhamad Naufal Burhanuddin Balit Muhtyas Yugi Murtiyoso Murtiyoso Nandang Hermanto Nanna Suryana Nikmah Trinarsih Nugroho, Khabib Adi Nur Cholis Romadhon Octavia, Annisa Suci Prayoga, Fandhi Dhuga Pungkas Subarkah Purbo, Yevi Septiray Purwidiantoro, Moch. Hari Putranto, R. Vitto Mahendra Pyawai, Hero Galuh Ramadhan, Aziz Ramadhan, Rio Fadly Rifqi Arifin Ilham RR. Ella Evrita Hestiandari Rujianto Eko Saputro Safitri Maharani, Titi Sagita, Selvi Samsul Arifin Sarmini - Sarmini Sarmini Sarmini Sekhudin Sekhudin Setiabudi, Rizki Setiawan, Ito Shafira, Lulu Slamet Widodo Sofa, Nur Sri Hartini Subarkah, Pungkas Suryana, Nanna Trinarsih, Nikmah Turino, Turino Utomo, Dadang Wahyu Wahid, Arif Mu'amar Wibisono, Arif Cahyo Wiga Maulana Baihaqi Yuli Purwat Yuli Purwati Yulianto, Koko Edy