Training agents is an intriguing research topic because humans struggle to maintain consistent performance in games such as Flappy Bird. This study compares two action selection methods, Softmax and Upper Confidence Bound (UCB), to improve agent performance. Both methods were tested in a Flappy Bird environment built on Gymnasium and evaluated using average score, highest score, average steps, and Q-value. The final results indicate that Softmax tends to explore early in training but converges toward the end, whereas UCB tends to exploit early, leading to stagnant scores. A t-test found no significant difference between the performance of the two action selection methods. This study offers guidance for choosing action selection methods for agents in simple games.
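To make the two compared strategies concrete, below is a minimal sketch of Softmax and UCB action selection over a discrete action set. The function names, the temperature and exploration constants, and the use of NumPy are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax_action(q_values, temperature=1.0):
    # Sample an action with probability proportional to exp(Q / tau);
    # high temperature -> near-uniform exploration, low -> greedy.
    prefs = np.asarray(q_values, dtype=float) / temperature
    prefs -= prefs.max()          # subtract max for numerical stability
    probs = np.exp(prefs)
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

def ucb_action(q_values, counts, t, c=2.0):
    # Pick the action maximizing Q + c * sqrt(ln t / n);
    # any action never tried yet is chosen first.
    counts = np.asarray(counts, dtype=float)
    if (counts == 0).any():
        return int(np.argmax(counts == 0))
    bonus = c * np.sqrt(np.log(t) / counts)
    return int(np.argmax(np.asarray(q_values, dtype=float) + bonus))
```

For Flappy Bird the action set would be binary (flap or do nothing), so `q_values` would hold two entries per state; the UCB bonus shrinks as an action's count grows, which matches the early-exploitation behavior the abstract reports.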
Copyright © 2025