Jurnal Inovasi Teknologi dan Edukasi Teknik
Vol. 5 No. 4 (2025)

COMPARATIVE ANALYSIS OF SOFTMAX AND UPPER CONFIDENCE BOUND IN GAME-PLAYING AGENTS FOR FLAPPY BIRD

Arifin, M. Zainal (Unknown)
Yusuf, M. Baharuddin (Unknown)



Article Info

Publish Date
01 May 2025

Abstract

Training agents is an intriguing research topic due to human limitations in maintaining consistent performance, particularly in the game Flappy Bird. This study compares action selection methods, namely Softmax and Upper Confidence Bound (UCB), to enhance agent performance in action selection. Testing was conducted using both methods in a Flappy Bird environment based on Gymnasium. Evaluation was performed using metrics such as average score, highest score, average steps, and Q-value. The final results indicate that Softmax tends to explore early in training but achieves convergence toward the end, whereas UCB tends to exploit early, leading to stagnant scores. Based on t-test results, no significant difference was found in the performance of the two action selection methods. This study provides guidance on selecting action selection methods for agents in simple games.

Copyrights © 2025






Journal Info

Abbrev

ft

Publisher

Subject

Automotive Engineering Chemical Engineering, Chemistry & Bioengineering Civil Engineering, Building, Construction & Architecture Electrical & Electronics Engineering Industrial & Manufacturing Engineering

Description

Jurnal Inovasi Teknologi dan Edukasi Teknik menerbitkan naskah terkait Teknik Sipil, Teknologi Industri, Teknik Mesin, Teknik Elektro, dan Pendidikan Kejuruan. Fokus dan lingkup jurnal meliputi Teknik Sipil, Teknologi Industri, Teknik Mesin, Teknik Elektro, dan Pendidikan ...