JIEET (Journal of Information Engineering and Educational Technology)
Vol. 9 No. 1 (2025)

Automated Chest X-Ray Captioning Using Pretrained Vision Transformer with LSTM and Multi-Head Attention

Aulia Akbar, Rafy (Unknown)
Putra, Ricky Eka (Unknown)
Yustanti, Wiyli (Unknown)



Article Info

Publish Date
01 Jul 2025

Abstract

Radiology report generation is a complex and error-prone task, especially for radiologists with limited experience. To overcome this, this study aims to develop an automated system for generating text-based radiology reports using chest X-ray images. The proposed approach combines computer vision and natural language processing through an encoder-decoder architecture. As an encoder, a Vision Transformer (ViT) model trained on the CheXpert dataset is used to extract visual features from X-ray images after Gamma Correction is performed to improve image quality. In the decoder section, word embeddings from the report text are processed using Long Short-Term Memory (LSTM) to capture word order relationships, and enriched with Multi-Head Attention (MHA) to pay attention to important parts of the text. Visual and text features are then combined and passed to a dense layer to generate text-based radiology reports. The evaluation results show that the proposed model achieves a ROUGE-L score of 0.385, outperforming previous models. The BLEU-1 score also shows competitive results with a value of 0.427. This study shows that the use of pre-trained ViT, combined with LSTM-MHA on the decoder, provides excellent performance in capturing visual and semantic context of text, as well as improving accuracy and efficiency in radiology report automation.

Copyrights © 2025






Journal Info

Abbrev

jieet

Publisher

Subject

Computer Science & IT Engineering

Description

Journal Description: JIEET (Journal of Information Engineering and Educational Technology) is a scientific journal that publishes the peer-reviewed research papers in the field of Computer Engineering, Distributed and Parallel Systems, Business Informatics, Computer Science, Computer Security, ...