Radiology report generation is a complex and error-prone task, especially for radiologists with limited experience. To address this, this study develops an automated system that generates text-based radiology reports from chest X-ray images. The proposed approach combines computer vision and natural language processing in an encoder-decoder architecture. The encoder is a Vision Transformer (ViT) pre-trained on the CheXpert dataset, which extracts visual features from the X-ray images after gamma correction is applied to improve image quality. In the decoder, word embeddings of the report text are processed by a Long Short-Term Memory (LSTM) network to capture word-order relationships and enriched with Multi-Head Attention (MHA) to attend to the important parts of the text. The visual and textual features are then concatenated and passed to a dense layer that generates the report text. Evaluation shows that the proposed model achieves a ROUGE-L score of 0.385, outperforming previous models, along with a competitive BLEU-1 score of 0.427. These results indicate that a pre-trained ViT encoder combined with an LSTM-MHA decoder effectively captures both the visual context and the semantic context of the text, improving the accuracy and efficiency of automated radiology reporting.
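The sketch below illustrates this encoder-decoder pipeline, assuming a PyTorch implementation. It is not the authors' code: the gamma value, embedding and feature dimensions, layer names, and the torchvision `vit_b_16` stand-in encoder are all illustrative assumptions (the paper's ViT is pre-trained on CheXpert, for which no off-the-shelf weights are assumed here).

```python
import torch
import torch.nn as nn
from torchvision.models import vit_b_16

def gamma_correction(img: torch.Tensor, gamma: float = 0.8) -> torch.Tensor:
    """Adjust image intensity in [0, 1] before feature extraction (gamma assumed)."""
    return img.clamp(0.0, 1.0) ** gamma

class ReportDecoder(nn.Module):
    """LSTM over report-token embeddings, enriched with multi-head attention,
    then fused with the visual feature and passed to a dense output layer."""
    def __init__(self, vocab_size: int, embed_dim: int = 256,
                 visual_dim: int = 768, num_heads: int = 8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, embed_dim, batch_first=True)
        self.mha = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.proj = nn.Linear(visual_dim, embed_dim)
        self.out = nn.Linear(2 * embed_dim, vocab_size)  # dense layer over fused features

    def forward(self, tokens: torch.Tensor, visual: torch.Tensor) -> torch.Tensor:
        x = self.embed(tokens)              # (B, T, E) word embeddings
        x, _ = self.lstm(x)                 # capture word-order relationships
        x, _ = self.mha(x, x, x)            # attend to salient parts of the text
        v = self.proj(visual).unsqueeze(1)  # (B, 1, E) projected visual feature
        fused = torch.cat([x, v.expand_as(x)], dim=-1)  # combine text + image
        return self.out(fused)              # (B, T, vocab) next-token logits

# Stand-in ViT encoder; the paper pre-trains its ViT on CheXpert instead.
encoder = vit_b_16(weights=None)
encoder.heads = nn.Identity()               # expose the 768-d class-token feature

images = gamma_correction(torch.rand(2, 3, 224, 224))
visual = encoder(images)                    # (2, 768)
logits = ReportDecoder(vocab_size=5000)(torch.randint(0, 5000, (2, 20)), visual)
```

At inference time the decoder would run autoregressively, feeding back its own predicted tokens; the single forward pass above only illustrates how the visual and textual features flow through the model.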