bit-Tech
Vol. 8 No. 2 (2025): bit-Tech

Indonesian Sign Language (SIBI) Recognition from Audio Mel-Spectrograms Using LSTM Architecture

Enryco Hidayat (Universitas Pembangunan Nasional "Veteran" Jawa Timur)
Mohammad Idhom (Universitas Pembangunan Nasional "Veteran" Jawa Timur)
Afina Lina Nurlaili (Universitas Pembangunan Nasional "Veteran" Jawa Timur)



Article Info

Publish Date
10 Dec 2025

Abstract

Persistent communication barriers continue to limit the access of Deaf and Hard of Hearing (DHH) individuals to spoken language, underscoring the need for effective and inclusive translation technologies. Existing audio-to-sign language systems typically employ multi-stage pipelines involving speech-to-text transcription, which may propagate recognition errors and fail to preserve acoustic nuances. To address these limitations, this study developed and evaluated a deep learning framework that maps spoken Indonesian audio directly to classes of the Indonesian Sign Language System (SIBI), without an explicit text-conversion step. The dataset comprised 495 eight-second WAV recordings (sampled at 22,050 Hz) representing five SIBI phrase classes, augmented through time stretching, pitch shifting, and noise addition to improve generalization. Mel-Spectrogram features were extracted and fed into a stacked Long Short-Term Memory (LSTM) network implemented in TensorFlow/Keras, trained to learn temporal–spectral mappings between audio patterns and SIBI categories. Evaluation on a held-out test set showed strong performance, with 98% accuracy and consistently high precision, recall, and F1-scores. The trained model was further integrated into a prototype web application built with Flask and React, demonstrating its feasibility for real-time assistive communication. While the results highlight the viability of direct Mel-Spectrogram-to-LSTM translation for SIBI recognition, the findings are constrained by the limited dataset size and restricted speaker diversity. Future research should therefore expand the dataset to include more speakers, varied acoustic environments, and continuous-speech inputs to ensure broader applicability and real-world robustness.
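
The feature-extraction and noise-augmentation steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The FFT size, hop length, and number of Mel bands are assumptions, since the abstract does not state them, and the helper names are hypothetical. Only the 22,050 Hz sampling rate and the eight-second clip length come from the abstract. Time stretching, pitch shifting, and the LSTM classifier itself are not shown.

```python
import numpy as np

SR = 22_050        # sampling rate stated in the abstract
DURATION_S = 8     # clip length stated in the abstract
N_FFT = 2048       # assumption: not given in the abstract
HOP = 512          # assumption: not given in the abstract
N_MELS = 128       # assumption: not given in the abstract


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(sr, n_fft, n_mels):
    """Triangular Mel filters, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2, n_fft // 2 + 1)
    mel_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2))
    # Rising and falling edges of each triangle, clipped at zero.
    lower = (fft_freqs[None, :] - mel_pts[:-2, None]) / (mel_pts[1:-1] - mel_pts[:-2])[:, None]
    upper = (mel_pts[2:, None] - fft_freqs[None, :]) / (mel_pts[2:] - mel_pts[1:-1])[:, None]
    return np.maximum(0.0, np.minimum(lower, upper))


def log_mel_spectrogram(y, sr=SR, n_fft=N_FFT, hop=HOP, n_mels=N_MELS):
    """Return a time-major (frames, n_mels) log-Mel matrix, the layout an LSTM expects."""
    y = np.pad(y, n_fft // 2, mode="reflect")            # centre the first frame on sample 0
    n_frames = 1 + (len(y) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = y[idx] * np.hanning(n_fft)                  # windowed frames
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2     # power spectrum per frame
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    return 10.0 * np.log10(np.maximum(mel, 1e-10))       # dB scale with a floor


def add_noise(y, snr_db, rng):
    """Add white Gaussian noise at the requested signal-to-noise ratio (in dB)."""
    noise_power = np.mean(y ** 2) / (10.0 ** (snr_db / 10.0))
    return y + rng.normal(0.0, np.sqrt(noise_power), size=y.shape)


# Usage on a synthetic 440 Hz tone standing in for one recording.
rng = np.random.default_rng(0)
t = np.arange(SR * DURATION_S) / SR
clip = 0.5 * np.sin(2 * np.pi * 440.0 * t)
features = log_mel_spectrogram(add_noise(clip, snr_db=20.0, rng=rng))
print(features.shape)  # (345, 128)
```

Under these assumed parameters, each eight-second clip becomes a sequence of 345 time steps with 128 features per step, which a stacked LSTM would read one frame at a time before a five-way classification layer.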

Copyrights © 2025






Journal Info

Abbrev

bt

Publisher

Subject

Computer Science & IT

Description

The bit-Tech journal was established to publish the scientific work of lecturers and students, including both research papers and literature studies. It is hoped that this journal will increase the knowledge and exchange of scientific ...