This research explores improving low-resource Automatic Speech Recognition (ASR) performance by combining label preprocessing techniques with the wav2vec2-large Self-Supervised Learning (SSL) model. ASR technology plays a critical role in enhancing educational accessibility for children with disabilities in Indonesia, yet its development faces challenges due to limited labeled datasets. SSL models such as wav2vec 2.0 have shown promise by learning rich speech representations from raw audio with minimal labeled data; however, their dependence on large datasets and significant computational resources limits their application in low-resource settings. This study introduces a label preprocessing technique to address these limitations, comparing three scenarios: training without preprocessing, with the proposed preprocessing method, and with an alternative method. Using only 16 hours of labeled data, the proposed preprocessing approach achieves a Word Error Rate (WER) of 15.83%, significantly outperforming both the baseline scenario (33.45% WER) and the alternative preprocessing method (19.62% WER). Training with the proposed preprocessing technique for additional epochs further reduces the WER to 14.00%. These results highlight the effectiveness of label preprocessing in reducing data dependency while enhancing model performance. The findings demonstrate the feasibility of developing robust ASR models for low-resource languages, offering a scalable solution for advancing ASR technology and improving educational accessibility, particularly for underrepresented languages.
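For context, the Word Error Rate figures reported above follow the standard definition, where S, D, and I denote the numbers of substitutions, deletions, and insertions in the alignment of the recognized transcript against a reference transcript of N words:

\[
\mathrm{WER} = \frac{S + D + I}{N} \times 100\%
\]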