JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika)
Vol 10, No 4 (2025)

DEEP LEARNING-BASED ENVIRONMENTAL SOUND CLASSIFICATION USING TUNED MOBILEVIT WITH COMBINED AUGMENTATION TECHNIQUES

Slameta, Slameta
Rahmatullah, Griffani Megiyanto
Karostiani, Novia
Budiana, Mochamad Soebagja
Hartono, R.W. Tri



Article Info

Publish Date
01 Dec 2025

Abstract

Classifying environmental sounds is challenging because such sounds are inherently unstructured. This research introduces a deep learning method for classifying urban audio using the MobileViT architecture, a versatile, lightweight model suited to a range of deep learning applications. The study uses the UrbanSound8k dataset, augmented with noise injection, time stretching, pitch modulation, and mixup. These augmentation techniques are essential given the dataset's limited size and help produce a more robust model for practical applications. After augmentation, the audio is preprocessed to a standard length and transformed into mel spectrograms, making it compatible with MobileViT's input requirements. The model is trained with both standard and optimized parameters, reaching peak accuracy above 80%. Combining augmented data with parameter optimization yields an improvement of approximately 15% over the baseline MobileViT configuration while preserving fast inference of roughly 7 milliseconds. The findings indicate that MobileViT is a promising solution for various environmental sound applications.
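
As a rough illustration of the preprocessing described in the abstract, the sketch below shows length standardization, noise injection, and mixup using only NumPy. It is not the authors' implementation. The sampling rate, clip length, SNR, the Beta parameter `alpha`, and the choice to apply mixup to raw waveforms rather than to spectrograms are all assumptions made here for illustration; the function names are hypothetical. Time stretching, pitch modulation, and the mel spectrogram conversion are omitted because they would normally rely on an audio library (for example librosa's `effects.time_stretch`, `effects.pitch_shift`, and `feature.melspectrogram`).

```python
import numpy as np

SAMPLE_RATE = 22050   # assumed sampling rate in Hz; not stated in the abstract
CLIP_SECONDS = 4.0    # assumed target length; UrbanSound8K clips are at most 4 s
NUM_CLASSES = 10      # UrbanSound8K has 10 sound classes


def fix_length(audio, target_len):
    """Zero-pad or truncate a 1-D signal to exactly target_len samples."""
    if len(audio) >= target_len:
        return audio[:target_len]
    return np.pad(audio, (0, target_len - len(audio)))


def add_noise(audio, snr_db, rng):
    """Inject white Gaussian noise at the requested signal-to-noise ratio (dB)."""
    signal_power = np.mean(audio ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=audio.shape)
    return audio + noise


def mixup(x1, y1, x2, y2, alpha, rng):
    """Blend two examples and their one-hot labels with lam ~ Beta(alpha, alpha)."""
    lam = rng.beta(alpha, alpha)
    mixed_x = lam * x1 + (1.0 - lam) * x2
    mixed_y = lam * y1 + (1.0 - lam) * y2
    return mixed_x, mixed_y, lam


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target_len = int(SAMPLE_RATE * CLIP_SECONDS)

    # Two synthetic 1-second tones stand in for real dataset clips.
    t = np.arange(SAMPLE_RATE) / SAMPLE_RATE
    clip_a = fix_length(np.sin(2 * np.pi * 440.0 * t), target_len)
    clip_b = fix_length(np.sin(2 * np.pi * 880.0 * t), target_len)

    noisy_a = add_noise(clip_a, snr_db=20.0, rng=rng)

    label_a = np.eye(NUM_CLASSES)[0]
    label_b = np.eye(NUM_CLASSES)[3]
    mixed_x, mixed_y, lam = mixup(noisy_a, label_a, clip_b, label_b, 0.2, rng)
    print(mixed_x.shape, mixed_y.round(3), round(float(lam), 3))
```

The mixed label keeps the same weights as the mixed signal, so a clip blended with weight `lam` contributes `lam` to its class in the soft target. A fixed-length waveform produced this way could then be converted to a mel spectrogram before being passed to the model.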

Copyrights © 2025






Journal Info

Subject

Computer Science & IT Education

Description

JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), e-ISSN 2540-8984, was established to publish scientific work in the form of research results or papers, particularly in the field of Information Technology. JIPI is a journal that is managed by the ...