Performance of speech enhancement models in video conferences: DeepFilterNet3 and RNNoise
Maulana, Muhammad Iqbal; Raisul Akbar, Muhammad Fadhlillah; Iklima, Zendi
SINERGI Vol 29, No 2 (2025)
Publisher : Universitas Mercu Buana

DOI: 10.22441/sinergi.2025.2.001

Abstract

As remote work and online education continue to gain prominence, clear audio communication becomes crucial. Deep learning-based speech enhancement has emerged as a promising solution for processing audio in noisy environments. In this study, we conducted an in-depth analysis of two speech enhancement models, RNNoise and DeepFilterNet3, selected for their respective strengths. DeepFilterNet3 leverages time-frequency masking with a complex mask filter, while RNNoise employs recurrent neural networks with lower complexity. Evaluation during training showed that RNNoise achieved low loss values, indicating strong denoising capability, while DeepFilterNet3 showed superior generalization. Specifically, "DeepFilterNet3 (Pre-Trained)" exhibited the best overall performance, excelling in intelligibility and speech quality. RNNoise also performed well in subjective quality measures. We also assessed the real-time processing efficiency of both models: both RNNoise variants processed speech signals in near real time, whereas DeepFilterNet3, though slightly slower, remained efficient. The findings demonstrate significant improvements in speech quality, with "DeepFilterNet3 (Pre-Trained)" emerging as the top-performing model. These results have the potential to improve video conferencing and, in turn, remote work and online education.