Claim Missing Document
Check
Articles

Found 2 Documents
Search

Pitch extraction using discrete cosine transform based power spectrum method in noisy speech Sunzida, Humaira; Parvin, Nargis; Jeba, Jafrin Akhter; Chi, Sulin; Ali, Md. Shiplu; Rahman, Moinur; Rahman, Md. Saifur
International Journal of Advances in Applied Sciences Vol 14, No 3: September 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijaas.v14.i3.pp955-965

Abstract

The pitch period is a key component of many speech analysis research projects. In real-world applications, voice data is frequently gathered in noisy surround- ings, therefore algorithms must be able to manage background noise well in order to estimate pitch accurately. Despite advancements, many state-of–the-art algorithms struggle to deliver adequate results when faced with low signal-to- noise ratios (SNRs) in processing noisy speech signals. This research proposes an effective concept specifically designed for speech processing applications, particularly in noisy conditions. To achieve this goal, we introduce a fundamen- tal frequency extraction algorithm designed to tolerate non-stationary changes in the amplitude and frequency of the input signal. In order to improve the extrac- tion accuracy, we also use a cumulative power spectrum (CPS) based on discrete cosine transform (DCT) rather than conventional power spectrum. We enhance extraction accuracy of our method by utilizing shorter sub-frames of the input signal to mitigate the noise characteristics present in speech signals. According to the experimental results, our proposed technique demonstrates superior per- formance in noisy conditions compared to other existing state-of-the-art meth- ods without utilizing any kind of post-processing techniques.
Fundamental frequency extraction by utilizing modified BaNa in noisy speech Saha, Arpita; Parvin, Nargis; Rahman, Md. Saifur; Rahman, Moinur; Chowdhury, Any
International Journal of Advances in Applied Sciences Vol 13, No 3: September 2024
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijaas.v13.i3.pp515-529

Abstract

A sound’s pitch can be largely understood and perceived by using its fundamental frequency. Multiple algorithms have been developed for extracting fundamental frequency, and the choice of which one to employ depends on the noise and features of the signal. Therefore, for an accurate fundamental frequency estimate, the noise resistance of the algorithm becomes even more crucial. Still, many of the most advanced algorithms fail to produce acceptable results when faced with loud speech recordings that have low signal-to-noise ratios (SNRs). In this research paper, we focus on the harmonic selection step in BaNa method, which is one of the vital parts for enhancing the extraction accuracy of fundamental frequency (F0) in noisy situations. BaNa algorithm always emphasizes 5 harmonics on average for both male and female speakers. However, our observation reveals that relying on 5 harmonics is inadequate for male speakers in noisy conditions. Thus, we propose a new idea based on BaNa that separately utilizes the 3 harmonics for male speakers and 5 harmonics for female speakers to achieve accurate pitch extraction within noisy environments. The results demonstrate that our proposed approach attains the lowest rate of gross pitch error (GPE) across various noise types and SNR levels.