Claim Missing Document
Check
Articles

Found 2 Documents
Search

Improving Infant Cry Recognition with CNNs and Imbalance Mitigation Indrawan, Michael; Luthfiarta, Ardytha; Fahreza, Muhammad Daffa Al; Rafid, Muhammad
JURNAL MEDIA INFORMATIKA BUDIDARMA Vol 8, No 2 (2024): April 2024
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/mib.v8i2.7370

Abstract

The classification of baby cries using machine learning is essential for developing automated systems that can assist caregivers in identifying and responding to the needs of infants promptly and accurately. This study aims to improve upon previous research relating to the Cry Baby Dataset, which has highly imbalanced data. We combine oversampling and undersampling techniques using SMOTE and ENN, along with data augmentation through pitch shifting and noise addition to address the data imbalance issue. The processed data was then modeled using Convolutional Neural Networks (CNN). The study yielded an overall accuracy of 88%, with balanced accuracy observed across all classes, effectively mitigating data imbalance. This represents a notable advancement compared to previous research, which often encountered challenges with unbalanced accuracies across classes. The classes identified include recordings of baby cries attributed to belly pain caused by colic, recordings related to burping, recordings associated with discomfort or other symptoms, recordings of hungry baby cries, and recordings indicating fatigue or the need for sleep. This shows a significant improvement from previous studies, which had very unbalanced accuracy for each class.
The Effect of LAB Color Space with NASNetMobile Fine-tuning on Model Performance for Crowd Detection Rafid, Muhammad; Luthfiarta, Ardytha; Naufal, Muhammad; Al Fahreza, Muhammad Daffa; Indrawan, Michael
Advance Sustainable Science, Engineering and Technology Vol 6, No 1 (2024): November-January
Publisher : Universitas PGRI Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26877/asset.v6i1.17821

Abstract

In the COVID-19 pandemic, computer vision plays a crucial role in crowd detection, supporting crowd restriction policies to mitigate virus spread. This research focuses on analyzing the impact of using the RGB LAB color space on the performance of NASNetMobile for crowd detection. The fine-tuning process, involving freezing layers in various NASNetMobile base model variations, is considered. Results reveal that the model with LAB color space outperforms model with RGB color space, with an average accuracy of 94.68% compared to 94.15%. From all the test iterations, it was found that the highest performance for the NASNetMobile model occurred when freezing 10% of the layers from the back for both model LAB and RGB color spaces, with the LAB color space achieving an accuracy of 95.4% and the RGB color space achieving an accuracy of 95.1%.