Garuda - Garba Rujukan Digital

eScience Humanity Journal

Vol 5 No 1 (2024): eScience Humanity Journal Volume 5 Number 1 November 2024

Soleman, Soleman (Unknown)

Publish Date
30 Nov 2024

This study examines recent developments and future directions of deep learning techniques in image and speech recognition. It focuses on the analysis of neural network architectures such as Convolutional Neural Networks (CNNs) for image processing and Recurrent Neural Networks (RNNs) for speech recognition. The methodology comprises a comprehensive review of recent literature, an evaluation of the performance of various models, and a comparative analysis of existing techniques. The results show a significant improvement in recognition accuracy, with CNNs achieving up to 98% accuracy for image classification and transformer-based models outperforming traditional RNNs in speech recognition. The challenges identified include high computational requirements, reliance on high-quality datasets, and limited model interpretability. The study also proposes several directions for future development, including the integration of attention mechanisms, hybrid architectures, and more efficient learning techniques. In conclusion, despite rapid progress, there is still significant room for innovation in improving the efficiency and reliability of deep learning-based image and speech recognition systems.


Journal Info

eScience Humanity Journal


Publisher

Asosiasi Dosen Idebahasa Kepri

Subject

Arts; Humanities; Education; Language, Linguistics, Communication & Media; Social Sciences

Description

eScience Humanity Journal is an electronic and printed journal published by Asosiasi Ide Bahasa KEPRI. This journal was founded on November 7, 2020 through the Decree of the Chairman of the Association Number: 008/SK-RJeSCi/IdeBahasa/XI/2020. The scope of this journal is to publish the article ...


Deep Learning Techniques for Image and Speech Recognition: Current Trends and Future Directions
