Media Informatika
Vol 23 No 1 (2024)

Implementasi CNN-LSTM untuk Music Captioning

Diarsyah, M. Ghazali (Unknown)
Setiawan, Dhanny (Unknown)



Article Info

Publish Date
24 Jun 2024

Abstract

Music has become an integral part of human life, extending its influence across various industries. For many, music is considered a necessity. With the rise of neural network technology, Music Information Retrieval (MIR) has gained prominence as a multidisciplinary field focused on processing music information and its applications. One popular approach for music captioning is the multimodal encoder-decoder architecture, which utilizes the CNN-LSTM algorithm. In this study, we develop a model that simultaneously learns from audio and text data. We explore different design choices for modality fusion, including early fusion, late fusion, and hybrid fusion, to assess their impact

Copyrights © 2024






Journal Info

Abbrev

media-informatika

Publisher

Subject

Computer Science & IT

Description

Media Informatika is a scientific journal published by Pusat Penelitian dan Pengabdian pada Masyarakat (P3M) STMIK LIKMI. Media Informatika aims to publish research results or equivalent to the results of research, thoughts and views, popular knowledge in the fields of informatics, information ...