International Journal of Information Technology and Computer Science Applications (IJITCSA)
Vol. 2 No. 3 (2024): September - December 2024

Hadoop Ecosystem Enhances Data Analytics for Music Streaming: A Case Study of User Behavior in the Last FM Dataset

Elizade, Akkord (Unknown)



Article Info

Publish Date
26 Nov 2024

Abstract

This paper proposed a big data pipeline to analyze user behavior on Last.fm which aims to make data-driven recommendations for improving user engagement and attracting new users. The comprehensive analysis of user behavior in the music streaming industry using the Hadoop ecosystem and data analytics techniques. Specifically, the study focuses on Last.fm, a popular music streaming platform that collects large amounts of user activity data. The article proposes a new data pipeline utilizing Hadoop Distributed File System (HDFS) for data storage and Apache Pig for data transformation, leading to improved data preprocessing and analysis. Various analyses are conducted, including identifying the most listened to artists, top users based on song consumption and social connections, artist popularity by tags, and the most recently tagged artists. The findings provide valuable insights into user preferences, current trends, and opportunities for enhancing the recommendation algorithm and user engagement. The article concludes by offering recommendations for personalized marketing strategies and curated playlists to increase user satisfaction and revenue.

Copyrights © 2024






Journal Info

Abbrev

jitcsa

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Education Engineering

Description

he Journal of Information Technology and Computer Science Applications (JITCSA) is an information technology and computer science publication. Applications from both fields for solving real cases are also welcome. JITCSA accepts research articles, systematic reviews, literature studies, and other ...