International Journal for Applied Information Management
Vol. 5 No. 2 (2025): Regular Issue: July 2025

Clustering Netflix Shows Based on Features Using K-means and Hierarchical Algorithms to Identify Content Patterns

Hayadi, B Herawan (Unknown)
Priyanto, Eko (Unknown)



Article Info

Publish Date
01 Jul 2025

Abstract

This study explores clustering patterns within Netflix's movie catalog by applying K-means and hierarchical clustering algorithms. The primary objective is to identify distinct content groups based on features such as movie duration, release year, and content ratings. The dataset, which includes 5,185 Movies, was preprocessed by handling missing values, one-hot encoding categorical variables, and standardizing numerical features. Four distinct clusters were identified, with each cluster exhibiting unique characteristics. Cluster 0 primarily consists of longer, family-friendly Movies rated TV-14, while Cluster 1 contains shorter, mature Movies with a TV-MA rating. Cluster 2 represents a diverse range of TV-MA Movies with moderate durations, and Cluster 3 focuses on adult-oriented, longer Movies with an 'R' rating. These findings offer valuable insights into Netflix's content strategy, highlighting the platform's ability to cater to different audience segments based on content type and viewer preferences. The results suggest that Netflix can leverage clustering patterns to improve its recommendation system and content acquisition strategy. However, the study is limited by the absence of user-specific data and the reliance on basic metadata features. Future research could explore the integration of additional features like user ratings and apply deep learning techniques for more sophisticated clustering.

Copyrights © 2025






Journal Info

Abbrev

ijaim

Publisher

Subject

Humanities Decision Sciences, Operations Research & Management Economics, Econometrics & Finance Environmental Science Social Sciences

Description

Journal menerbitkan penelitian tentang semua aspek manajemen informasi. Informasi dilihat di sini secara luas untuk mencakup tidak hanya produk/layanan dan proses tetapi juga pasar, dan organisasi serta informasi sosial. Ini termasuk studi tentang proses secara keseluruhan atau tahap individu, ...