Movie rating is often used as an indicator of film quality and audience satisfaction. With the large availability of movie data on online platforms, machine learning techniques can be used to analyze the relationship between film characteristics and rating patterns. One important attribute that can influence movie ratings is genre. This study aims to classify movie ratings based on genre using the XGBoost and LightGBM algorithms and to analyze the contribution of each genre using SHAP (SHapley Additive Explanations). Movie data were collected from The Movie Database (TMDB) API and processed through several preprocessing stages including genre separation, data cleaning, one-hot encoding, and rating categorization. The dataset was then divided into training and testing data with a ratio of 70:30. The classification results show that XGBoost achieved an accuracy of 0.53, slightly higher than LightGBM with an accuracy of 0.52. Further analysis using SHAP indicates that genres such as Horror, Drama, Action, and Comedy have the highest global importance in the classification model. Meanwhile, the analysis of high-rating class predictions shows that Drama has the largest contribution to predicting movies with high ratings. The findings indicate that movie genres have a measurable influence on rating classification, although the importance of genres in the machine learning model does not always align with their average rating values.
Copyrights © 2026