Indonesian Journal on Computing (Indo-JC)
Vol. 10 No. 1 (2025): August, 2025

Hyperparameter Optimization Analysis of MultinomialNB and Logistic Regression in Multi‑Feature Text‑Based Film Genre Classification

Shabrio Cahyo Wardoyo (Universitas Mercu Buana)
Umniy Salamah (Unknown)



Article Info

Publish Date
01 Aug 2025

Abstract

This study aims to analyze and compare the performance of two text classification algorithms Multinomial Naive Bayes (MNB) and Logistic Regression (LR)—for film genre classification using multi-feature text data, both with and without hyperparameter optimization. Film genres play a crucial role in digital content recommendation systems; however, manual classification is subjective and time-consuming. The dataset, obtained from Letterboxd via Kaggle, includes film titles, descriptions, and themes. After preprocessing and text normalization (tokenization, lemmatization, and stemming), the text data were transformed into numerical features using the TF-IDF method. Two modeling scenarios were applied: the first using default parameters, and the second employing GridSearchCV to find the optimal hyperparameter settings. Model performance was evaluated using accuracy, precision, recall, and F1-score. The results indicate that the optimized LR model achieved the highest accuracy of 0.847, followed by the optimized MNB model with an accuracy of 0.837. This study concludes that hyperparameter optimization significantly improves model performance and that LR outperforms MNB in the context of multi-feature text-based genre classification.

Copyrights © 2025






Journal Info

Abbrev

indojc

Publisher

Subject

Computer Science & IT Control & Systems Engineering Education Engineering

Description

Indonesian Journal on Computing (Indo-JC) is an open access scientific journal intended to bring together researchers and practitioners dealing with the general field of computing. Indo-JC is published by School of Computing, Telkom University (Indonesia). The journal coverage includes, but is not ...