This research focuses on the development of a hybrid Convolutional Neural Network (CNN) model for sentiment analysis of movie comments, specifically designed to overcome the challenges of handling nonstandard language and slang. Slang is often an obstacle in sentiment analysis due to its non-standard nature and is difficult to recognize by traditional algorithms. By utilizing an kamusalay as a data preprocessing step, this research successfully converts slang words into standardized forms, thus improving the quality of data used in modeling. The data was collected through YouTube Data API on the comments of the movie “Pengabdi Setan 2: Communion” and processed using tokenization, stemming, stopwords removal, and TF-IDF feature extraction techniques. The hybrid model combines machine learning algorithms such as Naive Bayes, Logistic Regression, and Random Forest with CNN's ability to extract complex spatial patterns from text data. The evaluation results show that this model is able to achieve up to 95% accuracy, with consistently high precision, recall, and F1-score. This approach not only improves the accuracy of sentiment analysis, but also provides an effective solution for handling non-standard language variations, making it relevant for application in digital opinion analysis on social media.
Copyrights © 2025