Air pollution from motorized fuel vehicles causes adverse impacts on the environment and human health, driving the need for more sustainable alternatives such as electric vehicles. However, the transition to electric vehicles is often met with mixed responses from the public, reflected by sentiments that are split between positive and negative. This research investigates such sentiments through analyzing comments on the YouTube platform, which are classified using two algorithms, SVM and Naïve Bayes, and three oversampling techniques: Random Oversampling, SMOTE, and ADASYN. A comparative evaluation is conducted to determine the most effective algorithm and oversampling strategy for handling imbalanced sentiment data, where negative comments dominate. Initial experiments showed that Naïve Bayes with SMOTE achieved the best result among baseline models, with 64% accuracy. However, traditional oversampling methods alone were not sufficient to significantly improve classification quality. To address this, the study proposes a hybrid method that combines Easy Data Augmentation (EDA), specifically Synonym Replacement (SR), with oversampling techniques. The proposed method substantially improved performance. Naïve Bayes combined with SR and SMOTE or Random Oversampling achieved 88% accuracy, with F1-scores of 0.84–0.85 for the positive class. The best result was obtained using SVM with SR and Random Oversampling, reaching 97% accuracy and F1-scores of 0.97 (negative) and 0.96 (positive). These findings demonstrate the effectiveness of combining augmentation and oversampling in improving sentiment classification and provide insights for stakeholders in promoting EV adoption.
                        
                        
                        
                        
                            
                                Copyrights © 2025