The rapid growth of AI applications such as CICI, GROK, and Gemini has resulted in a large volume of user reviews on platforms like the Google Play Store, making sentiment analysis a critical tool for understanding user perceptions. This study compares the performance of three machine learning models: Random Forest, Support Vector Machine (SVM), and Logistic Regression in classifying sentiments in 3,500 Indonesian-language reviews. A hybrid feature extraction approach, combining sentiment lexicons with TF-IDF, was applied to improve sentiment classification accuracy. The models were evaluated based on accuracy, precision, recall, and F1-score. Results indicated that all models achieved an accuracy greater than 96%, with Random Forest providing the most consistent and accurate results, achieving an overall accuracy of 99.62%. While SVM excelled in classifying positive and negative sentiments, it faced challenges with neutral reviews due to the ambiguity and overlap in sentiment expression. Logistic Regression also showed strong performance, especially on structured reviews. The findings suggest that Random Forest is the most robust and reliable model for sentiment analysis, particularly in handling diverse AI application reviews. These results offer practical insights for developers seeking to improve application performance by leveraging sentiment analysis on user feedback.