This study aims to analyze the factors that influence students' interest in continuing their education to college using a machine learning approach. Data was collected through an online questionnaire completed by 727 students between July 27 and August 22, 2025, covering 23 variables consisting of respondent identity (gender, grade level, major) as well as internal and external factors such as parental support, learning motivation, and preferred type of college. The data preparation stage was carried out through column cleaning, deletion of empty data, encoding of categorical variables, and division of the dataset into 80% training data and 20% test data. The Naive Bayes algorithm of the CategoricalNB type was used because it was suitable for the categorical nature of the data. The evaluation results showed that the model was able to predict student interest with 96% accuracy. For the class of students interested in continuing their studies, the precision, recall, and F1-score values were above 0.95, while the performance in the class of students who were not interested was slightly lower due to the smaller amount of data. These findings show that Naive Bayes is proven to be effective and reliable in classifying students' interest in continuing their studies and can be the basis for decision-making in designing more targeted educational strategies.
Copyrights © 2026