The rapid growth of the women's fashion industry in the digital era has intensified the need for data-driven approaches to understand customer preferences. This study aims to classify women’s clothing products based on customer reviews by applying CatBoost, a gradient boosting algorithm known for its strong performance with categorical features. The dataset, consisting of 23,486 entries and 11 attributes, was obtained from Kaggle and processed through data cleaning, normalization, exploratory analysis, and model training. Hyperparameter optimization was conducted using Grid Search. Model performance was evaluated using accuracy, precision, recall, and F1-score, and benchmarked against four traditional classifiers: Decision Tree (C4.5), Naïve Bayes, Support Vector Machine (SVM), and K-Nearest Neighbor (KNN). The results show that CatBoost achieved an accuracy of 93.70%, an F1-score of 0.9606, and an AUC of 0.9691, indicating excellent and balanced classification performance. This study demonstrates the effectiveness of CatBoost in handling customer review data and contributes to the development of intelligent product classification systems in the fashion industry
Copyrights © 2025