Classifying mushroom species presents a significant challenge within biological data analysis because of the wide variety of species and their distinct attributes. This research investigates the effectiveness of the Decision Tree classifier for mushroom categorization by comparing two splitting criteria, the Gini Index and Entropy. Additionally, the study employs the Recursive Feature Elimination (RFE) method for dimensionality reduction to enhance model efficiency and performance. The dataset was collected, cleaned, and analyzed exploratorily before feature selection was conducted using RFE. The Decision Tree model was trained and evaluated using accuracy, precision, recall, and F1-score metrics. The results showed that applying RFE improved computational efficiency without compromising model accuracy. The Gini criterion provided more stable results across all metrics, while Entropy demonstrated higher precision in certain cases. Model optimization through parameter tuning produced the best parameter combination at max_depth = 5, min_samples_leaf = 5, and min_samples_split = 10. This study concludes that integrating RFE with the Decision Tree can significantly enhance the performance of high-dimensional dataset classification. The findings are expected to serve as a reference for developing efficient and accurate biological data classification models
Copyrights © 2025