Zhuo, Wang
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

HCRF: an improved random forest algorithm based on hierarchical clustering Zhuo, Wang; Ahmad, Azlin
Indonesian Journal of Electrical Engineering and Computer Science Vol 38, No 1: April 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v38.i1.pp578-586

Abstract

Random forest (RF) selects feature subsets randomly. Useless and redundant features will lower the quality of the selected features and subsequently affect the overall classification accuracy of the RF. This study proposes an improved RF algorithm based on hierarchical clustering (HCRF). The algorithm uses hierarchical clustering algorithms to optimize the feature selection process, by establishing similar feature groups based on the GINI index, and then selecting features from each group proportionally to construct the feature subset. The feature subset is then used to construct a single classifier. This process increases the filtering of feature subsets, reducing the negative impact of useless and redundant features on the model, and improving the model's generalization ability and overall performance. In the experimental verification, ten datasets of different sizes and domains were selected, and the accuracy, precision, recall, F1 score, and running time of HCRF, support vector machine (SVM), RF, classification and regression tree (CART) were compared using 10-fold cross-validation. Combining all the results, the HCRF algorithm showed significant improvements in all evaluation indicators, proving that its performance is superior to the other three classifiers. Therefore, this algorithm has broad application areas and value, and effectively improves the overall performance of the classifier within a lower complexity range.