UAV imagery-based semantic segmentation is crucial for mapping tropical agricultural areas such as oil palm plantations. The main challenges are overlapping vegetation objects, unclear boundaries, and spectral similarities between classes, which reduce the accuracy of conventional models. This study proposes a modified U-Net architecture with a VGG-19 backbone, achieved through hyperparameter tuning (M7) and the integration of residual blocks (M8), to enhance multi-class segmentation performance. Experiments were conducted on aerial imagery with two resolutions (512×512 and 256×256) using four-class and three-class scenarios. The results show that M7 and M8 consistently outperform the baseline model (M2) in terms of accuracy, precision, recall, and average Intersection over Union (IoU). In the 512x512 four-class scenario, M8 achieved the highest accuracy (87.40%), precision (88.32%), recall (86.32%), and MIoU (0.132). M7 reached similar accuracy (>86%) but trained significantly faster than the baseline. In the 256x256 scenario, M8 maintained strong performance with 86.44% accuracy and 0.302 MIoU. For the three-class experiment, M8 reached a top MIoU of 0.178. Accuracy, precision, and recall were all above 87%, showing improved recognition of minority classes such as waterways. Confusion matrix analysis confirmed that M8 provided more balanced class predictions. It also reduced false negatives for oil palm vegetation. M7 showed slight fluctuations, suggesting possible overfitting. These findings support M8 as a robust solution for UAV-based oil palm mapping and large-scale monitoring.
Copyrights © 2026