Improving object detection performance in complex environments is essential for real-world applications, where objects are often stacked or crowded in the same location. Existing object detection models still struggle in scenes with complex backgrounds: accuracy often drops when the target object is occluded by other objects or is small in size. This study therefore proposes an improvement to a detection model for complex environments. The base algorithm is YOLOv5s. The model is optimized by adding a CBAM (Convolutional Block Attention Module) attention mechanism layer, integrated with the C3 layer (C3CBAM), to the backbone of the YOLOv5s architecture. In addition, a P2 feature map is added to the head of the architecture. The results are satisfactory: precision increased by 1.6%, mAP@0.5 by 1.4%, and mAP@0.5:0.95 by 0.1%. These results indicate that the enhancement applied to YOLOv5s in this study improves model performance. However, the added attention mechanism layer also increases the computational load. Future research can therefore apply methods to reduce the computational load, one of which is knowledge distillation.
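To illustrate why an additional P2 feature map can help with small objects, the following minimal sketch compares the grid resolution of each detection level. It assumes the usual YOLOv5 stride convention (P2 at stride 4, P3 at 8, P4 at 16, P5 at 32); the 640-pixel input size, the 12-pixel object, and the helper names are illustrative assumptions and are not taken from the study.

```python
# Illustrative sketch only. Strides follow the common YOLOv5 convention
# (P2/4, P3/8, P4/16, P5/32). The input size and object size below are
# assumed values chosen for the example, not figures from the study.

STRIDES = {"P2": 4, "P3": 8, "P4": 16, "P5": 32}


def grid_size(input_size: int, stride: int) -> int:
    """Side length, in cells, of the feature map produced at this stride."""
    return input_size // stride


def cells_per_side(object_size: int, stride: int) -> float:
    """Number of grid cells a square object spans along one side."""
    return object_size / stride


def summarize(input_size: int = 640, object_size: int = 12) -> dict:
    """Grid size and object footprint for every detection level."""
    return {
        level: {
            "grid": grid_size(input_size, stride),
            "cells_per_side": cells_per_side(object_size, stride),
        }
        for level, stride in STRIDES.items()
    }


if __name__ == "__main__":
    for level, info in summarize().items():
        print(f"{level}: {info['grid']}x{info['grid']} grid, "
              f"object spans {info['cells_per_side']:.3f} cells per side")
```

Under these assumptions, a 12-pixel object spans three cells per side on the P2 map but less than one cell on the P4 and P5 maps, which is the intuition behind adding a higher-resolution level for small objects. The larger P2 grid also means more positions to process, consistent with the increased computational load noted above.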
Copyright © 2025