Inventory management in warehouses often experiences discrepancies in recording the number of cardboard boxes due to errors during the manual recording process. To overcome this problem, a cardboard detection method was developed using the Default Mask R-CNN model and a modified model using MobileNetV3 Small. The training data was obtained from a collection of cardboard photos which then went through an annotation stage. In the cReonfiguration stage, various anchor scales were applied to determine the bounding box parameters, while the training process used Stochastic Gradient Descent (SGD). The default model is trained with the initial Mask R-CNN settings, while the custom model modifies the backbone and Feature Pyramid Network (FPN) adjustments. The test results show that the custom model has higher efficiency with a parameter count of 20,857,704 and an average FPS of 10.92. However, the accuracy level of the custom model is lower than that of the default model
Copyrights © 2025