The rapid growth of intelligent environmental security systems has intensified the need for accurate and real-time suspicious human activity recognition. Computer vision techniques, particularly deep learning–based object detection models, have emerged as key enablers in addressing these challenges. Among them, You Only Look Once (YOLO) has gained significant attention due to its high detection speed, end-to-end architecture, and suitability for real-time surveillance applications. This review paper presents a comprehensive analysis of the application of YOLO-based models in suspicious human activity recognition for intelligent environmental security systems. It examines the evolution of YOLO architectures, their adaptations for activity and behavior analysis, and their integration with surveillance frameworks. The review further discusses commonly used datasets, performance evaluation metrics, and comparative results reported in existing studies. In addition, key challenges such as occlusion, varying illumination, complex backgrounds, privacy concerns, and computational constraints are highlighted. Finally, the paper outlines future research directions, including hybrid models, multi-modal data fusion, edge-based deployment, and explainable AI, to enhance the robustness and reliability of YOLO-driven security systems. This review aims to provide researchers and practitioners with a structured understanding of current advancements and open issues in YOLO-based suspicious human activity recognition.
Copyrights © 2026