Real-time vehicle detection at night faces significant challenges: ineffective low-light feature extraction, low computational efficiency hampering real-time performance, subpar detection of small and occluded vehicles, and limited cross-scenario generalization. To address these issues, this paper presents CEM-YOLO, an enhanced YOLO algorithm leveraging deep learning for nighttime vehicle detection. Two novel modules, Convolutional Maxpooling Downsampling and Multi-branch Residual Feature Fusion, are introduced to mitigate model complexity, reduce feature redundancy, and safeguard input features. Additionally, the Efficient Multi-Scale Attention Module is integrated into the Neck network’s detection layers. Extensive experiments and ablation studies on benchmark datasets demonstrate that CEM-YOLO excels in nighttime scenarios, achieving an optimal speed-accuracy balance for real-time applications.

CEM-YOLO: multi-branch residual feature fusion and convolutional maxpooling downsampling for real-time vehicle detection in night scenarios

Karimi, Hamid Reza
2025-01-01

Abstract

Real-time vehicle detection at night faces significant challenges: ineffective low-light feature extraction, low computational efficiency hampering real-time performance, subpar detection of small and occluded vehicles, and limited cross-scenario generalization. To address these issues, this paper presents CEM-YOLO, an enhanced YOLO algorithm leveraging deep learning for nighttime vehicle detection. Two novel modules, Convolutional Maxpooling Downsampling and Multi-branch Residual Feature Fusion, are introduced to mitigate model complexity, reduce feature redundancy, and safeguard input features. Additionally, the Efficient Multi-Scale Attention Module is integrated into the Neck network’s detection layers. Extensive experiments and ablation studies on benchmark datasets demonstrate that CEM-YOLO excels in nighttime scenarios, achieving an optimal speed-accuracy balance for real-time applications.
2025
Convolutional Maxpooling Downsampling; Deep learning; MobileViT; Multi-branch Residual Feature Fusion; Real-time vehicle detection;
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1310790
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 3
social impact