GLCrowd: a weakly supervised global-local attention model for congested crowd counting

Hongmin Zhang; Qianqian Tian; Dingding Yan; Lingyu Bu

doi:10.12086/oee.2024.240174

Opto-Electronic Engineering, Volume. 51, Issue 10, 240174(2024)

GLCrowd: a weakly supervised global-local attention model for congested crowd counting

Hongmin Zhang*... Qianqian Tian, Dingding Yan and Lingyu Bu |Show fewer author(s)

Author Affiliations

School of Electrical and Electronic Engineering,Chongqing University of Technology,Chongqing 400054,China

show less

Abstract Get PDF(in Chinese)

To address the challenges of crowd counting in dense scenes,such as complex backgrounds and scale variations,we propose a weakly supervised crowd counting model for dense scenes,named GLCrowd,which integrates global and local attention mechanisms. First,we design a local attention module combined with deep convolution to enhance local features through context weights while leveraging feature weight sharing to capture high-frequency local information. Second,the Vision Transformer (ViT) self-attention mechanism is used to capture low-frequency global information. Finally,the global and local attention mechanisms are effectively fused,and counting is accomplished through a regression token. The model was tested on the Shanghai Tech Part A,Shanghai Tech Part B,UCF-QNRF,and UCF_CC_50 datasets,achieving MAE values of 64.884,8.958,95.523,and 209.660,and MSE values of 104.411,16.202,173.453,and 282.217,respectively. The results demonstrate that the proposed GLCrowd model exhibits strong performance in crowd counting within dense scenes.

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

crowd counting global-local attention Vision Transformer weakly supervised learning

Tools

Get Citation

Copy Citation Text

Hongmin Zhang, Qianqian Tian, Dingding Yan, Lingyu Bu. GLCrowd: a weakly supervised global-local attention model for congested crowd counting[J]. Opto-Electronic Engineering, 2024, 51(10): 240174

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites