YOLOv3 object detection method by introducing Gaussian mask self-attention module

Table 1. Feature map sizes and prior box sizes for a 640×640 input image
Feature map size	Receptive field	Prior box sizes
20×20	Large	(116, 90), (156, 198), (373, 326)
40×40	Medium	(30, 61), (62, 45), (59, 119)
80×80	Small	(10, 13), (16, 30), (33, 23)
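
The feature map sizes in Table 1 are consistent with the downsampling strides of 32, 16, and 8 that YOLOv3 commonly uses, applied to a 640×640 input. The following is a minimal sketch of that arithmetic. The stride values, the (width, height) reading of each prior box, and the names `STRIDES`, `ANCHORS`, `grid_size`, and `anchors_in_grid_units` are assumptions for illustration and do not come from the article.

```python
# Assumed YOLOv3 downsampling strides (not stated in the article's tables).
STRIDES = (32, 16, 8)

# Prior box sizes in pixels, copied from Table 1 and keyed by feature map size.
# Reading each pair as (width, height) is an assumption.
ANCHORS = {
    20: [(116, 90), (156, 198), (373, 326)],
    40: [(30, 61), (62, 45), (59, 119)],
    80: [(10, 13), (16, 30), (33, 23)],
}


def grid_size(image_size: int, stride: int) -> int:
    """Side length of the feature map for a square input image."""
    return image_size // stride


def anchors_in_grid_units(image_size: int, stride: int):
    """Express one scale's prior boxes in units of grid cells."""
    grid = grid_size(image_size, stride)
    return [(w / stride, h / stride) for w, h in ANCHORS[grid]]


if __name__ == "__main__":
    for stride in STRIDES:
        print(grid_size(640, stride), anchors_in_grid_units(640, stride))
```

Under these assumptions, the coarsest 20×20 map carries the largest prior boxes and the finest 80×80 map carries the smallest ones, which matches the large, medium, and small receptive field labels in the table.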

Table 2. Training environment

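Component	Model and specification
CPU	Intel(R) Core(TM) i9-9900K, 8 cores
Solid-state drive	Kingston, 512 GB
Memory	Kingston, 16 GB×2, 8 GB×2
Graphics card	NVIDIA TITAN Xp, 12 GB video memory, CUDA 10.2
Operating system	Ubuntu 18.04
Programming language	Python 3.8.11
Machine learning framework	PyTorch 1.9.0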

Table 3. Training parameters
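Parameter	Value
Batch size	10
Number of iterations	100
Learning rate	0.01
Momentum	0.937
Confidence threshold	0.5
NMS threshold	0.5
Number of classes	80
Number of self-attention heads	8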
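
To illustrate how the confidence and NMS thresholds in Table 3 are typically applied at inference time, here is a minimal sketch of confidence filtering followed by greedy non-maximum suppression. It assumes the NMS threshold is an IoU threshold, that boxes are given as (x1, y1, x2, y2), and that all boxes belong to one class. The function names `iou` and `filter_detections` are hypothetical, and the article does not state which NMS variant it uses.

```python
CONF_THRESHOLD = 0.5  # confidence threshold from Table 3
NMS_THRESHOLD = 0.5   # NMS threshold from Table 3, assumed to be an IoU threshold


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def filter_detections(boxes, scores):
    """Drop low-confidence boxes, then apply greedy NMS; return kept indices."""
    order = sorted(
        (i for i, s in enumerate(scores) if s >= CONF_THRESHOLD),
        key=lambda i: scores[i],
        reverse=True,
    )
    kept = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= NMS_THRESHOLD for j in kept):
            kept.append(i)
    return kept


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30), (0, 0, 10, 10)]
    scores = [0.9, 0.8, 0.7, 0.3]
    print(filter_detections(boxes, scores))
```

In the usage example, the second box overlaps the first with an IoU of about 0.68 and is suppressed, and the fourth box falls below the confidence threshold, so only the first and third boxes remain.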

Table 4. Performance evaluation

Model	mAP@0.5/%	mAP@0.5:0.95/%	Precision/%	Recall/%	FPS
Faster R-CNN	58.98	38.54	68.67	56.33	21.12
SSD	47.33	29.49	56.19	44.93	35.56
ASSD	52.73	31.81	60.24	49.48	36.71
YOLOv3	54.32	34.17	61.78	51.96	43.26
YOLOv3-GMSA	56.88	36.19	65.31	53.18	39.38
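
The trade-off between the baseline and the modified model in Table 4 is easier to see as per-metric differences. The sketch below subtracts the YOLOv3 row from the YOLOv3-GMSA row. The names `METRICS`, `YOLOV3`, `YOLOV3_GMSA`, and `deltas` are hypothetical; the numbers are copied from the table.

```python
# Rows for YOLOv3 and YOLOv3-GMSA copied from Table 4.
METRICS = ("mAP@0.5", "mAP@0.5:0.95", "Precision", "Recall", "FPS")
YOLOV3 = (54.32, 34.17, 61.78, 51.96, 43.26)
YOLOV3_GMSA = (56.88, 36.19, 65.31, 53.18, 39.38)


def deltas(baseline, candidate):
    """Absolute difference (candidate minus baseline) for each metric."""
    return {
        name: round(c - b, 2)
        for name, b, c in zip(METRICS, baseline, candidate)
    }


if __name__ == "__main__":
    for name, diff in deltas(YOLOV3, YOLOV3_GMSA).items():
        print(f"{name}: {diff:+.2f}")
```

On these figures, YOLOv3-GMSA gains 2.56 percentage points of mAP@0.5 and 2.02 points of mAP@0.5:0.95 over YOLOv3, while running 3.88 FPS slower.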

Ya-jie KONG, Ye ZHANG. YOLOv3 object detection method by introducing Gaussian mask self-attention module[J]. Chinese Journal of Liquid Crystals and Displays, 2022, 37(4): 539
