Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention

Yanhui Li; Zhongchun Fang; Hairong Li

doi:10.3788/LOP241332

Laser & Optoelectronics Progress, Vol. 62, Issue 4, 0412007 (2025)

Yanhui Li¹,*, Zhongchun Fang², and Hairong Li²

¹School of Digital and Intelligent Industry (School of Cyber Science and Technology), Inner Mongolia University of Science & Technology, Baotou 014000, Inner Mongolia, China

²Engineering Training Center (College of Innovation and Entrepreneurship Education), Inner Mongolia University of Science & Technology, Baotou 014000, Inner Mongolia, China

Abstract

To reduce computational cost while completing lane detection efficiently, this paper proposes a multiframe lane detection method that uses a Swin Transformer embedded with a coordinate attention mechanism to detect lanes in continuous multiframe image sequences. The method takes continuous multiframe image sequences as input and adopts a Swin Transformer encoder-decoder architecture so that the input and output images have the same size. The coordinate attention mechanism is embedded in the patch merging step of the stage 3 fusion layer of the Swin Transformer model, which strengthens the model's focus on long-distance dependencies and its ability to extract both global and local features of lane lines. In addition, a spatiotemporal long short-term memory module introduced between the encoder and decoder improves the model's ability to predict temporal sequence information and, in turn, the lane line detection accuracy. Extensive experiments conducted on the CULane, TuSimple, and VIL-100 datasets show that the proposed method has an overall advantage in handling continuous multiframe image sequences and delivers better detection performance than existing methods.
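As a rough aid for reading the architecture description, the sketch below traces feature-map shapes through a Swin-style hierarchical encoder, in which each patch merging step halves the spatial resolution and doubles the channel count. It is not the authors' implementation: the function name `swin_stage_shapes` is hypothetical, and the defaults (patch size 4, embedding dimension 96, four stages) are assumptions borrowed from the standard Swin-T configuration, since the abstract does not state the paper's settings.

```python
from typing import List, Tuple


def swin_stage_shapes(
    height: int,
    width: int,
    patch_size: int = 4,   # assumption: Swin-T default, not stated in the abstract
    embed_dim: int = 96,   # assumption: Swin-T default, not stated in the abstract
    num_stages: int = 4,   # assumption: standard four-stage hierarchy
) -> List[Tuple[int, int, int]]:
    """Return the (H, W, C) feature-map shape after each encoder stage."""
    # The input must divide evenly through patch embedding and every merge.
    total_stride = patch_size * 2 ** (num_stages - 1)
    if height % total_stride or width % total_stride:
        raise ValueError(f"height and width must be multiples of {total_stride}")

    # Stage 1: patch embedding splits the image into patch_size x patch_size tokens.
    h, w, c = height // patch_size, width // patch_size, embed_dim
    shapes = [(h, w, c)]

    # Stages 2..N: patch merging groups 2x2 neighbouring tokens, so the spatial
    # size halves and the channel count doubles.
    for _ in range(num_stages - 1):
        h, w, c = h // 2, w // 2, c * 2
        shapes.append((h, w, c))
    return shapes
```

Calling `swin_stage_shapes(128, 256)` with a hypothetical 128 × 256 frame shows how far the encoder downsamples. A decoder that mirrors these steps in reverse is one way to recover an output with the same size as the input, which is the property the abstract attributes to the encoder-decoder design.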

Note: This section is automatically generated by AI.

Keywords

coordinate attention mechanism; lane detection; spatiotemporal long short-term memory; Swin Transformer

Yanhui Li, Zhongchun Fang, Hairong Li. Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention[J]. Laser & Optoelectronics Progress, 2025, 62(4): 0412007
