Laser & Optoelectronics Progress, Volume. 62, Issue 4, 0412007(2025)

Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention

Yanhui Li1、*, Zhongchun Fang2, and Hairong Li2
Author Affiliations
  • 1School of Digital and Intelligent Industry (School of Cyber Science and Technology), Inner Mongolia University of Science & Technology, Baotou 014000, Inner Mongolia , China
  • 2Engineering Training Center (College of Innovation and Entrepreneurship Education), Inner Mongolia University of Science & Technology, Baotou 014000, Inner Mongolia , China
  • show less
    Figures & Tables(18)
    Overall framework diagram of the proposed model
    Structure of the Swin Transformer network
    Framework of the Swin Transformer block
    Structure of the ST-LSTM
    Structure of the CA module
    Scene distribution of the CULane dataset
    Scene distribution of the VIL-100 dataset
    Detection results by the proposed model on the CULane dataset
    Detection results by the proposed model on the Tusimple dataset
    Detection results by the proposed model on the VIL-100 dataset
    • Table 1. Sampling method for successive input images

      View table

      Table 1. Sampling method for successive input images

      Ground truthStrideSampled frames
      15th112th, 13th, 14th, 15th
      29th, 11th, 13th, 15th
      36th, 9th, 12th, 15th
      19th116th, 17th, 18th, 19th
      213th, 15th, 17th, 19th
      310th, 13th, 16th, 19th
    • Table 2. Structure and contents of the Tusimple dataset

      View table

      Table 2. Structure and contents of the Tusimple dataset

      DatasetTypeLaneEnvironmentLabeled frameLabeled image
      Training setTusimple≤4Highway15th and 19th7252
      Test setTest set 1≤4Highway15th and 19th2465
      Test set 2≤4HighwayAll frames781
    • Table 3. Experimental results on the Tusimple dataset by different models

      View table

      Table 3. Experimental results on the Tusimple dataset by different models

      ModelA /%PRF1 score
      SCNN2096.530.6540.8080.722
      RESA2196.800.7610.7290.745
      LaneNet2296.380.8750.9270.884
      SegNet2396.050.7960.9560.838
      U-Net2496.400.7900.9530.867
      SegNet-convLSTM1997.100.8520.9640.901
      UNet-convLSTM1997.200.8570.9580.904
      ADNet2596.23
      Res18_UFLD2695.950.8836
      Proposed97.600.8680.9710.907
    • Table 4. Comparasion of detection performances for scenes on the CULane dataset by different algorithms

      View table

      Table 4. Comparasion of detection performances for scenes on the CULane dataset by different algorithms

      AlgorithmF1 scoreOverallNFP of CrossroadFPS /(frame·s-1
      NormalCrowdedDazzle lightShadowNo lineArrowCurveNight
      U-Net240.91100.68200.61200.67100.43300.85400.63700.67300.7230204450.1
      LaneNet220.91700.70300.60300.67500.44600.85200.65100.67400.7370200548.8
      SCNN200.90600.69700.58500.66900.43400.84100.64400.66100.713019908.2
      ADNet250.91920.75810.69390.76210.51750.87710.68840.72330.7756113387.0
      Res18_UFLD260.89060.67760.55260.64830.39000.83810.58390.64070.696922156.9
      Proposed0.94300.77500.72300.79100.56200.88100.67200.72300.8310195163.5
    • Table 5. Experimental results on the VIL-100 dataset by different models

      View table

      Table 5. Experimental results on the VIL-100 dataset by different models

      ModelmIoUF1 score
      LaneNet220.6330.721
      RESA210.7020.874
      LaneATT170.6640.823
      MMA-Net270.7050.839
      Proposed0.7260.895
    • Table 6. Comparison of the accuracy on the Tusimple dataset

      View table

      Table 6. Comparison of the accuracy on the Tusimple dataset

      ModelAccuracy of training set /%Accuracy of test set /%
      Easy lanesHard lanesEasy lanesHard lanes
      SCNN2094.1292.8594.2593.12
      ENet2894.9893.6294.7393.74
      UNet2495.4294.1495.5394.08
      SegNet2395.7994.5795.6194.52
      LaneNet2296.5695.3596.4895.42
      Proposed97.2097.1097.1997.12
    • Table 7. Experimental results using different Q values

      View table

      Table 7. Experimental results using different Q values

      Value of QA /%PRF1 score
      196.90.8620.9560.871
      297.40.8670.9610.876
      397.50.8690.9680.905
      497.60.8680.9710.907
      597.30.8660.9700.904
    • Table 8. Results of the ablation experiments

      View table

      Table 8. Results of the ablation experiments

      Base laneTransformerVision TransformerSwin TransformerCBAMCAF1 score
      CULaneTusimpleVIL-100
      0.7210.8360.821
      0.7580.8750.859
      0.7750.8870.864
      0.7890.8950.876
      0.7910.9070.895
    Tools

    Get Citation

    Copy Citation Text

    Yanhui Li, Zhongchun Fang, Hairong Li. Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention[J]. Laser & Optoelectronics Progress, 2025, 62(4): 0412007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Instrumentation, Measurement and Metrology

    Received: May. 20, 2024

    Accepted: Jul. 10, 2024

    Published Online: Feb. 10, 2025

    The Author Email:

    DOI:10.3788/LOP241332

    CSTR:32186.14.LOP241332

    Topics