Opto-Electronic Engineering, Volume. 52, Issue 1, 240234(2025)

Lightweight Swin Transformer combined with multi-scale feature fusion for face expression recognition

Yanqiu Li1,2, Shengzhao Li1, Guangling Sun1,2、*, and Pu Yan1,2
Author Affiliations
  • 1School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei, Anhui 260601, China
  • 2Anhui International Joint Research Center for Ancient Architecture Intellisencing and Multi-Dimensional Modeling, Hefei, Anhui 230601, China
  • show less

    A lightweight Swin Transformer and multi-scale feature fusion (EMA) module combination is proposed for face expression recognition, which addresses the problems of the Swin Transformer model, such as excessive parameter quantity, poor real-time performance, and limited ability to capture the complex and small expression change features present in the expressions. The method first uses the proposed SPST module to replace the Swin Transformer block module in the fourth stage of the original Swin Transformer model to reduce the number of parameters of the model and realize the lightweight model. Then, the multi-scale feature fusion (EMA) module is embedded after the second stage of the lightweight model, which effectively improves the model's ability to capture the details of facial expressions through multi-scale feature extraction and cross-space information aggregation, thus improving the accuracy and robustness of facial expression recognition. The experimental results show that the proposed method achieves 97.56%, 86.46%, 87.29%, and 70.11% recognition accuracy on four public datasets, namely, JAFFE, FERPLUS, RAF-DB, and FANE, respectively. Compared with the original Swin Transformer model, the number of parameters of the improved model is decreased by 15.8% and the FPS is improved by 9.6%, which significantly enhances the real-time performance of the model while keeping the number of parameters of the model low.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Yanqiu Li, Shengzhao Li, Guangling Sun, Pu Yan. Lightweight Swin Transformer combined with multi-scale feature fusion for face expression recognition[J]. Opto-Electronic Engineering, 2025, 52(1): 240234

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Article

    Received: Oct. 7, 2024

    Accepted: Dec. 3, 2024

    Published Online: Feb. 21, 2025

    The Author Email: Sun Guangling (孙光灵)

    DOI:10.12086/oee.2025.240234

    Topics