Lightweight Swin Transformer combined with multi-scale feature fusion for face expression recognition

Yanqiu Li; Shengzhao Li; Guangling Sun; Pu Yan

doi:10.12086/oee.2025.240234

Opto-Electronic Engineering, Volume. 52, Issue 1, 240234(2025)

Lightweight Swin Transformer combined with multi-scale feature fusion for face expression recognition

Yanqiu Li1...2, Shengzhao Li1, Guangling Sun1,2,*, and Pu Yan12 |Show fewer author(s)

Author Affiliations

¹School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei, Anhui 260601, China

²Anhui International Joint Research Center for Ancient Architecture Intellisencing and Multi-Dimensional Modeling, Hefei, Anhui 230601, China

show less

Abstract Get PDF(in Chinese)

A lightweight Swin Transformer and multi-scale feature fusion (EMA) module combination is proposed for face expression recognition, which addresses the problems of the Swin Transformer model, such as excessive parameter quantity, poor real-time performance, and limited ability to capture the complex and small expression change features present in the expressions. The method first uses the proposed SPST module to replace the Swin Transformer block module in the fourth stage of the original Swin Transformer model to reduce the number of parameters of the model and realize the lightweight model. Then, the multi-scale feature fusion (EMA) module is embedded after the second stage of the lightweight model, which effectively improves the model's ability to capture the details of facial expressions through multi-scale feature extraction and cross-space information aggregation, thus improving the accuracy and robustness of facial expression recognition. The experimental results show that the proposed method achieves 97.56%, 86.46%, 87.29%, and 70.11% recognition accuracy on four public datasets, namely, JAFFE, FERPLUS, RAF-DB, and FANE, respectively. Compared with the original Swin Transformer model, the number of parameters of the improved model is decreased by 15.8% and the FPS is improved by 9.6%, which significantly enhances the real-time performance of the model while keeping the number of parameters of the model low.

AI Picture Guide

AI One Sentence

AI Short Abstract

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

EMA module expression recognition SPST module Swin Transformer

Tools

Get Citation

Copy Citation Text

Yanqiu Li, Shengzhao Li, Guangling Sun, Pu Yan. Lightweight Swin Transformer combined with multi-scale feature fusion for face expression recognition[J]. Opto-Electronic Engineering, 2025, 52(1): 240234

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Article

Received: Oct. 7, 2024

Accepted: Dec. 3, 2024

Published Online: Feb. 21, 2025

The Author Email: Sun Guangling (孙光灵)

DOI:10.12086/oee.2025.240234

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology

微信扫一扫：分享