Optoelectronics Letters, Volume. 21, Issue 6, 342(2025)

Sign language data quality improvement based on dual information streams

Jialiang CAI and Tiantian YUAN

Sign language dataset is essential in sign language recognition and translation (SLRT). Current public sign language datasets are small and lack diversity, which does not meet the practical application requirements for SLRT. However, making a large-scale and diverse sign language dataset is difficult as sign language data on the Internet is scarce. In making a large-scale and diverse sign language dataset, some sign language data qualities are not up to standard. This paper proposes a two information streams transformer (TIST) model to judge whether the quality of sign language data is qualified. To verify that TIST effectively improves sign language recognition (SLR), we make two datasets, the screened dataset and the unscreened dataset. In this experiment, this paper uses visual alignment constraint (VAC) as the baseline model. The experimental results show that the screened dataset can achieve better word error rate (WER) than the unscreened dataset.

Tools

Get Citation

Copy Citation Text

CAI Jialiang, YUAN Tiantian. Sign language data quality improvement based on dual information streams[J]. Optoelectronics Letters, 2025, 21(6): 342

Download Citation

EndNote(RIS)BibTexPlain Text
Save article for my favorites
Paper Information

Received: Jul. 18, 2023

Accepted: Jun. 27, 2025

Published Online: Jun. 27, 2025

The Author Email:

DOI:10.1007/s11801-025-3137-6

Topics