OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages

Selvaraj, Prem; NC, Gokul; Kumar, Pratyush; Khapra, Mitesh

Computer Science > Computation and Language

arXiv:2110.05877 (cs)

[Submitted on 12 Oct 2021]

Title:OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages

Authors:Prem Selvaraj, Gokul NC, Pratyush Kumar, Mitesh Khapra

View PDF

Abstract:AI technologies for Natural Languages have made tremendous progress recently. However, commensurate progress has not been made on Sign Languages, in particular, in recognizing signs as individual words or as complete sentences. We introduce OpenHands, a library where we take four key ideas from the NLP community for low-resource languages and apply them to sign languages for word-level recognition. First, we propose using pose extracted through pretrained models as the standard modality of data to reduce training time and enable efficient inference, and we release standardized pose datasets for 6 different sign languages - American, Argentinian, Chinese, Greek, Indian, and Turkish. Second, we train and release checkpoints of 4 pose-based isolated sign language recognition models across all 6 languages, providing baselines and ready checkpoints for deployment. Third, to address the lack of labelled data, we propose self-supervised pretraining on unlabelled data. We curate and release the largest pose-based pretraining dataset on Indian Sign Language (Indian-SL). Fourth, we compare different pretraining strategies and for the first time establish that pretraining is effective for sign language recognition by demonstrating (a) improved fine-tuning performance especially in low-resource settings, and (b) high crosslingual transfer from Indian-SL to few other sign languages. We open-source all models and datasets in OpenHands with a hope that it makes research in sign languages more accessible, available here at this https URL .

Comments:	Submitted to AAAI22, 13 pages, 9 figures, 6 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.2.7
Cite as:	arXiv:2110.05877 [cs.CL]
	(or arXiv:2110.05877v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.05877

Submission history

From: Gokul N.C. [view email]
[v1] Tue, 12 Oct 2021 10:33:02 UTC (3,018 KB)

Computer Science > Computation and Language

Title:OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators