large-Scale Tv Dataset
French Sign Language
(STVD-LSF)

LSF (French Sign Language) is a natural language expressed through hand movements and facial expressions. It shares the same linguistic properties as spoken French. LSF is part of the family of sign languages used worldwide by deaf and hard-of-hearing individuals, such as ASL (American Sign Language) and LSB (Belgian Sign Language).
Sign language recognition is a well-established research topic in computer vision with contributions for multiple languages, including Chinese, Arabic, Vietnamese, English, and French. When a video containing sign language sequences is paired with a speech translation, the task becomes a multimodal recognition problem, involving the integration of audio, textual, and visual data.
The STVD-LSF dataset has been developed to enable scalable multimodal LSF recognition. It comprises several test sets featuring high-resolution videos, multiple interpreters, and a wide range of topics. Each test set is accompanied by corresponding audio speech translations and transcriptions.

French Sign Language (LSF)

Currently, STVD-LSF is available in a β version, which includes a Hello World test set. It provides approximately 7.5 hours of LSF video content, involving 15 interpreters and recorded at standard video quality. In this setup, recognition must be performed from scratch using only the provided video data. A more comprehensive version is planned for future release, expected to include several tens of hours of high-quality audiovisual material.

The dataset is structured as follows:

The dataset has the naming convention described here:

/ SEGMENTS / FileName.mp4
/ index.csv

where,

For understanding purposes, samples of LSF video segments are provided in the following table.

Int Segment 1 Segment 2 Segment 3
1 int1_seg1 int1_seg2 int1_seg3
2 int2_seg1 int2_seg2 int2_seg3
3 int3_seg1 int3_seg2 int3_seg3
4 int4_seg1 int4_seg2 int4_seg3
5 int5_seg1 int5_seg2 int5_seg3

The dataset is available for non-commercial research purposes only. To access the dataset, users must first download the agreement (available in English or French ), complete and sign it, and then send a scanned copy by email to Mathieu Delalandre email. After reviewing and validating the request, we will provide the password required to extract the dataset.

The dataset can be downloaded from the table below, which also presents general statistics. The hosting service at UT typically provides download speeds ranging from 3 MB/s to 16 MB/s, depending on network conditions and concurrent usage. Consequently, the dataset can generally be downloaded within a few minutes.

Period Channel Duration (h) Segments Resolution Encoding (Mb/s) FPS Interpreter Size (GB) Link
Jun-Jul 2022 1 7.5 58 240×384 [0.57;1] 29.97 15 2.42 download

For clarity, we detail here technical and scientific aspects of the STVD-LSF dataset.

French Sign Language (LSF)

  1. F. Rayar, M. Delalandre and V.H. Le. A large-scale TV video and metadata database for French political content analysis and fact-checking. Conference on Content-Based Multimedia Indexing (CBMI), pp. 181-185, 2022.