large-Scale TV Dataset (STVD)

STVD is a collection of multimedia datasets built with the TV Workstation platform [1-6]. The collection addresses open challenges in several research fields, including computer vision (CV), operational research (OR) and natural language processing (NLP). Three main datasets are available to date, covering partial video copy detection, parallel machine scheduling and fact-checking.

| Dataset | Topic | Size | Year | Link |
|---|---|---|---|---|
| STVD-PVCD | Partial Video Copy Detection | 526 GB | 2021 | download |
| STVD-PMS | Parallel Machine Scheduling | 7 MB | 2022 | download |
| STVD-FC | Fact-Checking | 1.96 TB | 2022 | download |

  1. V.H. Le, M. Delalandre and D. Conte. A large-Scale TV Dataset for partial video copy detection. International Conference on Image Analysis and Processing (ICIAP), Lecture Notes in Computer Science (LNCS), vol. 13233, pp. 388-399, 2022.
  2. F. Rayar, M. Delalandre and V.H. Le. A large-scale TV video and metadata database for French political content analysis and fact-checking. Conference on Content-Based Multimedia Indexing (CBMI), pp. 181-185, 2022.
  3. V.H. Le, M. Delalandre and D. Conte. Une large base de données pour la détection de segments de vidéos TV. Journées Francophones des Jeunes Chercheurs en Vision par Ordinateur (ORASIS), 2021.
  4. V.H. Le, M. Delalandre and D. Conte. Real-time detection of partial video copy on TV workstation. Conference on Content-Based Multimedia Indexing (CBMI), pp. 1-4, 2021.
  5. M. Delalandre. A Workstation for Real-Time Processing of Multi-Channel TV. Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV), pp. 53-54, 2019.
  6. M. Delalandre. The TV Workstation project: a research scope. FICT seminar, Thanh Hóa, Vietnam, 25th of October 2022. LIFAT seminar, Tours, France, 29th of July 2022. LIFAT seminar, Tours, France, 2nd of July 2021.