large-Scale TV Dataset (STVD)
STVD is a multimedia collection of datasets built from the TV Workstation platform [1-7]. The collection addresses open challenges in several research fields, including Computer Vision (CV), Natural Language Processing (NLP), Knowledge Engineering (KE), Audio Signal Processing (ASP) and Operational Research (OR). Five main datasets have been released to date, covering partial video copy detection, fact-checking, parallel machine scheduling, multimedia knowledge graphs and multimodal named entity recognition.
| Dataset | Topic | Size | Year | Link |
| --- | --- | --- | --- | --- |
| STVD-PVCD | Partial Video Copy Detection | 542 GB | 2021 | download |
| STVD-FC | Fact-Checking | 1.96 TB | 2022 | download |
| STVD-PMS | Parallel Machine Scheduling | 7 MB | 2022 | download |
| STVD-KG | Knowledge Graph | 1.6 GB | 2025 | download |
| STVD-MNER | Multimodal Named Entity Recognition | 281 GB | 2025 | download |
- M. Delalandre. A Workstation for Real-Time Processing of Multi-Channel TV. Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV), pp. 53-54, 2019.
- V.H. Le, M. Delalandre and D. Conte. Real-time detection of partial video copy on TV workstation. Conference on Content-Based Multimedia Indexing (CBMI), pp. 1-4, 2021.
- V.H. Le, M. Delalandre and D. Conte. Une large base de données pour la détection de segments de vidéos TV. Journées Francophones des Jeunes Chercheurs en Vision par Ordinateur (ORASIS), 2021.
- V.H. Le, M. Delalandre and D. Conte. A large-Scale TV Dataset for partial video copy detection. International Conference on Image Analysis and Processing (ICIAP), Lecture Notes in Computer Science (LNCS), vol 13233, pp. 388-399, 2022.
- F. Rayar, M. Delalandre and V.H. Le. A large-scale TV video and metadata database for French political content analysis and fact-checking. Conference on Content-Based Multimedia Indexing (CBMI), pp. 181-185, 2022.
- V.H. Le, M. Delalandre and H. Cardot. Performance characterization of 2D CNN features for partial video copy detection. International Conference on Computer Analysis of Images and Patterns (CAIP), Lecture Notes in Computer Science (LNCS), vol 14184, pp. 205-215, 2023.
- M. Delalandre. The TV Workstation project: a research scope. LIFAT seminar, Tours, France, 10th of July 2025. HCMUT-FCSE seminar, HCMC, Vietnam, 30th of November 2024. HCMUTE-FIT seminar, HCMC, Vietnam, 29th of November 2024. VNU-ITI seminar, Hanoï, Vietnam, 1st of November 2023. HUST seminar, Hanoï, Vietnam, 2nd of November 2023. FICT seminar, Thanh Hóa, Vietnam, 25th of October 2022. LIFAT seminar, Tours, France, 29th of July 2022. LIFAT seminar, Tours, France, 2nd of July 2021.
- H.G. Vu, N. Friburger, A. Soulet and M. Delalandre. stvd-kg: A Knowledge Graph for French Electronical Program Guides. International Semantic Web Conference (WISE), 2025.