arXiv Open Access 2025

PoliTok-DE: A Multimodal Dataset of Political TikToks and Deletions From Germany

Tomas Ruiz Andreas Nanz Ursula Kristin Schmid Carsten Schwemmer Yannis Theocharis +1 lainnya
Lihat Sumber

Abstrak

We present PoliTok-DE, a large-scale multimodal dataset (video, audio, images, text) of TikTok posts related to the 2024 Saxony state election in Germany. The corpus contains over 195,000 posts published between 01.07.2024 and 30.11.2024, of which over 18,000 (17.3%) were subsequently deleted from the platform. Posts were identified via the TikTok research API and complemented with web scraping to retrieve full multimodal media and metadata. PoliTok-DE supports computational social science across substantive and methodological agendas: substantive work on intolerance and political communication; methodological work on platform policies around deleted content and qualitative-quantitative multimodal research. To illustrate one possible analysis, we report a case study on the co-occurrence of intolerance and entertainment using an annotated subset. The dataset of post IDs is publicly available on Hugging Face, and full content can be hydrated with our provided code. Access to the deleted content is restricted, and can be requested for research purposes.

Topik & Kata Kunci

Penulis (6)

T

Tomas Ruiz

A

Andreas Nanz

U

Ursula Kristin Schmid

C

Carsten Schwemmer

Y

Yannis Theocharis

D

Diana Rieger

Format Sitasi

Ruiz, T., Nanz, A., Schmid, U.K., Schwemmer, C., Theocharis, Y., Rieger, D. (2025). PoliTok-DE: A Multimodal Dataset of Political TikToks and Deletions From Germany. https://arxiv.org/abs/2509.15860

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓