arXiv Open Access 2024

ConfliBERT: A Language Model for Political Conflict

Patrick T. Brandt Sultan Alsarra Vito J. D`Orazio Dagmar Heintze Latifur Khan +3 lainnya

Lihat Sumber

Abstrak

Conflict scholars have used rule-based approaches to extract information about political violence from news reports and texts. Recent Natural Language Processing developments move beyond rigid rule-based approaches. We review our recent ConfliBERT language model (Hu et al. 2022) to process political and violence related texts. The model can be used to extract actor and action classifications from texts about political conflict. When fine-tuned, results show that ConfliBERT has superior performance in accuracy, precision and recall over other large language models (LLM) like Google's Gemma 2 (9B), Meta's Llama 3.1 (7B), and Alibaba's Qwen 2.5 (14B) within its relevant domains. It is also hundreds of times faster than these more generalist LLMs. These results are illustrated using texts from the BBC, re3d, and the Global Terrorism Dataset (GTD).

Topik & Kata Kunci

cs.CL

Penulis (8)

Patrick T. Brandt

Sultan Alsarra

Vito J. D`Orazio

Dagmar Heintze

Latifur Khan

Shreyas Meher

Javier Osorio

Marcus Sianan

Format Sitasi

APA MLA BibTeX

Brandt, P.T., Alsarra, S., D`Orazio, V.J., Heintze, D., Khan, L., Meher, S. et al. (2024). ConfliBERT: A Language Model for Political Conflict. https://arxiv.org/abs/2412.15060

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓