arXiv Open Access 2025

DiscoSum: Discourse-aware News Summarization

Alexander Spangher Tenghao Huang Jialiang Gu Jiatong Shi Muhao Chen
Lihat Sumber

Abstrak

Recent advances in text summarization have predominantly leveraged large language models to generate concise summaries. However, language models often do not maintain long-term discourse structure, especially in news articles, where organizational flow significantly influences reader engagement. We introduce a novel approach to integrating discourse structure into summarization processes, focusing specifically on news articles across various media. We present a novel summarization dataset where news articles are summarized multiple times in different ways across different social media platforms (e.g. LinkedIn, Facebook, etc.). We develop a novel news discourse schema to describe summarization structures and a novel algorithm, DiscoSum, which employs beam search technique for structure-aware summarization, enabling the transformation of news stories to meet different stylistic and structural demands. Both human and automatic evaluation results demonstrate the efficacy of our approach in maintaining narrative fidelity and meeting structural requirements.

Topik & Kata Kunci

Penulis (5)

A

Alexander Spangher

T

Tenghao Huang

J

Jialiang Gu

J

Jiatong Shi

M

Muhao Chen

Format Sitasi

Spangher, A., Huang, T., Gu, J., Shi, J., Chen, M. (2025). DiscoSum: Discourse-aware News Summarization. https://arxiv.org/abs/2506.06930

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓