arXiv Open Access 2025

Training Language Model to Critique for Better Refinement

Tianshu Yu Chao Xiang Mingchuan Yang Pei Ke Bosi Wen +6 lainnya
Lihat Sumber

Abstrak

Large language models (LLMs) have demonstrated remarkable evaluation and critique capabilities, providing insightful feedback and identifying flaws in various tasks. However, limited research has explored which types of critiques are most effective for improving model responses or how to generate such critiques. To address this gap, we introduce \textbf{R}efinement-oriented \textbf{C}ritique \textbf{O}ptimization (RCO), a novel framework designed to train critic models using refinement signals. RCO uses a feedback loop where critiques, generated by the critic model, guide the actor model in refining its responses. The critique utility (CU) quantifies the effectiveness of these refinements, serving as the reward signal for training the critic model. By focusing on critiques that lead to better refinements, RCO eliminates the need for direct critique preference assessment, ensuring that critiques driving meaningful improvements are rewarded. We evaluate RCO across five tasks, i.e., dialog generation, summarization, question answering, mathematical reasoning, and code generation, and show that it significantly outperforms traditional methods and open-source models in terms of critique quality and refinement outcomes. Our contributions include the introduction of RCO, a novel supervision scheme based on refined response preferences, and comprehensive experimental results that highlight the method's effectiveness in enhancing LLM critique-refinement loops.

Topik & Kata Kunci

Penulis (11)

T

Tianshu Yu

C

Chao Xiang

M

Mingchuan Yang

P

Pei Ke

B

Bosi Wen

C

Cunxiang Wang

J

Jiale Cheng

L

Li Zhang

X

Xinyu Mu

C

Chuxiong Sun

M

Minlie Huang

Format Sitasi

Yu, T., Xiang, C., Yang, M., Ke, P., Wen, B., Wang, C. et al. (2025). Training Language Model to Critique for Better Refinement. https://arxiv.org/abs/2506.22157

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓