arXiv Open Access 2026

What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction

Lehui Li Ruining Wang Haochen Song Yaoxin Mao Tong Zhang +6 lainnya
Lihat Sumber

Abstrak

Automated paper reproduction -- generating executable code from academic papers -- is bottlenecked not by information retrieval but by the tacit knowledge that papers inevitably leave implicit. We formalize this challenge as the progressive recovery of three types of tacit knowledge -- relational, somatic, and collective -- and propose \method, a graph-based agent framework with a dedicated mechanism for each: node-level relation-aware aggregation recovers relational knowledge by analyzing implementation-unit-level reuse and adaptation relationships between the target paper and its citation neighbors; execution-feedback refinement recovers somatic knowledge through iterative debugging driven by runtime signals; and graph-level knowledge induction distills collective knowledge from clusters of papers sharing similar implementations. On an extended ReproduceBench spanning 3 domains, 10 tasks, and 40 recent papers, \method{} achieves an average performance gap of 10.04\% against official implementations, improving over the strongest baseline by 24.68\%. The code will be publicly released upon acceptance; the repository link will be provided in the final version.

Topik & Kata Kunci

Penulis (11)

L

Lehui Li

R

Ruining Wang

H

Haochen Song

Y

Yaoxin Mao

T

Tong Zhang

Y

Yuyao Wang

J

Jiayi Fan

Y

Yitong Zhang

J

Jieping Ye

C

Chengqi Zhang

Y

Yongshun Gong

Format Sitasi

Li, L., Wang, R., Song, H., Mao, Y., Zhang, T., Wang, Y. et al. (2026). What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction. https://arxiv.org/abs/2603.01801

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓