arXiv Open Access 2026

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

Ivaxi Sheth Zhijing Jin Bryan Wilder Dominik Janzing Mario Fritz
Lihat Sumber

Abstrak

In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs) are used to isolate the causal effect of the endogenous variable. Identifying valid instruments requires interdisciplinary knowledge, creativity, and contextual understanding, making it a non-trivial task. In this paper, we investigate whether large language models (LLMs) can aid in this task. We perform a two-stage evaluation framework. First, we test whether LLMs can recover well-established instruments from the literature, assessing their ability to replicate standard reasoning. Second, we evaluate whether LLMs can identify and avoid instruments that have been empirically or theoretically discredited. Building on these results, we introduce IV Co-Scientist, a multi-agent system that proposes, critiques, and refines IVs for a given treatment-outcome pair. We also introduce a statistical test to contextualize consistency in the absence of ground truth. Our results show the potential of LLMs to discover valid instrumental variables from a large observational database.

Topik & Kata Kunci

Penulis (5)

I

Ivaxi Sheth

Z

Zhijing Jin

B

Bryan Wilder

D

Dominik Janzing

M

Mario Fritz

Format Sitasi

Sheth, I., Jin, Z., Wilder, B., Janzing, D., Fritz, M. (2026). IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery. https://arxiv.org/abs/2602.07943

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓