arXiv Open Access 2023

Evaluating the Surrogate Index as a Decision-Making Tool Using 200 A/B Tests at Netflix

Vickie Zhang Michael Zhao and Maria Dimakopoulou Anh Le Nathan Kallus
Lihat Sumber

Abstrak

Surrogate index approaches have recently become a popular method of estimating longer-term impact from shorter-term outcomes. In this paper, we leverage 1098 test arms from 200 A/B tests at Netflix to empirically investigate to what degree would decisions made using a surrogate index utilizing 14 days of data would align with those made using direct measurement of day 63 treatment effects. Focusing specifically on linear "auto-surrogate" models that utilize the shorter-term observations of the long-term outcome of interest, we find that the statistical inferences that we would draw from using the surrogate index are ~95% consistent with those from directly measuring the long-term treatment effect. Moreover, when we restrict ourselves to the set of tests that would be "launched" (i.e. positive and statistically significant) based on the 63-day directly measured treatment effects, we find that relying instead on the surrogate index achieves 79% and 65% recall.

Topik & Kata Kunci

Penulis (5)

V

Vickie Zhang

M

Michael Zhao

a

and Maria Dimakopoulou

A

Anh Le

N

Nathan Kallus

Format Sitasi

Zhang, V., Zhao, M., Dimakopoulou, a.M., Le, A., Kallus, N. (2023). Evaluating the Surrogate Index as a Decision-Making Tool Using 200 A/B Tests at Netflix. https://arxiv.org/abs/2311.11922

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓