arXiv Open Access 2025

UI-Evol: Automatic Knowledge Evolving for Computer Use Agents

Ziyun Zhang Xinyi Liu Xiaoyi Zhang Jun Wang Gang Chen +1 lainnya
Lihat Sumber

Abstrak

External knowledge has played a crucial role in the recent development of computer use agents. We identify a critical knowledge-execution gap: retrieved knowledge often fails to translate into effective real-world task execution. Our analysis shows even 90% correct knowledge yields only 41% execution success rate. To bridge this gap, we propose UI-Evol, a plug-and-play module for autonomous GUI knowledge evolution. UI-Evol consists of two stages: a Retrace Stage that extracts faithful objective action sequences from actual agent-environment interactions, and a Critique Stage that refines existing knowledge by comparing these sequences against external references. We conduct comprehensive experiments on the OSWorld benchmark with the state-of-the-art Agent S2. Our results demonstrate that UI-Evol not only significantly boosts task performance but also addresses a previously overlooked issue of high behavioral standard deviation in computer use agents, leading to superior performance on computer use tasks and substantially improved agent reliability.

Topik & Kata Kunci

Penulis (6)

Z

Ziyun Zhang

X

Xinyi Liu

X

Xiaoyi Zhang

J

Jun Wang

G

Gang Chen

Y

Yan Lu

Format Sitasi

Zhang, Z., Liu, X., Zhang, X., Wang, J., Chen, G., Lu, Y. (2025). UI-Evol: Automatic Knowledge Evolving for Computer Use Agents. https://arxiv.org/abs/2505.21964

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓