arXiv Open Access 2025

AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

Dhaval Patel Shuxin Lin James Rayfield Nianjun Zhou Chathurangi Shyalika +5 lainnya
Lihat Sumber

Abstrak

AI for Industrial Asset Lifecycle Management aims to automate complex operational workflows, such as condition monitoring and maintenance scheduling, to minimize system downtime. While traditional AI/ML approaches solve narrow tasks in isolation, Large Language Model (LLM) agents offer a next-generation opportunity for end-to-end automation. In this paper, we introduce AssetOpsBench, a unified framework for orchestrating and evaluating domain-specific agents for Industry 4.0. AssetOpsBench provides a multimodal ecosystem comprising a catalog of four domain-specific agents, a curated dataset of 140+ human-authored natural-language queries grounded in real industrial scenarios, and a simulated, CouchDB-backed IoT environment. We introduce an automated evaluation framework that uses three key metrics to analyze architectural trade-offs between the Tool-As-Agent and Plan-Executor paradigms, along with a systematic procedure for the automated discovery of emerging failure modes. The practical relevance of AssetOpsBench is demonstrated by its broad community adoption, with 250+ users and over 500 agents submitted to our public benchmarking platform, supporting reproducible and scalable research for real-world industrial operations. The code is accesible at https://github.com/IBM/AssetOpsBench .

Topik & Kata Kunci

Penulis (10)

D

Dhaval Patel

S

Shuxin Lin

J

James Rayfield

N

Nianjun Zhou

C

Chathurangi Shyalika

S

Suryanarayana R Yarrabothula

R

Roman Vaculin

N

Natalia Martinez

F

Fearghal O'donncha

J

Jayant Kalagnanam

Format Sitasi

Patel, D., Lin, S., Rayfield, J., Zhou, N., Shyalika, C., Yarrabothula, S.R. et al. (2025). AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance. https://arxiv.org/abs/2506.03828

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓