arXiv Open Access 2025

AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

Dhaval Patel Shuxin Lin James Rayfield Nianjun Zhou Chathurangi Shyalika +5 lainnya

Lihat Sumber

Abstrak

AI for Industrial Asset Lifecycle Management aims to automate complex operational workflows, such as condition monitoring and maintenance scheduling, to minimize system downtime. While traditional AI/ML approaches solve narrow tasks in isolation, Large Language Model (LLM) agents offer a next-generation opportunity for end-to-end automation. In this paper, we introduce AssetOpsBench, a unified framework for orchestrating and evaluating domain-specific agents for Industry 4.0. AssetOpsBench provides a multimodal ecosystem comprising a catalog of four domain-specific agents, a curated dataset of 140+ human-authored natural-language queries grounded in real industrial scenarios, and a simulated, CouchDB-backed IoT environment. We introduce an automated evaluation framework that uses three key metrics to analyze architectural trade-offs between the Tool-As-Agent and Plan-Executor paradigms, along with a systematic procedure for the automated discovery of emerging failure modes. The practical relevance of AssetOpsBench is demonstrated by its broad community adoption, with 250+ users and over 500 agents submitted to our public benchmarking platform, supporting reproducible and scalable research for real-world industrial operations. The code is accesible at https://github.com/IBM/AssetOpsBench .

Topik & Kata Kunci

cs.AI cs.MA

Penulis (10)

Dhaval Patel

Shuxin Lin

James Rayfield

Nianjun Zhou

Chathurangi Shyalika

Suryanarayana R Yarrabothula

Roman Vaculin

Natalia Martinez

Fearghal O'donncha

Jayant Kalagnanam

Format Sitasi

APA MLA BibTeX

Patel, D., Lin, S., Rayfield, J., Zhou, N., Shyalika, C., Yarrabothula, S.R. et al. (2025). AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance. https://arxiv.org/abs/2506.03828

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓