A unified, open framework for building, orchestrating, and evaluating domain-specific AI agents in Industry 4.0.
📄 Paper · 🤗 Dataset · 🎮 Playground · 📢 IBM Blog · 🎥 Video · 📊 Kaggle · 🚀 Colab
[!IMPORTANT] 🎉 AssetOpsBench is officially accepted at KDD 2026 (Datasets & Benchmarks Track), Jeju, South Korea, alongside our hands-on tutorial Building Reliable Industrial Agents with MCP. See Publications for the full list of 2025–2026 work.
| 9 Asset classes | 141+ Scenarios | 5 Domain agents | 2 Orchestration frameworks | 20+ University extensions | 500+ Competition submissions |
Built for: maintenance engineers, reliability specialists, facility planners, and Industry 4.0 researchers. Powered by: LLMs + Time Series Foundation Models, orchestrated over live sensor data and Industry 4.0 records (FMEA, work orders, alerts). Now with: simplified interface and native MCP (Model Context Protocol) support.
# Clone and install
git clone https://github.com/IBM/AssetOpsBench.git
cd AssetOpsBench
pip install -e .
# Try a scenario (to be enabled)
python -m assetopsbench.run --scenario "List all sensors of Chiller 6 in MAIN site"
Or jump in instantly: - 🚀 Run on Colab — no install required (illustration of LLM Agent) - 🎮 Try the HF Playground — interactive demo - 📖 Read INSTRUCTIONS.md — full setup, MCP servers, plan-execute runner
[!NOTE] Active development is on
main. The codebase used for various publication venues continues to be maintained on separate branches, for example, ACL 2026IndustryAssetEQAand prior experimental work is maintained onmain-0.x.
AssetOpsBench is a unified framework for developing, orchestrating, and evaluating domain-specific AI agents in industrial asset operations and maintenance. It provides reproducible scenarios, agent tooling, and evaluation pipelines for multi-step workflows in simulated industrial environments.
| MCP Servers | Important tools |
|---|---|
| IoT | get_sites, get_history, get_assets, get_sensors |
| FMSR | get_sensors, get_failure_modes, get_failure_sensor_mapping |
| TSFM | forecasting, timeseries_anomaly_detection |
| WO | get_work_order_distribution, predict_next_work_order, ... |
| Vibration | compute_fft_spectrum, compute_envelope_spectrum, ... |
| ... | ... |
The src/ directory contains MCP servers and a plan-execute runner built on the Model Context Protocol. See INSTRUCTIONS.md for setup.
| Domain | Example Task |
|---|---|
| IoT | "List all sensors of Chiller 6 in MAIN site" |
| FMSR | "Identify failure modes detected by Chiller 6 Supply Temperature" |
| TSFM | "Forecast Chiller 9 Condenser Water Flow for the week of 2020-04-27" |
| WO | "Generate a work order for Chiller 6 anomaly detection" |
Some tasks focus on a single domain, others are multi-step end-to-end workflows. Explore all scenarios on Hugging Face.
Example: MetaAgent leaderboard
12+ contributions across 7 top venues in 2025–2026 from the team behind AssetOpsBench.
⭐ KDD 2026 — Jeju, South Korea (click to expand)
ACL 2026 - San Diego, USA
ICLR 2026 - Brazil
AAAI 2026 — Singapore
IAAI 2026 - Singapore
NeurIPS 2025 — San Diego, USA
EMNLP 2025 — Suzhou, China
📘 Hands-on guides from our team:
AssetOpsBench powers public AI agent competitions that bring together researchers, students, and practitioners worldwide.
Industrial Automation Challenge: Benchmarking Physics-Grounded LLMs for Task Reasoning
A new challenge co-located with IJCAI 2026 that pushes LLM agents on physics-grounded industrial reasoning.
AssetOpsBench-Live: AI Agentic Challenge
Launched in September 2025 at CODS 2025, the competition evaluated multi-agent systems on live industrial scenarios.
| Date | Event |
|---|---|
| 2026-08 | KDD 2026 — AssetOpsBench paper + MCP tutorial · Jeju, South Korea |
| 2026-05-10 | NUS Seminar: AssetOpsBench Applications |
| 2025-12 | NeurIPS 2025 Social: Building Reliable Agentic Benchmarks (2000+ registered) |
| 2025-10-03 | 2-Hour Workshop: AI Agents and Their Role in Industry 4.0 Applications · NJIT ACM |
| 2025-09-01 | CODS 2025 Competition Launch — AssetOpsBench-Live |
| 2025-06-01 | AssetOpsBench v1.0 released — 141 industrial scenarios |
AssetOpsBench is being extended by university research groups exploring new asset classes, evaluation paradigms, and agentic architectures. To list your project, open a PR.
$ claude mcp add AssetOpsBench \
-- python -m otcore.mcp_server <graph>