MCPcopy Index your code
hub / github.com/X-PLUG/MobileAgent

github.com/X-PLUG/MobileAgent @main sqlite

repository ↗ · DeepWiki ↗
9,972 symbols 32,694 edges 887 files 2,595 documented · 26%
README

Mobile-Agent: 强大的GUI智能体家族 by 通义实验室-阿里巴巴集团

MobileAgent | Trendshift

👏 欢迎通过我们的 Modelscope在线Demo 百炼在线Demo 试用 Mobile-Agent-v3.5!

❗️我们在百炼上提供 Mobile-Agent-v3.5 API,方便快速体验。请查看文档

🤗 <a href="https://huggingface.co/collections/mPLUG/gui-owl-15" target="_blank">GUI-Owl-1.5 Collection</a>|
<img src="https://github.com/X-PLUG/MobileAgent/raw/main/assets/tongyi.png" width="14px" style="display:inline;"> <a href="https://modelscope.cn/collections/iic/GUI-Owl-15" target="_blank">GUI-Owl-1.5 Collection</a>






🤗 <a href="https://huggingface.co/mPLUG/GUI-Owl-32B" target="_blank">GUI-Owl-32B</a> | 
<img src="https://github.com/X-PLUG/MobileAgent/raw/main/assets/tongyi.png" width="14px" style="display:inline;"> <a href="https://modelscope.cn/models/iic/GUI-Owl-32B" target="_blank">GUI-Owl-32B</a> |
🤗 <a href="https://huggingface.co/mPLUG/GUI-Owl-7B" target="_blank">GUI-Owl-7B</a> |
<img src="https://github.com/X-PLUG/MobileAgent/raw/main/assets/tongyi.png" width="14px" style="display:inline;"> <a href="https://modelscope.cn/models/iic/GUI-Owl-7B" target="_blank">GUI-Owl-7B</a>

简体中文 | English


📢新闻

  • [2026.5.12]🔥🔥🔥 ToolCUA 正式发布!一个面向 GUI 与工具最优路径编排 的端到端计算机使用智能体,通过两阶段训练流程(基于轨迹的工具合成 → 在线智能体强化学习)掌握何时使用 GUI 操作、何时调用工具及何时切换。论文及模型权重详见 HuggingFace,项目仓库见Github.。
  • [2026.3.31]🔥🔥 Mobile-Agent-v3.5现已在阿里云无影云手机上线——这是一个基于云端的安卓环境,可提供无缝的移动使用体验。了解更多Alibaba Cloud Wuying Cloud Phone | Documentation
  • [2026.3.19]🔥🔥 GUI-Owl-1.5 系列模型现已上线阿里云百炼平台,可供在线推理。请参考 阿里云百炼 魔搭API-Inference
  • [2026.2.14]🔥 GUI-Owl-1.5 正式发布,这是一个全新的原生多平台 GUI 代理基础模型系列(2B/4B/8B/32B/235B;指令与思考)。该新一代原生 GUI 代理模型系列基于 Qwen3-VL 构建,支持桌面/移动/浏览器自动化,并在 20 多个 GUI 基准测试中取得了SOTA 性能,在端到端任务、接地、工具/MCP 调用和长时域记忆方面均表现出色。模型权重可在 HuggingFace 获取。技术报告可在 链接 获取。详情请参阅 GUI-Owl 1.5 README
  • [2025.11.25] GUI-Owl系列模型现已支持在线推理,感谢阿里云百炼提供的算力支持。详情请见链接
  • [2025.10.30] 我们发布了 OSWorld-MCP,这是一个用于评估模型上下文协议 (MCP) 工具在实际场景中调用能力的基准测试工具。请参阅链接
  • [2025.9.24] 我们在 ModelScope 上发布了基于无影云电脑和云手机的 demo。无需本地部署模型或准备设备,只需输入指令即可体验 Mobile-Agent-v3! ModelScope Demo 链接 百炼 Demo 链接。限时免费的 Mobile-Agent-v3 API请查看文档。基于Qwen-3-VL的新版本即将到来。
  • [2025.9.19] GUI-Critic-R1 已被 第三十九届神经信息处理系统年会 (NeurIPS 2025) 接收。我们发布了最新成果 UI-S1:通过半在线强化学习推进 GUI 自动化论文代码模型 现已开源。
  • [2025.9.16] 我们发布了最新成果 UI-S1:通过半在线强化学习推进 GUI 自动化。论文(https://www.arxiv.org/abs/2509.11543)、代码(https://github.com/X-PLUG/MobileAgent/tree/main/UI-S1)、数据集(https://huggingface.co/datasets/mPLUG/UI_S1_dataset)和模型(https://huggingface.co/mPLUG/UI-S1-7B)现已开源。
  • [2025.9.16] 我们在 OSWorld、AndroidWorld 和实际移动场景中开源了 GUI-Owl 和 Mobile-Agent-v3 的代码。请参阅 OSWorld 代码。GUI-Owl 的 OSWorld 强化学习 checkpoint 也已发布。请参阅 AndroidWorld 代码真实场景代码
  • [2025.8.20]全新 GUI-OwlMobile-Agent-v3 正式发布!技术报告可在此处查看(https://arxiv.org/abs/2508.15144)。模型检查点将在 GUI-Owl-7BGUI-Owl-32B 上发布。
  • GUI-Owl 是一个多模态跨平台 GUI 虚拟层模型 (VLM),具备 GUI 感知、落地和端到端操作能力。
  • Mobile-Agent-v3 是一个基于 GUI-Owl 的跨平台多智能体框架,提供规划、进度管理、反射和内存管理等功能。
  • [2025.8.14]Mobile-Agent-v3 在第二十四届全国计算语言学大会 (CCL 2025) 上荣获 最佳演示奖
  • [2025.3.17] PC-Agent 已被 ICLR 2025 研讨会 接收。
  • [2024.9.26] Mobile-Agent-v2 已被 第三十八届神经信息处理系统年会 (NeurIPS 2024) 接收。
  • [2024.7.29] Mobile-Agent 在第二十三届全国计算语言学大会 (CCL 2024) 上荣获 最佳演示奖
  • [2024.3.10] Mobile-Agent 已被 ICLR 2024 研讨会 录用。

📊效果

👀特点

📝系列工作

📺Demo

<h3>了解Mobile-Agent-v3.5</h3>
<video src= "https://github.com/user-attachments/assets/c5b7c858-574d-4a6e-b784-5b6778f85071"/>

📱手机

<h3>帮我在小红书、抖音看一下“魔搭ModelScope社区”账号,并告诉我这两个平台的总粉丝数。</h3>
<video src= "https://github.com/user-attachments/assets/a12f4f8b-3e24-4349-ba51-b46f7d7e7154"/>







<h3>今天是2025年2月15号星期日,在携程旅行上搜索五天后从广州到成都的机票,查看最便宜的航班,然后搜索同路线最便宜的火车票,告诉我它们的价格。</h3>
<video src= "https://github.com/user-attachments/assets/2f04e4a1-a0a8-46bb-a9ee-aa22b119bcdb"/>

💻PC + 🌐网页

<h3>分别搜索苹果公司和英伟达公司的股票价格。然后在WPS Office中创建一个新的电子表格。在A列输入公司名称,在B列输入搜索到的股票价格。</h3>
<video src= "https://github.com/user-attachments/assets/819f87d5-6eac-4370-96bb-2d0a87dac3c6"/>








<h3>在WPS Office中创建一个新文档,撰写一段阿里巴巴的简介,并把字号设为12。从Edge浏览器搜索阿里巴巴的logo,复制一张图像并粘贴到WPS文档的最后。</h3>
<video src= "https://github.com/user-attachments/assets/302d6972-f21e-4176-965f-c82e4306ff07"/>

⭐Star History

Star History Chart

📑引用

如果您发现 Mobile-Agent 对您的研究和应用有用,请使用此 BibTeX 进行引用:

@article{xu2026mobile,
  title={Mobile-Agent-v3. 5: Multi-platform Fundamental GUI Agents},
  author={Xu, Haiyang and Zhang, Xi and Liu, Haowei and Wang, Junyang and Zhu, Zhaozai and Zhou, Shengjie and Hu, Xuhao and Gao, Feiyu and Cao, Junjie and Wang, Zihua and others},
  journal={arXiv preprint arXiv:2602.16855},
  year={2026}
}

@article{ye2025mobile,
  title={Mobile-Agent-v3: Foundamental Agents for GUI Automation},
  author={Ye, Jiabo and Zhang, Xi and Xu, Haiyang and Liu, Haowei and Wang, Junyang and Zhu, Zhaoqing and Zheng, Ziwei and Gao, Feiyu and Cao, Junjie and Lu, Zhengxi and others},
  journal={arXiv preprint arXiv:2508.15144},
  year={2025}
}

@article{lu2025ui,
  title={UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning},
  author={Lu, Zhengxi and Ye, Jiabo and Tang, Fei and Shen, Yongliang and Xu, Haiyang and Zheng, Ziwei and Lu, Weiming and Yan, Ming and Huang, Fei and Xiao, Jun and others},
  journal={arXiv preprint arXiv:2509.11543},
  year={2025}
}

@article{wanyan2025look,
  title={Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation},
  author={Wanyan, Yuyang and Zhang, Xi and Xu, Haiyang and Liu, Haowei and Wang, Junyang and Ye, Jiabo and Kou, Yutong and Yan, Ming and Huang, Fei and Yang, Xiaoshan and others},
  journal={arXiv preprint arXiv:2506.04614},
  year={2025}
}

@article{liu2025pc,
  title={PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC},
  author={Liu, Haowei and Zhang, Xi and Xu, Haiyang and Wanyan, Yuyang and Wang, Junyang and Yan, Ming and Zhang, Ji and Yuan, Chunfeng and Xu, Changsheng and Hu, Weiming and Huang, Fei},
  journal={arXiv preprint arXiv:2502.14282},
  year={2025}
}

@article{wang2025mobile,
  title={Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks},
  author={Wang, Zhenhailong and Xu, Haiyang and Wang, Junyang and Zhang, Xi and Yan, Ming and Zhang, Ji and Huang, Fei and Ji, Heng},
  journal={arXiv preprint arXiv:2501.11733},
  year={2025}
}

@article{wang2024mobile2,
  title={Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration},
  author={Wang, Junyang and Xu, Haiyang and Jia, Haitao and Zhang, Xi and Yan, Ming and Shen, Weizhou and Zhang, Ji and Huang, Fei and Sang, Jitao},
  journal={arXiv preprint arXiv:2406.01014},
  year={2024}
}

@article{wang2024mobile,
  title={Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception},
  author={Wang, Junyang and Xu, Haiyang and Ye, Jiabo and Yan, Ming and Shen, Weizhou and Zhang, Ji and Huang, Fei and Sang, Jitao},
  journal={arXiv preprint arXiv:2401.16158},
  year={2024}
}

Core symbols most depended-on inside this repo

call
called by 637
Mobile-Agent-v3.5/android_world_v3.5/android_world/agents/function_call_mobile_answer.py
call
called by 634
Mobile-Agent-v3/android_world_v3/android_world/agents/function_call_mobile_answer.py
get
called by 605
Mobile-Agent-v3/mobile_v3/utils/schema.py
get
called by 456
UI-S1/verl/utils/memory_buffer.py
slice
called by 441
UI-S1/verl/protocol.py
remove
called by 393
UI-S1/verl/utils/py_functional.py
add
called by 367
UI-S1/verl/utils/seqlen_balancing.py
concat
called by 352
UI-S1/verl/protocol.py

Shape

Function 4,906
Method 3,790
Class 1,125
Route 151

Languages

Python75%
TypeScript25%

Modules by API surface

Mobile-Agent-v3/android_world_v3/apps/java/com/google/androidenv/miniwob/app/assets/html/core/d3.v3.min.js389 symbols
Mobile-Agent-v3.5/android_world_v3.5/apps/java/com/google/androidenv/miniwob/app/assets/html/core/d3.v3.min.js389 symbols
Mobile-Agent-v3/android_world_v3/apps/java/com/google/androidenv/miniwob/app/assets/html/core/jquery-ui/external/jquery/jquery.js99 symbols
Mobile-Agent-v3.5/android_world_v3.5/apps/java/com/google/androidenv/miniwob/app/assets/html/core/jquery-ui/external/jquery/jquery.js99 symbols
Mobile-Agent-v3/android_world_v3/apps/java/com/google/androidenv/miniwob/app/assets/html/flight/AA/js/libs/jquery/jquery-1.11.1.min.js75 symbols
Mobile-Agent-v3.5/android_world_v3.5/apps/java/com/google/androidenv/miniwob/app/assets/html/flight/AA/js/libs/jquery/jquery-1.11.1.min.js75 symbols
Mobile-Agent-v3/android_world_v3/android_world/task_evals/single/markor.py73 symbols
Mobile-Agent-v3.5/android_world_v3.5/android_world/task_evals/single/markor.py73 symbols
Mobile-Agent-E/static/js/fontawesome.all.min.js70 symbols
Mobile-Agent-v3/android_world_v3/apps/java/com/google/androidenv/miniwob/app/assets/html/common/special/click-pie/raphael.min.js66 symbols
Mobile-Agent-v3.5/android_world_v3.5/apps/java/com/google/androidenv/miniwob/app/assets/html/common/special/click-pie/raphael.min.js66 symbols
Mobile-Agent-v3.5/grounding_and_kb/eval_gui_knowledge_benchmark.py65 symbols

Dependencies from manifests, versioned

Jinja23.1.4 · 1×
Keras-Preprocessing1.1.2 · 1×
Markdown3.6 · 1×
MarkupSafe2.1.5 · 1×
MouseInfo0.1.3 · 1×
Pillow11.1.0 · 1×
PyAutoGUI0.9.54 · 1×
PyGetWindow0.0.9 · 1×
PyMsgBox1.0.9 · 1×
PyRect0.2.0 · 1×
PyScreeze1.0.1 · 1×

For agents

$ claude mcp add MobileAgent \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact