github.com/RLinf/RLinf @v0.2 sqlite

repository ↗ · DeepWiki ↗ · release v0.2 ↗
4,971 symbols · 18,162 edges · 474 files · 1,807 documented (36%)
README

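RLinf: A Reinforcement Learning Framework for Embodied Intelligence and Agents

RLinf is a flexible and scalable open-source framework designed for embodied intelligence and agents. The "inf" in its name stands for both Infrastructure, underscoring its role as a solid foundation for next-generation training, and Infinite, reflecting its support for open-ended learning, continuous generalization, and the limitless possibilities of intelligence development.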

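What's New

Key Features

RLinf is highly flexible: it supports a range of reinforcement learning training workflows (PPO, GRPO, SAC, and others) while hiding the complexity of distributed programming. Users can scale RL training to a large number of GPU nodes without modifying their code, meeting the growing compute demands of RL training.

This flexibility lets RLinf explore more efficient scheduling and execution modes. In embodied RL, its hybrid execution mode achieves up to 2.434 times the throughput of existing frameworks.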

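Multi-Backend Integration Support

  • FSDP + HuggingFace/SGLang/vLLM: quick to adapt to new models and new algorithms; well suited to beginners and rapid prototyping.
  • Megatron + SGLang/vLLM: optimized for large-scale training, giving expert users maximum efficiency.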

Embodied Intelligence

Simulators · Real-World Robots · Models · Algorithms

Agentic Reinforcement Learning

Core symbols (most depended-on inside this repo)

get · called by 811 · rlinf/utils/timers.py
apply · called by 219 · rlinf/utils/patcher.py
update · called by 200 · rlinf/runners/agent_eval_runner.py
keys · called by 191 · rlinf/data/datasets/world_model.py
get · called by 169 · rlinf/agents/wideseek_r1/utils/webpage.py
wait · called by 167 · rlinf/scheduler/collective/async_work.py
stop · called by 162 · rlinf/utils/timers.py
log · called by 121 · rlinf/utils/metric_logger.py
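
A "called by" figure like the ones above can be read as the in-degree of a symbol in a call graph. The sketch below shows one way such a count might be derived from a list of (caller, callee) edges. It is an illustration only: the edge data and the names `caller_a`, `caller_b`, and `called_by_counts` are hypothetical, and how this page actually computes its counts is not stated.

```python
from collections import Counter

# Hypothetical (caller, callee) edges; these names are illustrative
# and are not taken from the RLinf call graph.
edges = [
    ("caller_a", "get"),
    ("caller_b", "get"),
    ("caller_a", "log"),
    ("caller_a", "get"),  # repeated edge, collapsed before counting
]

def called_by_counts(edges):
    """Count distinct callers per callee (in-degree over unique edges)."""
    unique_edges = set(edges)
    return Counter(callee for _caller, callee in unique_edges)

counts = called_by_counts(edges)
# Rank symbols from most to least depended-on.
ranked = counts.most_common()
print(ranked)
```

Collapsing repeated edges first is a design choice assumed here: it counts distinct callers rather than call sites, so a symbol called many times from a single function still scores 1.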

Shape

Method 3,308
Function 1,068
Class 580
Route 15
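
As a quick consistency check, the four counts under Shape can be summed and compared with the symbol total in the page header, and the documented share can be recomputed. This is a minimal sketch that uses only numbers shown on this page; the variable names are illustrative.

```python
# Counts copied from the "Shape" list and the header line of this page.
shape = {"Method": 3308, "Function": 1068, "Class": 580, "Route": 15}
total_symbols = 4971
documented = 1807

# The four kinds account for every symbol in the header total.
assert sum(shape.values()) == total_symbols

# Share of each kind, in percent, rounded to one decimal place.
shares = {kind: round(100 * n / total_symbols, 1) for kind, n in shape.items()}

# Documented share, rounded to a whole percent as in the header.
documented_pct = round(100 * documented / total_symbols)

print(shares)
print(documented_pct)
```

Methods make up roughly two thirds of all symbols, and the recomputed documented share agrees with the 36% shown in the header.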

Languages

Python 89%
TypeScript 11%

Modules by API surface

docs/source-zh/_static/typesense.min.js · 144 symbols
docs/source-en/_static/typesense.min.js · 144 symbols
tests/unit_tests/test_comm.py · 140 symbols
tests/unit_tests/test_channel.py · 72 symbols
tests/unit_tests/test_placement.py · 57 symbols
rlinf/scheduler/worker/worker.py · 57 symbols
rlinf/scheduler/collective/collective_group.py · 57 symbols
rlinf/scheduler/dynamic_scheduler/manager.py · 55 symbols
tests/unit_tests/bench_channel.py · 52 symbols
rlinf/data/io_struct.py · 52 symbols
rlinf/utils/placement.py · 51 symbols
rlinf/data/replay_buffer.py · 48 symbols

Dependencies from manifests, versioned

Jinja2 3.1.6 · 1×
Markdown 3.8.2 · 1×
MarkupSafe 3.0.2 · 1×
PyYAML 6.0.2 · 1×
Pygments 2.19.2 · 1×
Sphinx 8.1.3 · 1×
accessible-pygments 0.0.5 · 1×
alabaster 1.0.0 · 1×
anyio 4.9.0 · 1×
babel 2.17.0 · 1×
beautifulsoup4 4.13.4 · 1×
certifi 2025.6.15 · 1×

For agents

$ claude mcp add RLinf \
  -- python -m otcore.mcp_server <graph>
