Code
Hub
Workspaces
Connect
Indexed graphs
Engine
MCP
copy
hub
/
github.com/karpathy/nanochat
/ chat_rl.py
File
chat_rl.py
scripts/chat_rl.py:None–None ·
view source on GitHub ↗
Source
from the content-addressed store, hash-verified
1
""
"
2
Reinforcement learning on GSM8K via
"GRPO"
.
3
4
I put GRPO in quotes because we actually end up
with
something a lot
Callers
nothing calls this directly
Calls
15
autodetect_device_type
Function · 0.90
compute_init
Function · 0.90
DummyWandb
Class · 0.90
load_model
Function · 0.90
Engine
Class · 0.90
GSM8K
Class · 0.90
print0
Function · 0.90
get_base_dir
Function · 0.90
save_checkpoint
Function · 0.90
compute_cleanup
Function · 0.90
get_batch
Function · 0.85
run_gsm8k_eval
Function · 0.85
Tested by
no test coverage detected