MCPcopy
hub / github.com/FareedKhan-dev/train-llm-from-scratch / grpo_live.py

File grpo_live.py

tests/grpo_live.py:None–None  ·  view source on GitHub ↗

Source from the content-addressed store, hash-verified

1"""Live-progress GRPO-ascends-reward proof (prints every few iters). See verify_rl_optimizes.py."""
2import torch
3from src.models.transformer import Transformer
4from src.post_training.grpo import group_advantages, grpo_loss

Callers

nothing calls this directly

Calls 7

set_seedFunction · 0.90
TransformerClass · 0.90
make_frozen_copyFunction · 0.90
generate_with_logprobsFunction · 0.90
group_advantagesFunction · 0.90
compute_logprobsFunction · 0.90
grpo_lossFunction · 0.90

Tested by

no test coverage detected