github.com/FareedKhan-dev/train-llm-from-scratch / train_reward.py
File: scripts/train_reward.py
Source (from the content-addressed store, hash-verified):
"""
Train the reward model on preference pairs with the Bradley-Terry loss.

Initializes the reward backbone from the SFT checkpoint, adds a scalar reward head, and
Callers: nothing calls this directly
Calls (1): main (Function · 0.70)
Tested by: no test coverage detected
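
The module docstring names the Bradley-Terry loss on preference pairs but the excerpt stops before any implementation. As a rough illustration only, the sketch below computes that loss in plain Python: for each pair, the negative log-sigmoid of the reward margin between the chosen and the rejected response, averaged over the batch. The function name `bradley_terry_loss` and its list-of-floats interface are assumptions made for this example; they are not taken from `train_reward.py`, which presumably works on tensors produced by the reward head.

```python
import math


def bradley_terry_loss(chosen_rewards, rejected_rewards):
    """Mean of -log(sigmoid(r_chosen - r_rejected)) over preference pairs.

    Hypothetical helper for illustration; not the repository's code.
    """
    if len(chosen_rewards) != len(rejected_rewards):
        raise ValueError("need exactly one rejected reward per chosen reward")
    if not chosen_rewards:
        raise ValueError("need at least one preference pair")

    total = 0.0
    for r_chosen, r_rejected in zip(chosen_rewards, rejected_rewards):
        margin = r_chosen - r_rejected
        # -log(sigmoid(m)) equals log(1 + exp(-m)), i.e. softplus(-m).
        # The max/log1p form avoids overflow for large |m|.
        total += max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
    return total / len(chosen_rewards)


if __name__ == "__main__":
    # Scalar rewards for two preference pairs (made-up numbers).
    print(bradley_terry_loss([1.5, 0.2], [0.3, 0.9]))
```

The loss shrinks as the model scores the chosen response higher than the rejected one, and equals log 2 when both receive the same reward, which is the behaviour a reward model trained on preference pairs is meant to exploit.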