MCPcopy
hub / github.com/PRIME-RL/PRIME / load_fsdp_optimizer

Function load_fsdp_optimizer

training/verl/utils/fsdp_utils.py:115–122  ·  view source on GitHub ↗
(optimizer, device_id)

Source from the content-addressed store, hash-verified

113
114
115def load_fsdp_optimizer(optimizer, device_id):
116 for param_group in optimizer.param_groups:
117 for param in param_group['params']:
118 state = optimizer.state[param]
119 for key, value in state.items():
120 if isinstance(value, torch.Tensor):
121 state[key] = value.to(device_id, non_blocking=True)
122 torch.cuda.empty_cache()

Callers 3

update_actorMethod · 0.90
update_criticMethod · 0.90
compute_rm_scoreMethod · 0.90

Calls 1

toMethod · 0.80

Tested by

no test coverage detected