Update method for fixed KL controller (no-op). Args: current_kl (float): Current KL divergence value (unused). n_steps (int): Number of steps taken (unused).
(self, current_kl, n_steps)
| 181 | self.value = kl_coef |
| 182 | |
| 183 | def update(self, current_kl, n_steps): |
| 184 | """Update method for fixed KL controller (no-op). |
| 185 | |
| 186 | Args: |
| 187 | current_kl (float): Current KL divergence value (unused). |
| 188 | n_steps (int): Number of steps taken (unused). |
| 189 | """ |
| 190 | pass |
| 191 | |
| 192 | |
| 193 | def get_kl_controller(kl_ctrl): |
no outgoing calls