MCPcopy
hub / github.com/DLR-RM/stable-baselines3 / learn

Method learn

stable_baselines3/sac/sac.py:304–320  ·  view source on GitHub ↗
(
        self: SelfSAC,
        total_timesteps: int,
        callback: MaybeCallback = None,
        log_interval: int = 4,
        tb_log_name: str = "SAC",
        reset_num_timesteps: bool = True,
        progress_bar: bool = False,
    )

Source from the content-addressed store, hash-verified

302 self.logger.record("train/ent_coef_loss", np.mean(ent_coef_losses))
303
304 def learn(
305 self: SelfSAC,
306 total_timesteps: int,
307 callback: MaybeCallback = None,
308 log_interval: int = 4,
309 tb_log_name: str = "SAC",
310 reset_num_timesteps: bool = True,
311 progress_bar: bool = False,
312 ) -> SelfSAC:
313 return super().learn(
314 total_timesteps=total_timesteps,
315 callback=callback,
316 log_interval=log_interval,
317 tb_log_name=tb_log_name,
318 reset_num_timesteps=reset_num_timesteps,
319 progress_bar=progress_bar,
320 )
321
322 def _excluded_save_params(self) -> list[str]:
323 return super()._excluded_save_params() + ["actor", "critic", "critic_target"] # noqa: RUF005

Callers 15

test_her_normalizationFunction · 0.95
test_full_replay_bufferFunction · 0.95
test_sacFunction · 0.95
test_n_criticsFunction · 0.95
test_train_freqFunction · 0.95
test_train_freq_failFunction · 0.95
test_auto_wrapFunction · 0.45

Calls

no outgoing calls

Tested by 15

test_her_normalizationFunction · 0.76
test_full_replay_bufferFunction · 0.76
test_sacFunction · 0.76
test_n_criticsFunction · 0.76
test_train_freqFunction · 0.76
test_train_freq_failFunction · 0.76
test_auto_wrapFunction · 0.36