hub / github.com/DLR-RM/stable-baselines3 / learn

Method learn

stable_baselines3/sac/sac.py:304–320 · view source on GitHub ↗

(
        self: SelfSAC,
        total_timesteps: int,
        callback: MaybeCallback = None,
        log_interval: int = 4,
        tb_log_name: str = "SAC",
        reset_num_timesteps: bool = True,
        progress_bar: bool = False,
    )

Source from the content-addressed store, hash-verified

302	self.logger.record("train/ent_coef_loss", np.mean(ent_coef_losses))
303
304	def learn(
305	self: SelfSAC,
306	total_timesteps: int,
307	callback: MaybeCallback = None,
308	log_interval: int = 4,
309	tb_log_name: str = "SAC",
310	reset_num_timesteps: bool = True,
311	progress_bar: bool = False,
312	) -> SelfSAC:
313	return super().learn(
314	total_timesteps=total_timesteps,
315	callback=callback,
316	log_interval=log_interval,
317	tb_log_name=tb_log_name,
318	reset_num_timesteps=reset_num_timesteps,
319	progress_bar=progress_bar,
320	)
321
322	def _excluded_save_params(self) -> list[str]:
323	return super()._excluded_save_params() + ["actor", "critic", "critic_target"] # noqa: RUF005

Callers 15

test_save_load_pytorch_varFunction · 0.95

test_her_normalizationFunction · 0.95

test_goal_selection_strategyFunction · 0.95

test_save_load_replay_bufferFunction · 0.95

test_full_replay_bufferFunction · 0.95

test_sac_train_with_batch_normFunction · 0.95

test_sacFunction · 0.95

test_n_criticsFunction · 0.95

test_train_freqFunction · 0.95

test_train_freq_failFunction · 0.95

test_auto_wrapFunction · 0.45

test_subclassed_space_envFunction · 0.45

Calls

no outgoing calls

Tested by 15

test_save_load_pytorch_varFunction · 0.76

test_her_normalizationFunction · 0.76

test_goal_selection_strategyFunction · 0.76

test_save_load_replay_bufferFunction · 0.76

test_full_replay_bufferFunction · 0.76

test_sac_train_with_batch_normFunction · 0.76

test_sacFunction · 0.76

test_n_criticsFunction · 0.76

test_train_freqFunction · 0.76

test_train_freq_failFunction · 0.76

test_auto_wrapFunction · 0.36

test_subclassed_space_envFunction · 0.36