MCPcopy
hub / github.com/ddbourgin/numpy-ml / update

Method update

numpy_ml/rl_models/agents.py:87–98  ·  view source on GitHub ↗

r""" Update the agent parameters according to the rewards accrued on the current episode. Returns ------- avg_reward : float The average reward earned by the best `retain_prcnt` theta samples on the current episode.

(self)

Source from the content-addressed store, hash-verified

85
86 @abstractmethod
87 def update(self):
88 r"""
89 Update the agent parameters according to the rewards accrued on the
90 current episode.
91
92 Returns
93 -------
94 avg_reward : float
95 The average reward earned by the best `retain_prcnt` theta samples
96 on the current episode.
97 """
98 raise NotImplementedError
99
100
101class CrossEntropyAgent(AgentBase):

Callers 6

_train_episodeMethod · 0.45
set_paramsMethod · 0.45
extract_gradsMethod · 0.45
extract_gradsMethod · 0.45
trainMethod · 0.45
trainMethod · 0.45

Calls

no outgoing calls

Tested by 2

trainMethod · 0.36
trainMethod · 0.36