File tutorial_PG.py

examples/reinforcement_learning/tutorial_PG.py:None–None · view source on GitHub ↗

Source from the content-addressed store, hash-verified

1	"""
2	Vanilla Policy Gradient(VPG or REINFORCE)
3	-----------------------------------------
4	The policy gradient algorithm works by updating policy parameters via stochastic gradient ascent on policy performance.

nothing calls this directly

PolicyGradientClass · 0.85

resetMethod · 0.45

get_actionMethod · 0.45

stepMethod · 0.45

store_transitionMethod · 0.45

learnMethod · 0.45

saveMethod · 0.45

loadMethod · 0.45

no test coverage detected