MCPcopy
hub / github.com/tensorlayer/TensorLayer / tutorial_PG.py

File tutorial_PG.py

examples/reinforcement_learning/tutorial_PG.py:None–None  ·  view source on GitHub ↗

Source from the content-addressed store, hash-verified

1"""
2Vanilla Policy Gradient(VPG or REINFORCE)
3-----------------------------------------
4The policy gradient algorithm works by updating policy parameters via stochastic gradient ascent on policy performance.

Callers

nothing calls this directly

Calls 8

PolicyGradientClass · 0.85
resetMethod · 0.45
get_actionMethod · 0.45
stepMethod · 0.45
store_transitionMethod · 0.45
learnMethod · 0.45
saveMethod · 0.45
loadMethod · 0.45

Tested by

no test coverage detected