MCPcopy Index your code
hub / github.com/lazyprogrammer/machine_learning_examples / test_agent

Function test_agent

rl/cartpole_gym0.19.py:69–80  ·  view source on GitHub ↗
(model, env, n_episodes=20)

Source from the content-addressed store, hash-verified

67
68
69def test_agent(model, env, n_episodes=20):
70 reward_per_episode = np.zeros(n_episodes)
71 for it in range(n_episodes):
72 done = False
73 episode_reward = 0
74 s = env.reset()
75 while not done:
76 a = epsilon_greedy(model, s, eps=0)
77 s, r, done, info = env.step(a)
78 episode_reward += r
79 reward_per_episode[it] = episode_reward
80 return np.mean(reward_per_episode)
81
82
83def watch_agent(model, env, eps):

Callers 1

Calls 3

epsilon_greedyFunction · 0.70
resetMethod · 0.45
stepMethod · 0.45

Tested by

no test coverage detected