github.com/MorvanZhou/Reinforcement-learning-with-tensorflow @main sqlite

414 symbols 998 edges 56 files 12 documented · 3%
README
<a href="https://www.youtube.com/watch?v=pieI7rOXELI&list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba" target="_blank">
<img width="60%" src="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/blob/master/RL_cover.jpg" style="max-width:100%;">
</a>

Reinforcement Learning Methods and Tutorials

These reinforcement learning tutorials cover everything from basic RL algorithms to advanced algorithms developed in recent years.

If you speak Chinese, visit 莫烦 Python or my YouTube channel for more.

Because many people have asked for these tutorials in English, they are available in this playlist: https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba

Table of Contents

Some RL Networks

Deep Q Network

Double DQN

Dueling DQN

Actor Critic

Deep Deterministic Policy Gradient

A3C

Proximal Policy Optimization (PPO)

Curiosity Model

Donation

If these tutorials help you, please consider donating to support me in making better ones. Any contribution is greatly appreciated!

PayPal

Patreon

Core symbols most depended-on inside this repo

reset · called by 21 · contents/11_Dyna_Q/maze_env.py
step · called by 21 · contents/11_Dyna_Q/maze_env.py
render · called by 15 · contents/11_Dyna_Q/maze_env.py
add · called by 9 · contents/5.2_Prioritized_Replay_DQN/RL_brain.py
step · called by 8 · experiments/2D_car/car_env.py
reset · called by 8 · experiments/2D_car/car_env.py
update · called by 8 · contents/12_Proximal_Policy_Optimization/DPPO.py
render · called by 7 · experiments/2D_car/car_env.py
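
Most of the heavily called symbols above (reset, step, render) are environment methods. The sketch below shows how an interface of that shape is commonly driven in an RL loop. It is illustrative only: `CorridorEnv`, `run_episode`, and the `(observation, reward, done)` return convention are assumptions made for this example, not code or signatures taken from the repository.

```python
class CorridorEnv:
    """Hypothetical 1-D corridor: the agent starts at cell 0, the goal is the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        """Start a new episode and return the initial observation."""
        self.position = 0
        return self.position

    def step(self, action):
        """Apply an action (0 = left, 1 = right); return (observation, reward, done)."""
        move = 1 if action == 1 else -1
        # Clamp the position so the agent cannot leave the corridor.
        self.position = min(max(self.position + move, 0), self.length - 1)
        done = self.position == self.length - 1
        reward = 1.0 if done else 0.0
        return self.position, reward, done

    def render(self):
        """Return a text picture of the corridor, marking the agent with 'A'."""
        return "".join("A" if i == self.position else "." for i in range(self.length))


def run_episode(env, policy, max_steps=100):
    """Drive one episode: reset once, then alternate policy decisions and steps."""
    observation = env.reset()
    total_reward = 0.0
    frames = [env.render()]
    for _ in range(max_steps):
        observation, reward, done = env.step(policy(observation))
        total_reward += reward
        frames.append(env.render())
        if done:
            break
    return total_reward, frames


# A fixed "always move right" policy stands in for a learned agent.
total, frames = run_episode(CorridorEnv(length=5), policy=lambda obs: 1)
print(total, frames[-1])
```

Keeping the loop in `run_episode` separate from the environment means the same driver can be reused with any object that offers these three methods, which is one plausible reason such methods accumulate many callers.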

Shape

Method 321
Class 72
Function 21

Languages

Python 100%

Modules by API surface

experiments/Solve_BipedalWalker/DDPG.py · 25 symbols
experiments/Solve_LunarLander/DuelingDQNPrioritizedReplay.py · 22 symbols
experiments/Robot_arm/arm_env.py · 18 symbols
contents/5.2_Prioritized_Replay_DQN/RL_brain.py · 18 symbols
experiments/Robot_arm/DDPG.py · 16 symbols
experiments/2D_car/DDPG.py · 16 symbols
experiments/2D_car/car_env.py · 14 symbols
contents/9_Deep_Deterministic_Policy_Gradient_DDPG/DDPG.py · 14 symbols
contents/3_Sarsa_maze/RL_brain.py · 11 symbols
contents/11_Dyna_Q/RL_brain.py · 10 symbols
experiments/Solve_LunarLander/A3C.py · 9 symbols
experiments/Solve_BipedalWalker/A3C_rnn.py · 9 symbols

For agents

$ claude mcp add Reinforcement-learning-with-tensorflow \
  -- python -m otcore.mcp_server <graph>