github.com/Farama-Foundation/Gymnasium @v1.3.0 sqlite

2,027 symbols 8,520 edges 259 files 1,176 documented · 58%
README


Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms. It provides a standard API for communication between learning algorithms and environments, as well as a standard set of environments compliant with that API. It is a fork of OpenAI's Gym library by its maintainers (OpenAI handed over maintenance to an outside team a few years ago), and it is where future maintenance will occur.

The documentation website is at gymnasium.farama.org, and we have a public Discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/bnJ6kubTg6

Environments

Gymnasium includes the following families of environments, along with a wide variety of third-party environments:

  • Classic Control - Classic reinforcement learning environments based on real-world problems and physics.
  • Box2D - Toy games based around physics control, using Box2D-based physics and PyGame-based rendering.
  • Toy Text - Environments designed to be extremely simple, with small discrete state and action spaces, and hence easy to learn. As a result, they are suitable for debugging implementations of reinforcement learning algorithms.
  • MuJoCo - Physics-engine-based environments with multi-joint control, which are more complex than the Box2D environments.
  • Atari - Emulated Atari 2600 games (ROMs) that offer a wide range of complexity for agents to learn.
  • Third-party - A number of environments have been created that are compatible with the Gymnasium API. Be aware of the version that the software was created for and use apply_env_compatibility in gymnasium.make if necessary.

Installation

To install the base Gymnasium library, use pip install gymnasium

This does not include dependencies for all families of environments (there are a large number of them, and some can be problematic to install on certain systems). You can install the dependencies for one family with, for example, pip install "gymnasium[atari]", or use pip install "gymnasium[all]" to install all dependencies.

We support and test for Python 3.10, 3.11, 3.12 and 3.13 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

API

The Gymnasium API models environments as simple Python env classes. Creating environment instances and interacting with them is straightforward. Here is an example using the "CartPole-v1" environment:

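import gymnasium as gym

# Create the environment
env = gym.make("CartPole-v1")

# Reset with a seed to get the first observation
observation, info = env.reset(seed=42)
for _ in range(1000):
    # Sample a random action; a trained policy would go here
    action = env.action_space.sample()
    # Apply the action and receive the next observation and reward
    observation, reward, terminated, truncated, info = env.step(action)

    # Start a new episode once the current one has ended
    if terminated or truncated:
        observation, info = env.reset()
env.close()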

Notable Related Libraries

Please note that this is an incomplete list, and just includes libraries that the maintainers most commonly point newcomers to when asked for recommendations.

  • CleanRL is a learning library based on the Gymnasium API. It is designed to cater to newer people in the field and provides very good reference implementations.
  • PettingZoo is a multi-agent version of Gymnasium with a number of implemented environments, for example, multi-agent Atari environments.
  • The Farama Foundation also has a collection of many other environments that are maintained by the same team as Gymnasium and use the Gymnasium API.

Environment Versioning

Gymnasium keeps strict versioning for reproducibility reasons. All environment IDs end in a suffix like "-v0". When a change to an environment might impact learning results, the version number is increased by one to prevent confusion. This versioning scheme was inherited from Gym.
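As an illustration of the suffix convention, the sketch below splits an environment ID into its name and version number so that two experiment logs can be checked for a version mismatch. The pattern is a simplified assumption of ours, and split_env_id and same_env_version are hypothetical helpers, not Gymnasium's own ID parser.

```python
import re

# Simplified, assumed pattern "<name>-v<number>"; not Gymnasium's own parser.
_ENV_ID_PATTERN = re.compile(r"^(?P<name>.+)-v(?P<version>\d+)$")


def split_env_id(env_id: str) -> tuple[str, int]:
    """Split an ID such as "CartPole-v1" into ("CartPole", 1)."""
    match = _ENV_ID_PATTERN.match(env_id)
    if match is None:
        raise ValueError(f"no '-v<number>' suffix in {env_id!r}")
    return match.group("name"), int(match.group("version"))


def same_env_version(id_a: str, id_b: str) -> bool:
    """True only if both IDs name the same environment at the same version."""
    return split_env_id(id_a) == split_env_id(id_b)


print(split_env_id("CartPole-v1"))
print(same_env_version("CartPole-v0", "CartPole-v1"))
```

Results obtained on different versions of the same environment are not directly comparable, which is the situation same_env_version is meant to flag.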

Contributing

We welcome contributions from the community! Please see our CONTRIBUTING.md for details on how to get started.

Support Gymnasium's Development

If you are financially able to do so and would like to support the development of Gymnasium, please join others in the community in donating to us.

Citation

You can cite Gymnasium using our related paper (https://arxiv.org/abs/2407.17032) as:

@article{towers2024gymnasium,
  title={Gymnasium: A Standard Interface for Reinforcement Learning Environments},
  author={Towers, Mark and Kwiatkowski, Ariel and Terry, Jordan and Balis, John U and De Cola, Gianluca and Deleu, Tristan and Goul{\~a}o, Manuel and Kallinteris, Andreas and Krimmel, Markus and KG, Arjun and others},
  journal={arXiv preprint arXiv:2407.17032},
  year={2024}
}

Core symbols most depended-on inside this repo

make · called by 213 · gymnasium/envs/registration.py
data_equivalence · called by 135 · gymnasium/utils/env_checker.py
close · called by 126 · gymnasium/core.py
keys · called by 93 · gymnasium/spaces/dict.py
seed · called by 85 · gymnasium/spaces/dict.py
register · called by 63 · gymnasium/envs/registration.py
spec · called by 32 · gymnasium/core.py
batch_space · called by 30 · gymnasium/vector/utils/space_utils.py

Shape

Method 953
Function 803
Class 248
Route 23

Languages

Python 100%

Modules by API surface

gymnasium/spaces/utils.py · 52 symbols
gymnasium/core.py · 49 symbols
gymnasium/vector/vector_env.py · 43 symbols
gymnasium/envs/mujoco/mujoco_rendering.py · 42 symbols
gymnasium/vector/utils/space_utils.py · 40 symbols
gymnasium/envs/registration.py · 31 symbols
gymnasium/wrappers/rendering.py · 30 symbols
gymnasium/envs/tabular/blackjack.py · 30 symbols
gymnasium/wrappers/common.py · 27 symbols
tests/test_core.py · 26 symbols
gymnasium/wrappers/transform_observation.py · 25 symbols
tests/utils/test_play.py · 24 symbols

Dependencies from manifests, versioned

cloudpickle 1.2.0 · 1×
farama-notifications 0.0.1 · 1×
numpy 1.21.0 · 1×
sphinx-gallery 0.14.0 · 1×
typing-extensions 4.3.0 · 1×

For agents

$ claude mcp add Gymnasium \
  -- python -m otcore.mcp_server <graph>
