MCPcopy
hub / github.com/google-deepmind/acme

github.com/google-deepmind/acme @0.4.0 sqlite

repository ↗ · DeepWiki ↗ · release 0.4.0 ↗
1,940 symbols 7,043 edges 360 files 891 documented · 46%
README

Acme: a research framework for reinforcement learning

PyPI Python Version PyPI version acme-tests Documentation Status

Acme is a library of reinforcement learning (RL) building blocks that strives to expose simple, efficient, and readable agents. These agents first and foremost serve both as reference implementations as well as providing strong baselines for algorithm performance. However, the baseline agents exposed by Acme should also provide enough flexibility and simplicity that they can be used as a starting block for novel research. Finally, the building blocks of Acme are designed in such a way that the agents can be written at multiple scales (e.g. single-stream vs. distributed agents).

Getting started

The quickest way to get started is to take a look at the detailed working code examples found in the examples subdirectory. These show how to instantiate a number of different agents and run them within a variety of environments. See the quickstart notebook for an even quicker dive into using a single agent. Even more detail on the internal construction of an agent can be found inside our tutorial notebook. Finally, a full description Acme and its underlying components can be found by referring to the documentation. More background information and details behind the design decisions can be found in our technical report.

NOTE: Acme is first and foremost a framework for RL research written by researchers, for researchers. We use it for our own work on a daily basis. So with that in mind, while we will make every attempt to keep everything in good working order, things may break occasionally. But if so we will make our best effort to fix them as quickly as possible!

Installation

We have tested Acme on Python 3.7, 3.8 and 3.9. To get up and running quickly just follow the steps below:

  1. While you can install Acme in your standard python environment, we strongly recommend using a Python virtual environment to manage your dependencies. This should help to avoid version conflicts and just generally make the installation process easier.

    bash python3 -m venv acme source acme/bin/activate pip install --upgrade pip setuptools wheel

  2. While the core dm-acme library can be installed directly, the set of dependencies included for installation is minimal. In particular, to run any of the included agents you will also need either JAX or TensorFlow depending on the agent. As a result we recommend installing these components as well, i.e.

    bash pip install dm-acme[jax,tensorflow]

  3. Finally, to install a few example environments (including gym, dm_control, and bsuite):

    bash pip install dm-acme[envs]

  4. Installing from github: if you're interested in running the bleeding-edge version of Acme, you can do so by cloning the Acme GitHub repository and then executing following command from the main directory (where setup.py is located):

    bash pip install .[jax,tf,testing,envs]

Citing Acme

If you use Acme in your work, please cite the accompanying technical report:

@article{hoffman2020acme,
    title={Acme: A Research Framework for Distributed Reinforcement Learning},
    author={Matt Hoffman and Bobak Shahriari and John Aslanides and Gabriel
        Barth-Maron and Feryal Behbahani and Tamara Norman and Abbas Abdolmaleki
        and Albin Cassirer and Fan Yang and Kate Baumli and Sarah Henderson and
        Alex Novikov and Sergio Gómez Colmenarejo and Serkan Cabi and Caglar
        Gulcehre and Tom Le Paine and Andrew Cowie and Ziyu Wang and Bilal Piot
        and Nando de Freitas},
    year={2020},
    journal={arXiv preprint arXiv:2006.00979},
    url={https://arxiv.org/abs/2006.00979},
}

Core symbols most depended-on inside this repo

init
called by 71
acme/agents/tf/d4pg/agent.py
run
called by 63
acme/environment_loop.py
sample
called by 41
acme/datasets/tfds.py
increment
called by 33
acme/utils/counting.py
run
called by 32
setup.py
write
called by 27
acme/utils/loggers/base.py
signature
called by 24
acme/adders/reverb/base.py
save
called by 22
acme/agents/jax/bc/learning.py

Shape

Method 1,122
Class 417
Function 400
Route 1

Languages

Python100%

Modules by API surface

acme/testing/fakes.py37 symbols
acme/tf/savers.py34 symbols
acme/jax/utils.py29 symbols
acme/tf/networks/recurrence.py24 symbols
acme/jax/networks/distributional.py24 symbols
acme/jax/running_statistics.py21 symbols
acme/jax/networks/atari.py21 symbols
acme/agents/jax/normalization.py21 symbols
acme/tf/networks/atari.py20 symbols
acme/utils/loggers/filters.py19 symbols
acme/tf/savers_test.py18 symbols
acme/agents/jax/ail/networks.py18 symbols

For agents

$ claude mcp add acme \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact