MCPcopy Index your code
hub / github.com/google-deepmind/learning-to-learn

github.com/google-deepmind/learning-to-learn @main sqlite

repository ↗ · DeepWiki ↗
124 symbols 281 edges 12 files 77 documented · 62%
README

Learning to Learn in TensorFlow

Dependencies

Training

python train.py --problem=mnist --save_path=./mnist

Command-line flags:

  • save_path: If present, the optimizer will be saved to the specified path every time the evaluation performance is improved.
  • num_epochs: Number of training epochs.
  • log_period: Epochs before mean performance and time is reported.
  • evaluation_period: Epochs before the optimizer is evaluated.
  • evaluation_epochs: Number of evaluation epochs.
  • problem: Problem to train on. See Problems section below.
  • num_steps: Number of optimization steps.
  • unroll_length: Number of unroll steps for the optimizer.
  • learning_rate: Learning rate.
  • second_derivatives: If true, the optimizer will try to compute second derivatives through the loss function specified by the problem.

Evaluation

python evaluate.py --problem=mnist --optimizer=L2L --path=./mnist

Command-line flags:

  • optimizer: Adam or L2L.
  • path: Path to saved optimizer, only relevant if using the L2L optimizer.
  • learning_rate: Learning rate, only relevant if using Adam optimizer.
  • num_epochs: Number of evaluation epochs.
  • seed: Seed for random number generation.
  • problem: Problem to evaluate on. See Problems section below.
  • num_steps: Number of optimization steps.

Problems

The training and evaluation scripts support the following problems (see util.py for more details):

  • simple: One-variable quadratic function.
  • simple-multi: Two-variable quadratic function, where one of the variables is optimized using a learned optimizer and the other one using Adam.
  • quadratic: Batched ten-variable quadratic function.
  • mnist: Mnist classification using a two-layer fully connected network.
  • cifar: Cifar10 classification using a convolutional neural network.
  • cifar-multi: Cifar10 classification using a convolutional neural network, where two independent learned optimizers are used. One to optimize parameters from convolutional layers and the other one for parameters from fully connected layers.

New problems can be implemented very easily. You can see in train.py that the meta_minimize method from the MetaOptimizer class is given a function that returns the TensorFlow operation that generates the loss function we want to minimize (see problems.py for an example).

It's important that all operations with Python side effects (e.g. queue creation) must be done outside of the function passed to meta_minimize. The cifar10 function in problems.py is a good example of a loss function that uses TensorFlow queues.

Disclaimer: This is not an official Google product.

Core symbols most depended-on inside this repo

meta_minimize
called by 9
meta.py
initial_state_for_inputs
called by 6
networks.py
get_net_path
called by 4
util.py
get_default_net_config
called by 4
util.py
ensemble
called by 3
problems.py
initial_state_for_inputs
called by 3
networks.py
initial_state_for_inputs
called by 3
networks.py
initial_state_for_inputs
called by 3
networks.py

Shape

Method 68
Function 35
Class 21

Languages

Python100%

Modules by API surface

networks.py31 symbols
problems_test.py16 symbols
networks_test.py16 symbols
meta.py15 symbols
problems.py11 symbols
meta_test.py10 symbols
preprocess_test.py9 symbols
preprocess.py6 symbols
util.py5 symbols
convergence_test.py3 symbols
train.py1 symbols
evaluate.py1 symbols

For agents

$ claude mcp add learning-to-learn \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact