MCPcopy Index your code
hub / github.com/Netflix/metaflow

github.com/Netflix/metaflow @2.19.35 sqlite

repository ↗ · DeepWiki ↗ · release 2.19.35 ↗
13,463 symbols 46,267 edges 718 files 2,193 documented · 16%
README

Metaflow_Logo_Horizontal_FullColor_Ribbon_Dark_RGB

Metaflow

Metaflow is a human-centric framework designed to help scientists and engineers build and manage real-life AI and ML systems. Serving teams of all sizes and scale, Metaflow streamlines the entire development lifecycle—from rapid prototyping in notebooks to reliable, maintainable production deployments—enabling teams to iterate quickly and deliver robust systems efficiently.

Originally developed at Netflix and now supported by Outerbounds, Metaflow is designed to boost the productivity for research and engineering teams working on a wide variety of projects, from classical statistics to state-of-the-art deep learning and foundation models. By unifying code, data, and compute at every stage, Metaflow ensures seamless, end-to-end management of real-world AI and ML systems.

Today, Metaflow powers thousands of AI and ML experiences across a diverse array of companies, large and small, including Amazon, Doordash, Dyson, Goldman Sachs, Ramp, and many others. At Netflix alone, Metaflow supports over 3000 AI and ML projects, executes hundreds of millions of data-intensive high-performance compute jobs processing petabytes of data and manages tens of petabytes of models and artifacts for hundreds of users across its AI, ML, data science, and engineering teams.

From prototype to production (and back)

Metaflow provides a simple and friendly pythonic API that covers foundational needs of AI and ML systems:

  1. Rapid local prototyping, support for notebooks, and built-in support for experiment tracking, versioning and visualization.
  2. Effortlessly scale horizontally and vertically in your cloud, utilizing both CPUs and GPUs, with fast data access for running massive embarrassingly parallel as well as gang-scheduled compute workloads reliably and efficiently.
  3. Easily manage dependencies and deploy with one-click to highly available production orchestrators with built in support for reactive orchestration.

For full documentation, check out our API Reference or see our Release Notes for the latest features and improvements.

Getting started

Getting up and running is easy. If you don't know where to start, Metaflow sandbox will have you running and exploring in seconds.

Installing Metaflow

To install Metaflow in your Python environment from PyPI:

pip install metaflow

Alternatively, using conda-forge:

conda install -c conda-forge metaflow

Once installed, a great way to get started is by following our tutorial. It walks you through creating and running your first Metaflow flow step by step.

For more details on Metaflow’s features and best practices, check out: - How Metaflow works - Additional resources

If you need help, don’t hesitate to reach out on our Slack community!

Deploying infrastructure for Metaflow in your cloud

While you can get started with Metaflow easily on your laptop, the main benefits of Metaflow lie in its ability to scale out to external compute clusters and to deploy to production-grade workflow orchestrators. To benefit from these features, follow this guide to configure Metaflow and the infrastructure behind it appropriately.

Get in touch

We'd love to hear from you. Join our community Slack workspace!

Contributing

We welcome contributions to Metaflow. Please see our contribution guide for more details.

Extension points exported contracts — how you extend this code

Artifact (Interface)
(no doc)
metaflow/plugins/cards/ui/src/types.ts
DagData (Interface)
(no doc)
metaflow/plugins/cards/ui/src/types.ts
DagStep (Interface)
(no doc)
metaflow/plugins/cards/ui/src/types.ts
CardResponse (Interface)
(no doc)
metaflow/plugins/cards/ui/src/types.ts
CardResponseMetaData (Interface)
(no doc)
metaflow/plugins/cards/ui/src/types.ts

Core symbols most depended-on inside this repo

append
called by 875
metaflow/plugins/cards/component_serializer.py
join
called by 809
metaflow/tutorials/01-playlist/playlist.py
push
called by 611
metaflow/plugins/aws/step_functions/step_functions_client.py
n
called by 447
metaflow/plugins/cards/card_modules/main.js
items
called by 402
metaflow/datastore/task_datastore.py
format
called by 399
metaflow/plugins/cards/card_modules/main.js
get
called by 319
metaflow/plugins/datatools/s3/s3.py
split
called by 312
metaflow/_vendor/importlib_metadata/_text.py

Shape

Method 6,319
Function 5,492
Class 1,583
Route 43
Interface 26

Languages

Python67%
TypeScript33%

Modules by API surface

metaflow/plugins/cards/card_modules/main.js4,455 symbols
metaflow/plugins/argo/argo_workflows.py258 symbols
metaflow/_vendor/typing_extensions.py231 symbols
metaflow/_vendor/v3_6/typing_extensions.py227 symbols
metaflow/_vendor/v3_7/typing_extensions.py201 symbols
metaflow/client/core.py145 symbols
metaflow/_vendor/v3_7/importlib_metadata/__init__.py124 symbols
metaflow/_vendor/v3_6/importlib_metadata/__init__.py124 symbols
metaflow/_vendor/importlib_metadata/__init__.py124 symbols
metaflow/_vendor/click/core.py123 symbols
test/unit/test_graph_structure.py103 symbols
test/unit/test_artifact_serializer.py100 symbols

Dependencies from manifests, versioned

@iconify/svelte3.1.4 · 1×
@sveltejs/vite-plugin-svelte3.0.1 · 1×
@tsconfig/svelte5.0.2 · 1×
@types/node20.10.4 · 1×
@typescript-eslint/eslint-plugin6.14.0 · 1×
cypress15.15.0 · 1×
cypress-svelte-unit-test3.3.4 · 1×
eslint8.55.0 · 1×
eslint-plugin-svelte2.35.1 · 1×
postcss8.5.12 · 1×

For agents

$ claude mcp add metaflow \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact