AIDEX
Metaflow logo

Metaflow

by Netflix / Outerbounds

Python framework for building and managing real-life data science and ML projects at scale

Open SourceOpen source (Apache 2.0), free to useOpen Source api
Visit Metaflow

About Metaflow

Metaflow, originally developed at Netflix, is a Python framework for building production data science and ML workflows. It manages compute resources, data versioning, and workflow scheduling while letting data scientists write normal Python code. Metaflow deploys to AWS (Step Functions, Batch) and Kubernetes.

Key Features

  • Python-native workflows
  • Automatic versioning
  • Compute scheduling
  • AWS/K8s integration
  • Data artifact management
  • Resume from failures
  • Parallel execution

Pros

  • Pythonic API
  • Production-tested at Netflix
  • Good data management
  • Simple to start

Cons

  • AWS-centric
  • Smaller community than Airflow
  • Limited UI

Tags

ml-workflowopen-sourcepythonnetflixproduction