Dagster is a data orchestrator for machine learning, analytics, and ETL.

  • Implement components in any tool, such as Pandas, Spark, SQL, or dbt.
  • Define your pipelines in terms of the data flow between reusable, logical components.
  • Test locally and run anywhere with a unified view of data pipelines and assets.
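
The idea of defining a pipeline as data flow between reusable components can be sketched in plain Python. This is a minimal illustration only and does not use Dagster's API: the component functions (`load_orders`, `filter_large`, `total_amount`), the `PIPELINE` mapping, and the `run_pipeline` helper are all hypothetical names invented for this example, and the order data is made up.

```python
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical reusable components. Each is an ordinary function, so its body
# could just as well call Pandas, Spark, SQL, or dbt.
def load_orders() -> List[dict]:
    # Made-up sample data standing in for a real source.
    return [
        {"id": 1, "amount": 20.0},
        {"id": 2, "amount": 5.5},
        {"id": 3, "amount": 14.5},
    ]

def filter_large(orders: List[dict]) -> List[dict]:
    # Keep only orders of at least 10.0.
    return [o for o in orders if o["amount"] >= 10.0]

def total_amount(orders: List[dict]) -> float:
    return sum(o["amount"] for o in orders)

# The pipeline is described purely as data flow:
# step name -> (component, names of the upstream steps that feed it).
Step = Tuple[Callable[..., Any], List[str]]
PIPELINE: Dict[str, Step] = {
    "load_orders": (load_orders, []),
    "filter_large": (filter_large, ["load_orders"]),
    "total_amount": (total_amount, ["filter_large"]),
}

def run_pipeline(pipeline: Dict[str, Step]) -> Dict[str, Any]:
    """Run every step once, resolving upstream outputs first.

    Assumes the step graph has no cycles; no cycle detection is done.
    """
    results: Dict[str, Any] = {}

    def resolve(name: str) -> Any:
        if name not in results:
            func, upstream = pipeline[name]
            results[name] = func(*(resolve(u) for u in upstream))
        return results[name]

    for name in pipeline:
        resolve(name)
    return results

results = run_pipeline(PIPELINE)
print(results["total_amount"])
```

Because each component is a plain function, it can be tested locally on its own (for example, calling `filter_large` with a hand-written list) before it is wired into a pipeline.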