Connect to everything in your data ecosystem

Dagster integrates with every major tool in the modern data and AI stack—so your orchestrator isn’t the blocker, it’s the bridge.

Explore Integrations

Explore docs, guides, and examples for Dagster’s integrations—spanning orchestration, ingestion, storage, transformation, and beyond.

Airbyte

Orchestrate Airbyte connections and schedule syncs alongside upstream or downstream dependencies.

Anthropic

Integrate Anthropic's Claude AI models into your Dagster pipelines for advanced conversational AI capabilities.

Apache Airflow

Accelerate the migration of Airflow DAGs to Dagster assets with opinionated tooling.

AWS Athena

This integration allows you to connect to AWS Athena and analyze data in Amazon S3 using standard SQL within your Dagster pipelines.

AWS CloudWatch

This integration allows you to send Dagster logs to AWS CloudWatch, enabling centralized logging and monitoring of your Dagster jobs.

AWS ECR

This integration allows you to connect to AWS Elastic Container Registry (ECR), enabling you to manage your container images more effectively in your Dagster pipelines.

AWS EMR

The AWS EMR integration allows you to seamlessly integrate AWS EMR into your Dagster pipelines for petabyte-scale data processing using open-source tools like Apache Spark, Hive, Presto, and more.

AWS Glue

The AWS Glue integration enables you to initiate AWS Glue jobs directly from Dagster, seamlessly pass parameters to your code, and stream logs and structured messages back into Dagster.

AWS Lambda

Using the AWS Lambda integration with Dagster, you can leverage serverless functions to execute external code in your pipelines.

AWS Redshift

Using this integration, you can seamlessly integrate AWS Redshift into your Dagster workflows, leveraging Redshift’s data warehousing capabilities for your data pipelines.

AWS S3

The AWS S3 integration allows data engineers to easily read and write objects to the durable AWS S3 storage, enabling engineers to have a resilient storage layer when constructing their pipelines.

AWS Secrets Manager

This integration allows you to manage, retrieve, and rotate credentials, API keys, and other secrets using AWS Secrets Manager.

AWS Systems Parameter Store

The excerpt for the document is: "The Dagster AWS Systems Manager (SSM) Parameter Store integration allows you to manage and retrieve parameters stored in AWS SSM Parameter Store directly within your Dagster pipelines.

Azure Data Lake Storage Gen 2 (ADLS2)

Get utilities for ADLS2 and Blob Storage.

Bash / Shell

Execute a Bash/shell command, directly or as a read from a script file.

Census

Trigger Census synchs from within your Dagster pipelines.

Cube

Push changes from upstream data sources to Cube's semantic layer.

Databricks

The Databricks integration enables you to initiate Databricks jobs directly from Dagster, seamlessly pass parameters to your code, and stream logs and structured messages back into Dagster.

Datadog

Publish metrics to Datadog from within Dagster ops and entralize your monitoring metrics.

dbt Cloud

Run dbt Cloud™ jobs as part of your data pipeline.

Orchestrate your dbt™ transformation steps

dbt™

Put your dbt transformations to work, directly from within Dagster.

Delta Lake

Integrate your pipelines into Delta Lake.

dlt

Easily ingest and replicate data between systems with dlt through Dagster.

Docker

Run runs external processes in docker containers directly from Dagster.

DuckDB

Read and write natively to DuckDB from Software Defined Assets.

Fivetran

Orchestrate Fivetran connectors and schedule syncs with upstream or downstream dependencies.

GCP BigQuery

Integrate with GCP BigQuery.

GCP Dataproc

Integrate with GCP Dataproc.

GCP GCS

Integrate with GCP GCS.

Gemini

Integrate Google's Gemini AI models into your Dagster pipelines for advanced AI capabilities.

Github

Integrate with GitHub Apps and automate operations within your github repositories.

Great Expectations

Yield an expectation and its output with all relevant metadata.

Hashicorp Vault

Centrally manage credentials and certificates, then use them in your pipelines.

Hex

Work in Hex, then pull Hex apps in to your pipeline as Software Defined Assets.

Hightouch

Trigger syncs and monitor them until they complete.

Jupyter

Dagstermill eliminates the tedious "productionization" of Jupyter notebooks.

Kubernetes

Launch Kubernetes pods and execute external code directly from Dagster.

lakeFS

lakeFS provides version control and complete lineage over the data lake.

Looker

The Looker integration allows you to monitor your Looker instance as assets in Dagster, along with other data assets.

Meltano

Tap into open source configurable ETL+ and the Singer integration library.

Microsoft SharePoint

Connect Dagster with Microsoft SharePoint document libraries using the Graph API. Enable automated file operations, folder management, and data extraction from SharePoint.

Microsoft Teams

Keep your team up to speed with Teams messages.

MLflow

Streamline the process of productionizing, maintaining and monitoring machine learning models.

Modal

Run serverless Python functions at scale with Modal directly from your Dagster pipelines.

OpenAI

Integrate OpenAI calls into your Dagster pipelines, without breaking the bank.

Open Metadata

Configure and schedule Dagster metadata and profiler workflows from the OpenMetadata UI.

Pagerduty

Centralize your monitoring with the dagster-pagerduty integration.

Pandas

Implement validation on pandas DataFrames.

Pandera

Generate Dagster Types from Pandera dataframe schemas.

Polars

Use Polars eager or lazy DataFrames as inputs and outputs in your Dagster assets and ops.

Power BI

The Power BI integration allows you to monitor your Power BI workspace as assets in Dagster, along with other data assets.

Prometheus

Integrate with Prometheus via the prometheus_client library.

Ray

Scale your Python workloads with Ray's distributed computing framework directly from Dagster.

Salesforce

Integrate Salesforce CRM data into your Dagster pipelines. Query records, perform bulk operations, and sync customer data with support for multiple authentication methods.

SDF

Put your SDF transformations to work, directly from within Dagster.

Secoda

Help your team understand metadata from Dagster by adding context in Secoda.

SFTP

High-performance secure file transfer integration with support for parallel transfers, batch operations, and advanced filtering. Built on asyncSSH for optimal performance.

Sigma

The Sigma integration allows you to monitor your Sigma organization as assets in Dagster, along with other data assets.

Slack

Up your notification game and keep stakeholders in the loop.

Sling

Extract and load data from popular data sources to destinations with Sling through Dagster.

SLURM

Bridge high-performance computing with modern data orchestration. Run Dagster assets seamlessly across laptops, CI pipelines, and supercomputers with full observability.