Download the e-book primer on how to build data platforms

Learn the fundamental concepts to build a data platform in your organization; covering common design patterns for data ingestion and transformation, data modeling strategies, and data quality tips.

Tips and tricks for data modeling and data ingestion patterns

Explore the benefits of an observation layer across your data pipelines

Learn the key strategies for ensuring data quality for your organization

Latest posts

Orchestrating Nanochat: Training the Models
Orchestrating Nanochat: Training the Models

December 9, 2025

Orchestrating Nanochat: Training the Models

Training an LLM isn’t one job—it’s a sequence of carefully managed stages. This part shows how Dagster coordinates your training steps on RunPod so every experiment is reproducible, scalable, and GPU-efficient.

Orchestrating Nanochat: Building the Tokenizer
Orchestrating Nanochat: Building the Tokenizer

December 3, 2025

Orchestrating Nanochat: Building the Tokenizer

Every great model starts with great data. This first part walks through how to structure ingestion with Dagster, prepare your text corpus, and build a tokenizer that shapes how your model understands the world.

When (and When Not) to Optimize Data Pipelines
When (and When Not) to Optimize Data Pipelines

November 17, 2025

When (and When Not) to Optimize Data Pipelines

Engineers often optimize the wrong parts of their pipelines, here's a profiling-first framework to identify real bottlenecks and avoid the premature optimization trap.