October 25, 2022
Build a poor man’s data lake from scratch with DuckDB
DuckDB is so hot right now. Learn how to build a data lake from dbt using DuckDB for SQL transformations, along with Python, Dagster, and Parquet files.
Pete Hunt
Engineering
October 19, 2022
The Unreasonable Effectiveness of Data Pipeline Smoke Tests
Data practitioners waste time writing unit tests to catch bugs they could have caught with smoke tests.
Sandy Ryza
Engineering
October 17, 2022
Web Workers are not the Answer
A tale of overstretched logs, counterintuitive web worker behavior, and ultimately a troublesome cursor issue.
Jordan Sanders
Engineering
October 4, 2022
Postgres: a Better Message Queue than Kafka?
When lots of event logs must be stored and indexed, Kafka is the obvious choice. Naturally, our queue runs on Postgres.
Pete Hunt
Engineering
August 17, 2022
Spend Less Time Debugging with Dagster
It’s not uncommon for a data engineer to devote 80% of their day to debugging. Dagster radically improves on this.
Sandy Ryza
Engineering
August 3, 2022
The Open Core Business Model
The relationship between Dagster, the open-source project, and Dagster Cloud, our hosted SaaS platform.