January 3, 2024 • 1 minute read •
Podcast: Machine Learning Pipelines Are Still Data Pipelines
- Name
- Sandy Ryza
- Handle
- @s_ryz
In this episode of The Data Stack Show, hosts Eric and Kostas chat with Sandy Ryza, Lead Engineer at Dagster Labs.
Sandy shares insights on data cleaning, data engineering processes, and the need for improved tools. He introduces Dagster, an orchestrator that focuses on assets like tables, datasets, and machine learning models, and contrasts it with traditional workflow systems. He also explains Dagster’s integration with dbt, while also exploring the changing dynamics in data roles, the impact of modern tooling, the potential for increased creativity in the field, and more.
We're always happy to hear your feedback, so please reach out to us! If you have any questions, ask them in the Dagster community Slack (join here!) or start a Github discussion. If you run into any bugs, let us know with a Github issue. And if you're interested in working with us, check out our open roles!
Follow us:
Podcast: Value Driven Data Science - The Impact of Data Science on Data Orchestration
- Name
- Sandy Ryza
- Handle
- @s_ryz