Back to Glossary Index

Dagster Data Engineering Glossary:

Resilient Distributed Dataset (RDD)

A fault-tolerant collection of elements that can be processed in parallel, fundamental data structure of Spark,