Back to Glossary Index
Dagster Data Engineering Glossary:
Resilient Distributed Dataset (RDD)
A fault-tolerant collection of elements that can be processed in parallel, fundamental data structure of Spark,
Dagster Data Engineering Glossary:
A fault-tolerant collection of elements that can be processed in parallel, fundamental data structure of Spark,