Sollicitatievraag bij Verizon

What is RDD?

Antwoord op sollicitatievraag

Anoniem

10 okt 2017

The main abstraction Spark provides is a resilient distributed dataset (RDD), which is a collection of elements partitioned across the nodes of the cluster that can be operated on in parallel.