Because Spark’s RDDs can be shared across the system, you can freely mix SQL, machine learning, stream processing, and graph processing in the same program. Resilient distributed data sets The ...