The core ability of Spark is to operate on data that is distributed in the cluster (RDDs aka Resilient Distributed Datasets). In this post I am giving a reference of the available transformations you can use along with some examples.
Continue reading
Transforming data in Spark
Reply