See also

Again, be sure to download and explore the Dataset source file, which is about 2500+ lines from GitHub. Exploring the Spark source code is the best way to learn advanced programming in Scala, Scala Annotations, and Spark 2.0 itself.

Noteworthy for Pre-Spark 2.0 users:

  • SparkSession is the single entry ...

Get Apache Spark 2.x Machine Learning Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.