- Documentation for Dataset is available at http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.Dataset
- Documentation for KeyValue grouped Dataset is available at http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.KeyValueGroupedDataset
- Documentation for relational grouped Dataset is available at http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.RelationalGroupedDataset
Again, be sure to download the Dataset source file from GitHub and explore it; it runs to more than 2,500 lines. Exploring the Spark source code is the best way to learn advanced Scala programming, Scala annotations, and Spark 2.0 itself.
Noteworthy for pre-Spark 2.0 users:
- SparkSession is the single entry ...