O'Reilly logo
  • Sam Zeitlin thinks this is interesting:

SchemaRDD

From

Cover of Learning Spark
  • 9. Spark SQL
  • from Learning Spark
  • by Matei Zaharia, Patrick Wendell, Andy Konwinski, Holden Karau
  • Publisher: O'Reilly Media, Inc.
  • Released: February 2015

Note

SchemaRDD is gone, these are dataframes now

this works:
val topTweets = hiveCtx.sql("SELECT text, retweetCount FROM tweets ORDER BY retweetCount LIMIT 10");