Book description
- Understand Spark's unified data processing platform
- Run Spark in the Spark shell or on Databricks
- Use and manipulate RDDs
- Deal with structured data using Spark SQL through its operations and advanced functions
- Build real-time applications using Spark Structured Streaming
- Develop intelligent applications with the Spark Machine Learning library
Product information
- Title: Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
- Author(s):
- Release date: August 2018
- Publisher(s): Apress
- ISBN: 9781484235799
You might also like
video
Apache Spark with Java - Learn Spark from a Big Data Guru
This course covers all the fundamentals of Apache Spark with Java and teaches you everything you …
video
Building an End-to-End Batch Data Pipeline with Apache Spark
Explore Big Data architectures and the tools you can leverage to build an end-to-end data platform. …
video
Apache Spark Streaming with Python and PySpark
Spark Streaming is becoming incredibly popular, and with good reason. According to IBM, 90% of the …
book
Scala Programming for Big Data Analytics: Get Started With Big Data Analytics Using Apache Spark
Gain the key language concepts and programming techniques of Scala in the context of big data …