Scala 2.11

Scala is a general-purpose programming language that supports both object-oriented and functional programming. Apache Spark is itself written in Scala. However, as described in Chapter 1, The Big Data Ecosystem, Spark applications can be written in a variety of languages, including Java, Scala, Python, and R. Though the pros and cons of Scala versus Python are beyond the scope of this book, Scala is generally faster than Python within the context of data analysis and is naturally more tightly integrated with Spark. Python, however, currently offers a more comprehensive library of advanced third-party data science tools and frameworks and is arguably easier to learn and use. The code examples provided ...
