Jupyter Notebook

Jupyter Notebook is an open source, web-based application for interactive analytics that comes bundled with the Anaconda distribution. Because it is designed for interactive work, it is best suited to ad hoc queries, live simulations, prototyping, and visualizing your data to look for trends and patterns before you develop production-ready data science models. Apache Zeppelin is another open source, web-based notebook used for similar purposes. Notebooks such as Jupyter Notebook and Apache Zeppelin tend to support multiple kernels, meaning that you can work in various general-purpose programming languages, including Python and Scala.
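As a rough illustration of the kind of ad hoc exploration a single notebook cell might hold, the following Python sketch smooths a short series and checks which way it is heading. The readings and the moving_average helper are hypothetical, invented purely for this example, and only the Python standard library is assumed.

```python
from statistics import mean

# Hypothetical daily readings, invented purely for illustration.
readings = [12.0, 13.5, 13.1, 14.8, 15.2, 16.9, 16.4, 18.0]


def moving_average(values, window):
    """Return the simple moving average over a sliding window."""
    return [
        mean(values[i:i + window])
        for i in range(len(values) - window + 1)
    ]


# Smooth out day-to-day noise before looking for a pattern.
smoothed = moving_average(readings, window=3)

# A crude trend check: compare the last smoothed value with the first.
trend = "upward" if smoothed[-1] > smoothed[0] else "flat or downward"

print("smoothed:", [round(v, 2) for v in smoothed])
print("trend:", trend)
```

In a notebook you would typically run a cell like this, inspect the output, adjust the window size, and rerun, which is the quick feedback loop that makes notebooks convenient for prototyping.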

One of the core advantages of notebooks is that they ...

