Features

Sparkling Water provides transparent integration for the H2O engine and its machine learning algorithms into Spark platforms, which enables the following:

  • Use of H2O algorithms in the Spark workflow
  • Transformation between H2O and Spark data structures
  • Use of Spark RDDs and DataFrames as input for H2O algorithms
  • Use of H2O frames as input for MLlib algorithms
  • Transparent execution of Sparkling Water applications on top of Spark

Get Apache Spark for Data Science Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.