Building a classification system with Gradient Boosted Trees (GBT) in Spark 2.0

In this recipe, we will explore the Gradient Boosted Tree (GBT) classification implementation in Spark. The GBT requires more care with hyper-parameters and several tries before deciding the final outcome. One must remember that it is completely OK to grow shorter trees if using GBT.

Get Apache Spark 2.x Machine Learning Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.